🚀 [从零构建 LLM] 极简大模型训练原理与实践指南。包含 Transformer, Pretraining, SFT 核心代码与对照实验。 | A minimal, principle-first guide to understanding and building LLMs from scratch.
-
Updated
Jun 4, 2026 - Python
🚀 [从零构建 LLM] 极简大模型训练原理与实践指南。包含 Transformer, Pretraining, SFT 核心代码与对照实验。 | A minimal, principle-first guide to understanding and building LLMs from scratch.
逐行对照 MiniMind 源码精读、并延伸到大模型技术体系的中文学习笔记 —— 预训练 / SFT / DPO / PPO / GRPO、训练机制、MiniMind2→3 版本对照、真实实验证据。
The Cowork Agent for Everything — trainable advertising AI + 14 platform MCP servers + agent skills. Based on minimind (42k stars). Train from zero in 2 hours.
练习时长两年半的 AI 大模型 (实际 26M params,2.5B = 两年半) | ikun meme-culture chatbot 🐔🏀
A beginner-friendly VLA project starter: CPU smoke, ACT + PushT-style imitation learning, rollout eval, and resume-ready docs.
从0到1手写的中文小型大语言模型 — minimind 风格教学项目:RMSNorm·RoPE·GQA·SwiGLU,完整 tokenizer→pretrain→SFT 管线,单张 RTX 3060 可训。A from-scratch Chinese LLM for learning.
📝 Enhance your understanding of large model training with detailed, clear annotations for the MiniMind project, tailored for Chinese learners.
Add a description, image, and links to the minimind topic page so that developers can more easily learn about it.
To associate your repository with the minimind topic, visit your repo's landing page and select "manage topics."