Commit Graph

14 Commits

Author SHA1 Message Date
KMnO4-zx
f909cd1a87 docs:修改内容结构 && update readme 2025-06-03 18:52:33 +08:00
KMnO4-zx
f9fe12d99a docs:add docsify deploy 2025-05-25 00:02:24 +08:00
KMnO4-zx
c16ee23323 docs:第六章 大模型训练流程实践 图片格式 参考格式修改 2025-05-13 20:42:51 +08:00
Logan Zou
360dd41c56 update C6 2025-04-26 21:52:12 +08:00
Logan Zou
84c0a2d875 finish C6 2025-04-26 21:30:48 +08:00
Logan Zou
ad530bc3ab Create 6.4 高效微调.md 2025-04-26 16:20:29 +08:00
Logan Zou
b9be826700 Delete docs/chapter6/7.2 奖励模型.md 2025-04-26 16:01:13 +08:00
Logan Zou
bf91fa3c86 Update and rename 7.1 强化学习的目标.md to 6.4[WIP] 偏好对齐.md 2025-04-26 16:00:50 +08:00
Logan Zou
106fd678cd finish 6.2 2025-04-25 16:33:27 +08:00
Logan Zou
072f919c10 finish 6.1 2025-04-25 15:43:36 +08:00
Logan Zou
3afc880bc6 finish 6.1.1 2025-04-25 10:50:50 +08:00
Logan Zou
b4327f741a add ch6 code 2025-04-25 10:04:43 +08:00
KMnO4-zx
81bc97f434 修改项目结构+7,4 一部分 2025-04-21 22:15:49 +08:00
Logan Zou
ec7d0ef487 init ch6 2025-04-10 17:54:58 +08:00