Commit Graph

10 Commits

Author SHA1 Message Date
Logan Zou
84c0a2d875 finish C6 2025-04-26 21:30:48 +08:00
Logan Zou
ad530bc3ab Create 6.4 高效微调.md 2025-04-26 16:20:29 +08:00
Logan Zou
b9be826700 Delete docs/chapter6/7.2 奖励模型.md 2025-04-26 16:01:13 +08:00
Logan Zou
bf91fa3c86 Update and rename 7.1 强化学习的目标.md to 6.4[WIP] 偏好对齐.md 2025-04-26 16:00:50 +08:00
Logan Zou
106fd678cd finish 6.2 2025-04-25 16:33:27 +08:00
Logan Zou
072f919c10 finish 6.1 2025-04-25 15:43:36 +08:00
Logan Zou
3afc880bc6 finish 6.1.1 2025-04-25 10:50:50 +08:00
Logan Zou
b4327f741a add ch6 code 2025-04-25 10:04:43 +08:00
KMnO4-zx
81bc97f434 修改项目结构+7,4 一部分 2025-04-21 22:15:49 +08:00
Logan Zou
ec7d0ef487 init ch6 2025-04-10 17:54:58 +08:00