From f9fe12d99a39e43efccbc8ec41c1ef97d79bfc2c Mon Sep 17 00:00:00 2001 From: KMnO4-zx <1021385881@qq.com> Date: Sun, 25 May 2025 00:02:24 +0800 Subject: [PATCH] =?UTF-8?q?docs=EF=BC=9Aadd=20docsify=20deploy?= MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit --- README.md | 2 +- docs/.nojekyll | 0 docs/README.md | 99 +++++++++++++++--- docs/_sidebar.md | 17 +-- docs/chapter2/第二章 Transformer架构.md | 14 +-- docs/chapter3/第三章 预训练语言模型.md | 30 +++--- docs/chapter4/第四章 大语言模型.md | 10 +- docs/chapter5/第五章 动手搭建大模型.md | 4 +- docs/chapter6/第六章 大模型训练流程实践.md | 18 ++-- docs/chapter7/第七章 大模型应用.md | 18 ++-- .../figures => images/2-figures}/1-0.png | Bin .../figures => images/2-figures}/1-1.png | Bin .../figures => images/2-figures}/1-2.png | Bin .../figures => images/2-figures}/1-3.jpeg | Bin .../figures => images/2-figures}/2-0.jpg | Bin .../figures => images/2-figures}/3-0.png | Bin .../figures => images/2-figures}/3-1.png | Bin .../figures => images/3-figures}/1-0.png | Bin .../figures => images/3-figures}/1-1.png | Bin .../figures => images/3-figures}/1-2.png | Bin .../figures => images/3-figures}/1-3.png | Bin .../figures => images/3-figures}/1-4.png | Bin .../figures => images/3-figures}/1-5.png | Bin .../figures => images/3-figures}/2-0.png | Bin .../figures => images/3-figures}/2-1.png | Bin .../figures => images/3-figures}/2-2.png | Bin .../figures => images/3-figures}/2-3.png | Bin .../figures => images/3-figures}/2-4.png | Bin .../figures => images/3-figures}/3-0.png | Bin .../figures => images/3-figures}/3-1.png | Bin .../figures => images/3-figures}/3-2.png | Bin .../figures => images/3-figures}/3-3.png | Bin .../figures => images/4-figures}/2-0.jpg | Bin .../figures => images/4-figures}/2-1.jpg | Bin .../figures => images/4-figures}/2-2.jpg | Bin .../figures => images/4-figures}/2-3.png | Bin .../figures => images/4-figures}/2-4.jpg | Bin docs/images/5-images/pretrain_dataset.png | Bin 0 -> 23499 bytes docs/images/5-images/sftdataset.png | Bin 0 -> 25838 bytes docs/images/6-images/1-1.png | Bin 0 -> 353320 bytes docs/images/6-images/1-2.png | Bin 0 -> 652820 bytes docs/images/6-images/1-3.png | Bin 0 -> 250193 bytes docs/images/6-images/1-4.png | Bin 0 -> 220719 bytes docs/images/6-images/1-5.png | Bin 0 -> 353275 bytes docs/images/6-images/1-6.png | Bin 0 -> 111942 bytes docs/images/6-images/1-7.png | Bin 0 -> 502182 bytes docs/images/6-images/3-1.png | Bin 0 -> 197266 bytes docs/images/6-images/3-2.jpg | Bin 0 -> 60341 bytes docs/images/6-images/7.1-1.png | Bin 0 -> 21783 bytes .../7-images/7-1-Open LLM Leaderboard.png | Bin 0 -> 182050 bytes .../7-1-lmsys Chatbot Arena Leaderboard.png | Bin 0 -> 210640 bytes docs/images/7-images/7-1-opencompass.png | Bin 0 -> 210370 bytes docs/images/7-images/7-1-垂直领域榜单.png | Bin 0 -> 271293 bytes docs/images/7-images/7-2-rag.png | Bin 0 -> 409953 bytes docs/images/7-images/7-2-tinyrag.png | Bin 0 -> 579120 bytes docs/images/7-images/7-3-Agent工作原理.png | Bin 0 -> 445253 bytes docs/images/7-images/7-3-Tiny_Agent.jpg | Bin 0 -> 103730 bytes .../images/7-images/7-3-tinyagent-example.png | Bin 0 -> 777131 bytes docs/images/datawhale.png | Bin 0 -> 115785 bytes docs/images/head.jpg | Bin 0 -> 168496 bytes docs/index.html | 57 ++++++++++ docs/前言.md | 25 +++++ 62 files changed, 225 insertions(+), 69 deletions(-) create mode 100644 docs/.nojekyll rename docs/{chapter2/figures => images/2-figures}/1-0.png (100%) rename docs/{chapter2/figures => images/2-figures}/1-1.png (100%) rename docs/{chapter2/figures => images/2-figures}/1-2.png (100%) rename docs/{chapter2/figures => images/2-figures}/1-3.jpeg (100%) rename docs/{chapter2/figures => images/2-figures}/2-0.jpg (100%) rename docs/{chapter2/figures => images/2-figures}/3-0.png (100%) rename docs/{chapter2/figures => images/2-figures}/3-1.png (100%) rename docs/{chapter3/figures => images/3-figures}/1-0.png (100%) rename docs/{chapter3/figures => images/3-figures}/1-1.png (100%) rename docs/{chapter3/figures => images/3-figures}/1-2.png (100%) rename docs/{chapter3/figures => images/3-figures}/1-3.png (100%) rename docs/{chapter3/figures => images/3-figures}/1-4.png (100%) rename docs/{chapter3/figures => images/3-figures}/1-5.png (100%) rename docs/{chapter3/figures => images/3-figures}/2-0.png (100%) rename docs/{chapter3/figures => images/3-figures}/2-1.png (100%) rename docs/{chapter3/figures => images/3-figures}/2-2.png (100%) rename docs/{chapter3/figures => images/3-figures}/2-3.png (100%) rename docs/{chapter3/figures => images/3-figures}/2-4.png (100%) rename docs/{chapter3/figures => images/3-figures}/3-0.png (100%) rename docs/{chapter3/figures => images/3-figures}/3-1.png (100%) rename docs/{chapter3/figures => images/3-figures}/3-2.png (100%) rename docs/{chapter3/figures => images/3-figures}/3-3.png (100%) rename docs/{chapter4/figures => images/4-figures}/2-0.jpg (100%) rename docs/{chapter4/figures => images/4-figures}/2-1.jpg (100%) rename docs/{chapter4/figures => images/4-figures}/2-2.jpg (100%) rename docs/{chapter4/figures => images/4-figures}/2-3.png (100%) rename docs/{chapter4/figures => images/4-figures}/2-4.jpg (100%) create mode 100644 docs/images/5-images/pretrain_dataset.png create mode 100644 docs/images/5-images/sftdataset.png create mode 100644 docs/images/6-images/1-1.png create mode 100644 docs/images/6-images/1-2.png create mode 100644 docs/images/6-images/1-3.png create mode 100644 docs/images/6-images/1-4.png create mode 100644 docs/images/6-images/1-5.png create mode 100644 docs/images/6-images/1-6.png create mode 100644 docs/images/6-images/1-7.png create mode 100644 docs/images/6-images/3-1.png create mode 100644 docs/images/6-images/3-2.jpg create mode 100644 docs/images/6-images/7.1-1.png create mode 100644 docs/images/7-images/7-1-Open LLM Leaderboard.png create mode 100644 docs/images/7-images/7-1-lmsys Chatbot Arena Leaderboard.png create mode 100644 docs/images/7-images/7-1-opencompass.png create mode 100644 docs/images/7-images/7-1-垂直领域榜单.png create mode 100644 docs/images/7-images/7-2-rag.png create mode 100644 docs/images/7-images/7-2-tinyrag.png create mode 100644 docs/images/7-images/7-3-Agent工作原理.png create mode 100644 docs/images/7-images/7-3-Tiny_Agent.jpg create mode 100644 docs/images/7-images/7-3-tinyagent-example.png create mode 100644 docs/images/datawhale.png create mode 100644 docs/images/head.jpg create mode 100644 docs/index.html create mode 100644 docs/前言.md diff --git a/README.md b/README.md index da346f4..7793411 100644 --- a/README.md +++ b/README.md @@ -1,5 +1,5 @@
+
+ 深入理解 LLM 核心原理,动手实现你的第一个大模型
+
+ 扫描二维码关注 Datawhale 公众号,获取更多优质开源内容
+⭐ 如果这个项目对你有帮助,请给我们一个 Star!
+
+
图2.1 前馈神经网络
+
图2.2 卷积神经网络
+
图2.3 循环神经网络
+
图2.4 多头注意力机制
+
图2.5 编码器-解码器结构
+
图2.6 编码结果
+
图2.7 Transformer 模型结构
+
图3.1 BERT 模型结构
+
图3.2 BERT 模型简略结构
+
图3.3 prediction_heads 结构
+
图3.4 Encoder Layer 结构
+
图3.5 Intermediate 结构
+
图3.6 BERT 注意力机制结构
+
图3.7 T5 模型详细结构
+
图3.8 T5 模型整体结构
+
图3.9 Encoder 和 Decoder
+
图3.10 Self-Attention 结构
+
图3.11 T5 的大一统思想
+
图3.12 GPT 模型结构
+
图3.13 LLaMA-3 模型结构
+
图3.14 alt text
+
图3.15 alt text
+
图4.1 训练 LLM 的三个阶段
+
图4.2 模型、数据并行
+
图4.3 模型并行
+
图4.4 ChatGPT 训练三个的阶段
+
图4.5 PPO 训练流程
+
图5.1 预训练损失函数计算
+
图5.2 SFT 损失函数计算
+
图6.1 Hugging Face Transformers
+
图6.2 Hugging Face Transformers 模型社区
+
图6.3 Qwen-2.5-1.5B
+
图6.4 Qwen-2.5-1.5B config.json 文件
+
图6.5 模型下载标识
+
图6.6 模型结构输出结果
+
图6.7 数据集展示
+
图6.8 Adapt Tuning
+
图6.9 LoRA
+
图 7.1 Open LLM Leaderboard
+
图7.2 Lmsys Chatbot Arena Leaderboard
+
图7.3 OpenCompass
+
图7.4 垂直领域榜单
+
图7.5 TinyRAG 项目结构
+
图7.6 RAG 流程图
+
图7.7 Agent 工作原理
+
图7.8 效果示意图
+
图7.9 Agent 工作流程
}dzA>o(X=>-usBrfsI
zv;mw3_l=VGh`2NXd-x4?gSZui`RcgacM<#11H-+sAr){&CVsDqY;qH^()Yn-n~rMz
zH!3TEVifm=*iL=APSA!^L|S8hZ}T@*-5l$UaWDJp78Q6pnq#QK+?|#2-5&>hJ&X@@
zcxLHfpmTZa!Q2j;M%xEo9~D)_-nIdS>^zYOh9NL=^h~?qJs#o367@?c+dp;gAy+L{
zDl=WKn1A#8HB%H&!Q^III$_&xkG$8`ms_j0b_CCgC*0x_{1@#XZgylTd cVTb)=bRDF6aAbU+
z;@xR)84raBbeC!@7xSaFM^ZjzVUVL=$M8KKA9I|Xc9`Cj4YYopf3_oo_<@%f0&a`Z
z1IU`Q&ppk0SNU( 1IG-z>eIh5-lz
zE_;H9(V(6Httsxxt$sfOeuW a3`V!N)_1b_Bz{VFv5+cJoe;UdEJ30XL2rM^X^KTLuGJdHzID8N2u-SjZ
zb;>Kb_2PcHIdMH-%W~jN->st2Bd30n8J8(CkOPgXx+L ?hg0+J<60<6L~%;MBj8}vY6g>a%qq?Gg^~5sM!BH{
zr$Zlti!bw4r+PXtI0fu;^((Ue@^F3FKRS?ALcC{7Nxbx?3vKD<==ud)
zNs(Olh#m-S?X?nX{4cQ8ssC@4iciX3ot&kk^K-{z72px)S0i5LXf7_bCQhdG#rD_E
zxA|Ux?7(8ovzb%Lt}9Q%Nz!dOwtP5J^7k0^FWG=hv{ZwPr{%x{29S}SN9eDb5h&l5
za~%-{B0xm}61Y4>^QHQ<&fN-m6ZIr7?B?RN;;3jnWnJTlZ`B&~L(E=4#PCO1c6${u
zyXH|JQz1*Uy<1JgHX?@i>Mo!G7UkXCwS+!7RENpSvCVEU3B692zSXmePYu4-VZ{qw
zd4zs?KL~#po(@+oRV>!)`#q3!kNR$aE(lwnlae{d-w(B}?EaE{gJd3srN
zF%*-aOIS<0KzvtKNsUn9poeq;B(t}mWBiDuOII?j!|d1RRPfC?OE|d52z=SPJznCL
z|4m-M8IOtii<>D!E(|y}fEfBog5=TMo0
kHu0TgQu~575mFUZVCcCCzxU#R%r^6);B%=vO7Wy)fojq+zJKIIR2TrIoy>
zGT-Amu29c`=y#~upO#37j!{zpDi*x%(hgZ^j4HreXw%J}Jp>PU)2e6#u1)muKJUlu
zsv}LP{$yr}b>-r%hNArsS*=zYlJZ|+1{
Lojp#}$r4bsqX2D4qE47ua2+Bj1sHvyd;|{>5-jnP2_}N;c=NKN
z%Gf=3u|9zI88-iFwtz@$fY%tmX6?Nts?2
Va?QpFJPSccX2`X!ner9g
z1Rii*ioH8V;0H;mT<|#UUb2;9198Dx%xPQjLq3^z&!mFpf?fJP)Gs)pB+${DE~D#{
zA(mcX^V9=TJgYLofQ@~kXpg?=SQ+e`j?qD%F=ue4q`#