Lordog/dive-into-llms

GitHub: Lordog/dive-into-llms

该教程提供大模型从入门到安全对齐的编程实践与教学资源，解决教与学大模型方法分散、缺乏实战案例的问题。

Stars: 43300 | Forks: 5227

## 💡 Updates 2025/06/06 感谢各位朋友们的关注和积极反馈！我们从以下两个方面对本教程进行了更新： - [x] 上线国产化《大模型开发全流程》公益教程（含PPT、实验手册和视频），此处特别感谢华为昇腾社区的支持！ - [x] 在原系列编程实践教程的基础上进行内容更新，并增加了新的主题（数学推理、GUI Agent、大模型对齐、隐写术等）！ ## 🎯 项目动机《动手学大模型》系列编程实践教程，由上海交通大学《自然语言处理前沿技术》（NIS8021）、《人工智能安全技术》课程（NIS3353）讲义拓展而来（教师：[张倬胜](https://bcmi.sjtu.edu.cn/home/zhangzs/)），旨在提供大模型相关的入门编程参考。本教程属公益性质、完全免费。通过简单实践，帮助同学们快速入门大模型，更好地开展课程设计或学术研究。 ## 📚 教程目录 | 教程内容 | 简介 | 地址 | | ---------------- | ------------------------------------------------------------ | ------------------------------------------------------------ | | 微调与部署 | 预训练模型微调与部署指南：想提升预训练模型在指定任务上的性能？让我们选择合适的预训练模型，在特定任务上进行微调，并将微调后的模型部署成方便使用的Demo！ | [[课件](https://github.com/Lordog/dive-into-llms/tree/main/documents/chapter1/dive-into-llm.pdf)] [[教程](https://github.com/Lordog/dive-into-llms/tree/main/documents/chapter1/README.md)] [[脚本](https://github.com/Lordog/dive-into-llms/tree/main/documents/chapter1/dive-tuning.ipynb)] | | 提示学习与思维链 | 大模型的API调用与推理指南：“AI在线求鼓励？大模型对一些问题的回答令人大跌眼镜，但它可能只是想要一句「鼓励」” | [[课件](https://github.com/Lordog/dive-into-llms/tree/main/documents/chapter2/dive-into-prompting.pdf)] [[教程](https://github.com/Lordog/dive-into-llms/tree/main/documents/chapter2/README.md)] [[脚本](https://github.com/Lordog/dive-into-llms/tree/main/documents/chapter2/dive-prompting.ipynb)] | | 知识编辑 | 语言模型的编辑方法和工具：想操控语言模型在对指定知识的记忆？让我们选择合适的编辑方法，对特定知识进行编辑，并将对编辑后的模型进行验证！ | [[课件](https://github.com/Lordog/dive-into-llms/blob/main/documents/chapter3/dive_edit_0410.pdf)] [[教程](https://github.com/Lordog/dive-into-llms/tree/main/documents/chapter3/README.md)] [[脚本](https://github.com/Lordog/dive-into-llms/tree/main/documents/chapter3/dive_edit.ipynb)] | | 数学推理 | 如何让大模型学会数学推理？让我们快速蒸馏一个迷你R1！ | [[课件](https://github.com/Lordog/dive-into-llms/blob/main/documents/chapter4/math.pdf)] [[教程](https://github.com/Lordog/dive-into-llms/tree/main/documents/chapter4/README.md)] [[脚本](https://github.com/Lordog/dive-into-llms/tree/main/documents/chapter4/sft_math.ipynb)] | | 模型水印 | 语言模型的文本水印：在语言模型生成的内容中嵌入人类不可见的水印 | [[课件](https://github.com/Lordog/dive-into-llms/blob/main/documents/chapter5/watermark.pdf)] [[教程](https://github.com/Lordog/dive-into-llms/tree/main/documents/chapter5/README.md)] [[脚本](https://github.com/Lordog/dive-into-llms/tree/main/documents/chapter5/watermark.ipynb)] | | 越狱攻击 | 想要得到更好的安全，要先从学会攻击开始。让我们了解越狱攻击如何撬开大模型的嘴！ | [[课件](https://github.com/Lordog/dive-into-llms/blob/main/documents/chapter6/dive-Jailbreak.pdf)] [[教程](https://github.com/Lordog/dive-into-llms/tree/main/documents/chapter6/README.md)] [[脚本](https://github.com/Lordog/dive-into-llms/tree/main/documents/chapter6/dive-jailbreak.ipynb)] | | 大模型隐写 | “看不见的墨水”！想让大模型在流畅回答的同时，悄悄携带只有“自己人”能识别的信息吗？大模型隐写告诉你！ | [[课件](https://github.com/Lordog/dive-into-llms/blob/main/documents/chapter7/stega.pdf)] [[教程](https://github.com/Lordog/dive-into-llms/tree/main/documents/chapter7/README.md)] [[脚本](https://github.com/Lordog/dive-into-llms/tree/main/documents/chapter7/llm_stega.ipynb)] | | 多模态模型 | 作为能够更充分模拟真实世界的多模态大语言模型，其如何实现更强大的多模态理解和生成能力？多模态大语言模型是否能够帮助实现AGI？ | [[课件](https://github.com/Lordog/dive-into-llms/blob/main/documents/chapter8/mllms.pdf)] [[教程](https://github.com/Lordog/dive-into-llms/tree/main/documents/chapter8/README.md)] [[脚本](https://github.com/Lordog/dive-into-llms/tree/main/documents/chapter8/mllms.ipynb)] | | GUI智能体 | 想要饭来张口、解放双手？那么让我们一起来让AI Agent替你点外卖、回消息、购物比价吧！ | [[课件](https://github.com/Lordog/dive-into-llms/blob/main/documents/chapter9/GUIagent.pdf)] [[教程](https://github.com/Lordog/dive-into-llms/tree/main/documents/chapter9/README.md)] [[脚本](https://github.com/Lordog/dive-into-llms/tree/main/documents/chapter9/GUIagent.ipynb)] | | 智能体安全 | 大模型智能体迈向了未来操作系统之旅。然而，大模型在开放智能体场景中能意识到风险威胁吗？ | [[课件](https://github.com/Lordog/dive-into-llms/blob/main/documents/chapter10/dive-into-safety.pdf)] [[教程](https://github.com/Lordog/dive-into-llms/tree/main/documents/chapter10/README.md)] [[脚本](https://github.com/Lordog/dive-into-llms/tree/main/documents/chapter10/agent.ipynb)] | | RLHF安全对齐 | 基于PPO的RLHF实验指南：本教程”十分危险“，阅读后请检查你的大模型是否在冷笑。 | [[课件](https://github.com/Lordog/dive-into-llms/blob/main/documents/chapter11/RLHF.pdf)] [[教程](https://github.com/Lordog/dive-into-llms/tree/main/documents/chapter11/README.md)] [[脚本](https://github.com/Lordog/dive-into-llms/tree/main/documents/chapter11/RLHF.ipynb)] | ## 🔥 新上线：国产化《大模型开发全流程》 - **✨ 我们联合华为昇腾推出的《大模型开发全流程》公益教程正式上线！前沿技术+代码实践，手把手带你玩转AI大模型 ✨**: 在《动手学大模型》原系列教程的基础上，我们联合华为开发了《大模型开发全流程》系列课程。本系列教程基于昇腾基础软硬件开发，覆盖PPT、实验手册、视频等教程形式。该教程分为初级、中级、高级系列，面向不同的大模型实践需求，旨在将前沿技术通过代码实践的方式，为相关研究者、开发者由浅入深地提供快速上手、应用昇腾已支持模型和全新模型迁移调优的全流程开发指南。 - **🚀 前往昇腾社区探索《大模型开发全流程》系列课程**： 👉《[大模型开发学习专区](https://www.hiascend.com/edu/growth/lm-development#classification-floor-1)》@ 昇腾社区 👈 - **✨ 课程内容展示 ✨**