CANN Recipes
Docs

Training Recipes

面向 LLM / 多模态训练优化的实践合集。

文档正文

Training Recipes

面向 LLM / 多模态训练优化的实践合集。

Repo: https://gitcode.com/cann/cann-recipes-train

Featured Recipes

Card Level Description Link
Qwen2.5 RL (Starter) Beginner 单卡 Atlas A2 入门样例,使用 verl 在数学推理数据集上训练 https://gitcode.com/cann/cann-recipes-train/-/blob/master/llm_rl/qwen2_5/verl_npu_demo/README.md
Qwen3 Tool Agent RL Intermediate 端到端 agent RL 训练,启用 asyncLLM 与 agent_loop https://gitcode.com/cann/cann-recipes-train/-/blob/master/agent_rl/qwen3_tool_agent/README.md
DeepSeek-R1 RL Advanced veRL + MindSpeed + vLLM-Ascend,在 Atlas A3 上实现 GRPO 高吞吐训练 https://gitcode.com/cann/cann-recipes-train/-/blob/master/llm_rl/deepseek/README.md
Qwen3-235B-A22B RL Advanced 2k+32k 长序列 GRPO/DAPO 训练优化 https://gitcode.com/cann/cann-recipes-train/-/blob/master/llm_rl/qwen3/README.md
Qwen3-32B SAM Advanced RL 训练场景启用 SAM 投机推理,性能提升约 10% https://gitcode.com/cann/cann-recipes-train/-/blob/master/llm_rl/qwen3/README.md

Related Features

Card Description Link
SAM Speculative Decoding SAM 无损投机推理 https://gitcode.com/cann/cann-recipes-train/-/blob/master/docs/features/sam_speculative_decoding.md
Rollout Rebalance RL 推理阶段调度与均衡 https://gitcode.com/cann/cann-recipes-train/-/blob/master/docs/features/rollout_rebalance.md