Leoda

Training optimization

Jul 02, 2026 · 1 min read

DModel project

  • Initial survey of DModel: 0. goin ✅ 2026-06-30
  • Review the current EP overlap in Megatron: 1. MoE EP Overlap ✅ 2026-06-30
  • Write up a detailed design for overlapping DModel's backward Wgrad computation with the dispatch backward communication: 1. backward ep overlap design
