model-architecture-diagram

Installation
SKILL.md

Model Architecture Diagram

Workflow

Return only public original diagrams indexed by this skill.

  1. Run the bundled resolver:
python3 skills/model-architecture-diagram/scripts/model_architecture_diagram.py "<model name>"
  1. If the resolver returns kind: existing, return the raw image Markdown it prints and preserve the source attribution line.
  2. If the resolver returns kind: no_match, tell the user that no public original architecture diagram is indexed for that model.

Source Priority

Use references/diagram-index.json as the source of truth. It stores raw GitHub image URLs from:

  • datawhalechina/self-llm
  • CalvinXKY/InfraTech
  • Tongyi-MAI/Z-Image
  • Wan-Video/Wan2.1
  • Wan-Video/Wan2.2
  • Tencent-Hunyuan/HunyuanVideo
  • Tencent-Hunyuan/Hunyuan3D-2
  • brayevalerien/Flux.1-Architecture-Diagram

Prefer detailed implementation, cookbook, or architecture-card diagrams over paper figures. Good sources show module boundaries, dataflow, MoE / attention / cache paths, or model-specific runtime structure rather than only a high-level paper overview. Official repository diagrams and curated implementation diagrams are first choice; paper figures are fallback only when no more detailed public original diagram is indexed.

Do not copy remote image binaries into the skill. Return the raw GitHub URLs so the chat renderer can display the original image.

Existing Diagram Rule

For a direct match, show the original image. Good direct matches include:

  • DeepSeek V3/V3.2/V4, GLM-5, Kimi K2/K2.5, MiniMax M2.5, Qwen3.5, Qwen3-VL, and Step 3.5 Flash from InfraTech.
  • Hunyuan-A13B, Kimi-VL, Qwen3, Qwen3-VL detail flows, MiniMax M2, and Llama 4 architecture/module diagrams from self-llm.
  • Z-Image, Wan2.1, Wan2.2, HunyuanVideo, Hunyuan3D 2.0, and FLUX.1 diffusion architecture/module diagrams from public GitHub sources.

If multiple diagrams match, show all high-confidence matches up to the resolver's default limit. For example, DeepSeek V3 may return the full architecture plus MLA MHA/MQA diagrams.

Hosted Original Diagram Gallery

Do not commit the sgl-cookbook-model-architecture-images/ gallery into the repository. The public-original image set is hosted as a GitHub Release asset and indexed by a GitHub issue.

Current hosted artifact:

The artifact contains 44 public original diagram image files from the indexed upstream repositories, plus a lightweight index.html, index.md, manifest.json, HTML contact sheet, and architecture-audit.md.

To inspect the gallery locally:

curl -L -o /tmp/sgl-cookbook-model-architecture-images-2026-05-02.zip \
  https://github.com/BBuf/AI-Infra-Auto-Driven-SKILLS/releases/download/sgl-cookbook-architecture-images-2026-05-02/sgl-cookbook-model-architecture-images-2026-05-02.zip
unzip -q /tmp/sgl-cookbook-model-architecture-images-2026-05-02.zip -d /tmp
open /tmp/sgl-cookbook-model-architecture-images/index.html

Useful Commands

List known original diagram aliases:

python3 skills/model-architecture-diagram/scripts/model_architecture_diagram.py --list-known

Emit JSON for automation:

python3 skills/model-architecture-diagram/scripts/model_architecture_diagram.py "GLM-5" --format json

References

  • references/diagram-index.json: original diagram link index and aliases.
  • references/source-notes.md: audited source repositories and local cache paths.
Related skills

More from bbuf/sglang-auto-driven-skills

Installs
2
GitHub Stars
202
First Seen
9 days ago