docs: add gfx1200 (Navi 44) alongside gfx1201 for RDNA4 support#1386
Open
0xDELUXA wants to merge 1 commit into
Open
docs: add gfx1200 (Navi 44) alongside gfx1201 for RDNA4 support#13860xDELUXA wants to merge 1 commit into
0xDELUXA wants to merge 1 commit into
Conversation
Author
|
Unfortunately, I can’t properly test gfx1200 support locally since I don’t use Linux, and aiter doesn’t have Windows support: ROCm/aiter#2869. |
gfx1200 (Navi 44: RX 9060 / RX 9060 XT) is the sibling RDNA4 chip to gfx1201 (Navi 48: RX 9070 / RX 9070 XT / AI PRO R9700) and rides the same Triton fallback path. The only difference is the arch string passed to GPU_ARCHS when building aiter. Name gfx1200 wherever the arch is declared or built: - README news entry - Ministral-3-8B build-aiter prerequisite note - Qwen3-8B-FP8 build-aiter prerequisite note + rmsnorm_quant JIT note Benchmark tables remain labeled as gfx1201-measured; no results are claimed on gfx1200 hardware.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Motivation
The README News entry and the two RDNA4 recipes added in #811 name only gfx1201 (Navi 48 - RX 9070 / RX 9070 XT / AI PRO R9700). Its sibling RDNA4 chip gfx1200 (Navi 44 - RX 9060 / RX 9060 XT) rides the exact same code path but is never mentioned, so a 9060 / 9060 XT owner has no signal that ATOM covers their card. This PR names gfx1200 alongside gfx1201 everywhere the arch is declared or built, so the docs match the support that already exists.
Technical Details
This is intentionally docs-only, mirroring how gfx1201 itself shipped.
#811 deliberately removed every arch-specific dispatch from ATOM Python (the
_is_gfx1201/_detect_gfx1201gates, theATOM_NATIVE_TRITON_ATTNenv var, theselector.pyallowlists, thescripts/gfx1201/config workaround) and pushed the fallbacks down into aiter, leaving ATOM 100% capability-driven. There is therefore no arch gate in ATOM to extend for a new RDNA4 chip - the only place the arch is named is the documentation.The kernel-level RDNA4 support gfx1201 relies on already covers gfx1200. The aiter PRs #811 depends on were written for the RDNA4 family and name gfx1200 explicitly alongside gfx1201:
So the only per-chip difference is the
GPU_ARCHSstring when building aiter (ROCm/aiter#3846) and typically less VRAM.Changes:
Navi 4 (RDNA4 / gfx1200, gfx1201)and lists the RX 9060 / RX 9060 XT (Navi 44) cards.GPU_ARCHS=gfx1200as well asgfx1201.rmsnorm_quantJIT note.Test Plan
git grepthat nogfx1201literal exists in ATOM Python/Rust onmain(onlygfx1250/gfx94x/gfx95xare special-cased), so this change cannot touch a code path - it is documentation only.mainand that the rendered Markdown links/anchors are unchanged.Test Result
Docs-only change, so no code paths are exercised. atom/aiter has no Windows support and I'm a Windows-only user, so I can't run this end-to-end locally. The benchmark/accuracy tables are therefore left labeled gfx1201-measured and no numbers are claimed for gfx1200 - the change is scoped to "same path, same build prerequisite." Happy for a maintainer with gfx1200 hardware to verify, and I'll add a results block if it reproduces.
Submission Checklist