rdna4
Here are 23 public repositories matching this topic...
Windows-only version of ComfyUI that uses AMD's official ROCm and PyTorch libraries for better performance on AMD GPUs. Includes auto-installation and popular performance-enhancing packages such as triton, sage-attention, flash-attention, and bitsandbytes.
Updated Jun 28, 2026 - Python
The intelligent OptiScaler installer Linux gamers needed. Automates FSR4, XeSS & DLSS configuration with GPU-optimized profiles for RDNA3/4, Arc & RTX cards.
Updated Jan 27, 2026 - Shell
Run Gemma-4-31B at full 256K context on a $1,400 AMD RDNA4 GPU (gfx1201): TurboQuant KV cache + HIP-graph-safe Flash-Attention for llama.cpp, fully measured on real hardware.
Updated Jun 22, 2026 - Python
llama.cpp with native AMD RDNA4 (gfx1201) ROCm 7.11 support - 98.97 tok/s AI inference, competitive with RTX 4070 Ti, 32GB VRAM
Updated Jan 3, 2026 - C++
Fine-tune your own LLM on an AMD Radeon GPU — the easy, tested way. QLoRA via ROCm on Windows/WSL2 & Linux, a worked Gemma-4 example, a reusable live training dashboard, and a smoke test that proves the loss falls.
Updated Jun 20, 2026 - HTML
RAM-Backed MCP Memory Architecture for Consumer LLM Inference — 900K token context on 16GB VRAM
Updated Mar 27, 2026 - Python
AWQ/W4A16 quantization, benchmarking and public release of Polish LLMs (PLLuM, Bielik) on AMD RDNA 4 (ROCm + vLLM-native) — reproducible methodology and hardware-envelope studies.
Updated Jun 21, 2026 - Python
Real-time dynamic GPU undervolting tool for AMD Radeon RX 9070 XT (RDNA4) on Windows. Automatically switches voltage offsets based on live core clock via AMD's official ADLX SDK — no static slider limitations. Features GUI dashboard, CLI controller, crash logger with TDR detection, and full preset management.
Updated Mar 18, 2026 - Python
Fork of xenia-edge (has207/xenia-edge) with critical bug fixes: posix_spawn replaces fork+exec to avoid AppImage FUSE unmount race on Linux, in_process_title_relaunch takes precedence over GameMode, improved filesystem path resolution with proper XDG/AppImage support, full codebase audit with 30+ bug fixes across CPU/GPU/kernel/UI/Android.
Updated Jun 22, 2026 - C++
Fixes and patches for NVlabs/nvdiffrast to support AMD ROCm 7.1 and Wave64 architectures (gfx1100, gfx1201).
Updated May 1, 2026 - Shell
Run PyTorch natively on Windows 11 with an AMD RX 9070 XT (RDNA4 / gfx1201) on stable ROCm 7.2.1 — no WSL2, no Linux, no ZLUDA. Exact pinned wheel URLs, runtime env vars, documented RDNA4 pitfalls (broken nightlies, no xformers/flash-attn/bitsandbytes), and real benchmarks for ComfyUI, SD, LLMs, RVC/TTS. Verified on one 9070 XT.
Updated Jun 16, 2026 - Batchfile