rdna4

Here are 23 public repositories matching this topic...

zolotukhin / zinc

Zig INferenCe Engine — Local LLM inference on AMD GPUs and Apple Silicon

amd gpu zig pytorch transformer openai gpt amdgpu rdna3 rdna4 qwen3

Updated Jun 28, 2026
Zig

Windows-only version of ComfyUI which uses AMD's official ROCm and PyTorch libraries to get better performance with AMD GPUs. [auto-installation and popular performance enhancing packages like triton * sage-attention * flash-attention * bitsandbytes included ]

windows triton rdna rocm miopen bitsandbytes flash-attention rdna3 rdna2 rdna4 sage-attention rdna1

Updated Jun 28, 2026
Python

ind4skylivey / 0ptiscaler4linux

Star

The intelligent OptiScaler installer Linux gamers needed. Automates FSR4, XeSS & DLSS configuration with GPU-optimized profiles for RDNA3/4, Arc & RTX cards.

vulkan proton linux-tools shell-scripting upscaling mesa gaming-performance gpu-optimization dlss linux-gaming xess steam-deck rdna3 frame-generation rdna4 optiscaler fsr4 amd-fsr

Updated Jan 27, 2026
Shell

GPUOpen-Tools / isa_spec_manager

Star

Utilities for accessing AMD's Machine-Readable GPU ISA Specifications.

amd gpu specification isa rdna cdna mi300 rdna3 rdna2 cdna3 cdna2 rdna4

Updated Apr 9, 2026
C++

maeddesg / vulkanforge

Star

LLM inference engine for AMD RDNA4 — Rust + Vulkan compute shaders, gguf & native FP8.

rust machine-learning amd vulkan inference mesa llm fp8 gguf rdna4 gfx1201 gemma4

Updated Jun 20, 2026
Rust

miklebel / adrenalift

Star

Windows tool to bypass AMD driver clock gating and unlock real GPU boost clocks (RDNA4)

windows amd gpu overclock amdgpu powerplay overclocking rdna4

Updated Jun 7, 2026

KaiFelixBennett / gemma4-turboquant-rdna4

Star

Run Gemma-4-31B at full 256K context on a $1,400 AMD RDNA4 GPU (gfx1201): TurboQuant KV cache + HIP-graph-safe Flash-Attention for llama.cpp, fully measured on real hardware.

amd hip quantization gemma rocm kv-cache long-context llama-cpp local-llm llm-inference flash-attention rdna4 gfx1201 turboquant

Updated Jun 22, 2026
Python

tlee933 / llama.cpp-rdna4-gfx1201

Star

llama.cpp with native AMD RDNA4 (gfx1201) ROCm 7.11 support - 98.97 tok/s AI inference, competitive with RTX 4070 Ti, 32GB VRAM

machine-learning amd gpu hip rocm ai-inference llm llama-cpp rdna4 gfx1201

Updated Jan 3, 2026
C++

KaiFelixBennett / RadeonForge

Star

Fine-tune your own LLM on an AMD Radeon GPU — the easy, tested way. QLoRA via ROCm on Windows/WSL2 & Linux, a worked Gemma-4 example, a reusable live training dashboard, and a smoke test that proves the loss falls.

amd pytorch lora gemma rocm radeon fine-tuning peft wsl2 llm llama-cpp qlora rdna4 gfx1201

Updated Jun 20, 2026
HTML

buptanswer / mineru

Star

让 MinerU 在 AMD 显卡上以 vLLM + hybrid-auto-engine 满血运行！本项目是对官方文档中 AMD/ROCm 生态支持的有效补充。针对 MinerU 3.x 提供了一套完整的 ROCm 7.x + PyTorch 2.11.0 + vLLM 源码编译与部署方案，解析质量与速度对标 NVIDIA 旗舰显卡。

python pytorch data-extraction rocm e-book amd-gpu rag wsl2 llm document-parsing vllm mineru rdna4

Updated May 27, 2026
Python

anna-claudette / angruvadal

Star

RAM-Backed MCP Memory Architecture for Consumer LLM Inference — 900K token context on 16GB VRAM

amd mcp rocm llm llama-cpp local-llm context-window rdna4 consumer-gpu rotorquant

Updated Mar 27, 2026
Python

kicrazom / navimed-umb

Star

AWQ/W4A16 quantization, benchmarking and public release of Polish LLMs (PLLuM, Bielik) on AMD RDNA 4 (ROCm + vLLM-native) — reproducible methodology and hardware-envelope studies.

linux pytorch bom nut kubuntu homelab rocm system-monitoring amd-gpu ups-monitoring hardware-documentation local-llm rdna4 ai-workstation radeon-r9700

Updated Jun 21, 2026
Python

AlanHuang99 / qwen3.6-mtp-stack

Star

Reproducible Docker + LiteLLM stack for running Qwen 3.6 27B with MTP on AMD RDNA4 (gfx1201). Verified on Radeon AI PRO R9700 32 GB. Built from llama.cpp PR #22673.

amd vulkan homelab llama-cpp qwen speculative-decoding litellm rdna4

Updated May 8, 2026
Python

xnyzer / ollama-rocm

Star

Ollama with ROCm 7 GPU acceleration for AMD RDNA 4 (RX 9070, RX 9070 XT, RX 9060 XT - gfx1201) on Windows 11

windows amd rocm llm local-llm ollama rdna4 gfx1201 rx-9070-xt hip-sdk

Updated Jun 14, 2026
PowerShell

doublemover / RNS8

Sponsor

Star

RNS8 explores exact integer matrix multiplication on AMD GPU matrix engines.

hpc amd matrix hip rdna crt amdgpu rocm ck rns cdna matrix-engine rdna3 rdna4 hipblaslt rocwmma

Updated Jun 11, 2026
C++

eikkapine / ClawVolt

Star

Real-time dynamic GPU undervolting tool for AMD Radeon RX 9070 XT (RDNA4) on Windows. Automatically switches voltage offsets based on live core clock via AMD's official ADLX SDK — no static slider limitations. Features GUI dashboard, CLI controller, crash logger with TDR detection, and full preset management.

Updated Mar 18, 2026
Python

0x00405A00 / rocm-smi-visualizer

Star

Visualize and monitor ROCm-SMI data locally or remotely via REST API — a more convenient alternative to rocm-smi.

ai hpc amd ml rocm radeon rocm-smi rdna3 rdna4

Updated Apr 5, 2026
Python

ryan66699986-lab / xenia-edge-fork

Star

Fork of xenia-edge (has207/xenia-edge) with critical bug fixes: posix_spawn replaces fork+exec to avoid AppImage FUSE unmount race on Linux, in_process_title_relaunch takes precedence over GameMode, improved filesystem path resolution with proper XDG/AppImage support, full codebase audit with 30+ bug fixes across CPU/GPU/kernel/UI/Android.

appimage xenia linux-gaming rdna4 xbox-360-emulator

Updated Jun 22, 2026
C++

tashibi / nvdiffrast-rocm-patch

Star

Fixes and patches for NVlabs/nvdiffrast to support AMD ROCm 7.1 and Wave64 architectures (gfx1100, gfx1201).

hip amdgpu rocm rdna3 rdna4 nvdiffrast instantmesh

Updated May 1, 2026
Shell

cantascendia / rocm-rdna4-windows

Star

Run PyTorch natively on Windows 11 with an AMD RX 9070 XT (RDNA4 / gfx1201) on stable ROCm 7.2.1 — no WSL2, no Linux, no ZLUDA. Exact pinned wheel URLs, runtime env vars, documented RDNA4 pitfalls (broken nightlies, no xformers/flash-attn/bitsandbytes), and real benchmarks for ComfyUI, SD, LLMs, RVC/TTS. Verified on one 9070 XT.

Updated Jun 16, 2026
Batchfile

Improve this page

Add a description, image, and links to the rdna4 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the rdna4 topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

rdna4

Here are 23 public repositories matching this topic...

zolotukhin / zinc

patientx-cfz / comfyui-rocm

ind4skylivey / 0ptiscaler4linux

GPUOpen-Tools / isa_spec_manager

maeddesg / vulkanforge

miklebel / adrenalift

KaiFelixBennett / gemma4-turboquant-rdna4

tlee933 / llama.cpp-rdna4-gfx1201

KaiFelixBennett / RadeonForge

buptanswer / mineru

anna-claudette / angruvadal

kicrazom / navimed-umb

AlanHuang99 / qwen3.6-mtp-stack

xnyzer / ollama-rocm

doublemover / RNS8

eikkapine / ClawVolt

0x00405A00 / rocm-smi-visualizer

ryan66699986-lab / xenia-edge-fork

tashibi / nvdiffrast-rocm-patch

cantascendia / rocm-rdna4-windows

Improve this page

Add this topic to your repo