fix(rtpllm): adapt to RTP-LLM PyAttentionInputs host/device field rename #1412

Open
Jonathan-hwx wants to merge 2 commits into ROCm:main from Jonathan-hwx:fix/rtp-host-device-rename-cuda-graph-acc

fix(rtpllm): don't use cache-store block table under CUDA-graph capture

00d0768