[None][fix] configure DeepGEMM PDL during engine init#15004

Open
liji-nv wants to merge 1 commit into NVIDIA:feat/deepseek_v4 from liji-nv:fix/deepgemm-pdl-engine-init

Conversation

@liji-nv
Collaborator

@liji-nv liji-nv commented Jun 5, 2026

@coderabbitai summary

Description

Test Coverage

PR Checklist

Please review the following before submitting your PR:

  • PR description clearly explains what and why. If using CodeRabbit's summary, please make sure it makes sense.

  • PR Follows TRT-LLM CODING GUIDELINES to the best of your knowledge.

  • Test cases are provided for new code paths (see test instructions)

  • If PR introduces API changes, an appropriate PR label is added - either api-compatible or api-breaking. For api-breaking, include BREAKING in the PR title.

  • Any new dependencies have been scanned for license and vulnerabilities

  • CODEOWNERS updated if ownership changes

  • Documentation updated as needed

  • Update tava architecture diagram if there is a significant design change in PR.

  • The reviewers assigned automatically/manually are appropriate for the PR.

  • Please check this after reviewing the above items as appropriate for this PR.

GitHub Bot Help

To see a list of available CI bot commands, please comment /bot help.

Signed-off-by: Jin Li <59594262+liji-nv@users.noreply.github.com>
@liji-nv liji-nv requested review from a team as code owners June 5, 2026 12:36
@liji-nv liji-nv requested review from achartier and yizhang-nv and removed request for a team June 5, 2026 12:36
@liji-nv
Collaborator Author

liji-nv commented Jun 5, 2026

/bot run --disable-fail-fast --add-multi-gpu-tests

@tensorrt-cicd
Collaborator

PR_Github #52341 Bot args parsing error: usage: /bot [-h]
{run,kill,skip,submit,reviewers,reuse-pipeline,reuse-review} ...
/bot: error: unrecognized arguments: --add-multi-gpu-tests

Link to invocation

@jiaganc
Collaborator

jiaganc commented Jun 5, 2026

/bot run

@tensorrt-cicd
Collaborator

PR_Github #52362 [ run ] triggered by Bot. Commit: d648a2e Link to invocation

@tensorrt-cicd
Collaborator

PR_Github #52362 [ run ] completed with state SUCCESS. Commit: d648a2e
/LLM/main/L0_MergeRequest_PR pipeline #41658 completed with status: 'FAILURE'

CI Report

⚠️ Action Required:

  • Please check the failed tests and fix your PR
  • If you cannot view the failures, ask the CI triggerer to share details
  • Once fixed, request an NVIDIA team member to trigger CI again

CI Agent Failure Analysis

Link to invocation


from tensorrt_llm import deep_gemm

deep_gemm.set_pdl(os.environ.get("TRTLLM_ENABLE_PDL", "1") == "1")
Collaborator

We could re-use get_env_enable_pdl from flashinfer_utils.py here.
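
The suggestion above could look roughly like the following sketch. The real `get_env_enable_pdl` lives in `flashinfer_utils.py` and its exact signature and parsing rules are not shown in this thread, so the version below is a hypothetical stand-in that mirrors the `TRTLLM_ENABLE_PDL` check from the diff. `FakeDeepGemm` and `configure_pdl` are illustrative names too, used only so the snippet runs without TensorRT-LLM installed.

```python
import os


def get_env_enable_pdl() -> bool:
    """Hypothetical stand-in for the helper in flashinfer_utils.py.

    Assumption: PDL is on unless TRTLLM_ENABLE_PDL is set to something
    other than "1", matching the expression in the diff.
    """
    return os.environ.get("TRTLLM_ENABLE_PDL", "1") == "1"


class FakeDeepGemm:
    """Illustrative substitute for tensorrt_llm.deep_gemm; records the flag."""

    def __init__(self) -> None:
        self.pdl_enabled = None

    def set_pdl(self, enabled: bool) -> None:
        self.pdl_enabled = enabled


def configure_pdl(deep_gemm) -> None:
    # A single shared helper decides the flag, so every caller parses
    # the environment variable the same way.
    deep_gemm.set_pdl(get_env_enable_pdl())
```

In the PR itself the call site would presumably reduce to `deep_gemm.set_pdl(get_env_enable_pdl())`, assuming the existing helper takes no arguments and returns a bool.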


def _configure_deep_gemm_pdl() -> None:
    global _DEEP_GEMM_PDL_CONFIGURED
    if _DEEP_GEMM_PDL_CONFIGURED:
Collaborator

I'm not sure if the check is really useful since that function is called from __init__, so it should only be called once. If it was called from multiple places, we should have proper locking anyway.

4 participants