Pull requests: NVIDIA/TensorRT-LLM
fix[nvbug/5286515]: trtllm-llmapi-launch on single node single gpu
#4428
opened May 19, 2025 by
Superjomn
draft: [Auto Deploy] Deepseek support for MLA attention with compressed kv
#4427
opened May 19, 2025 by
sugunav14
1 of 3 tasks
Update "Roadmap" link under README.md to the issues with Roadmap label
#4425
opened May 19, 2025 by
AdamzNV
[nvbug/5028235][fix]pytest bindings tokens logtis comparison.
#4424
opened May 19, 2025 by
dominicshanshan
[DON'T MERGER] Sharing only: Add quickstart dataset
#4423
opened May 19, 2025 by
SimengLiu-nv
Draft
[feat] Multi-block mode for Hopper spec dec XQA kernel
#4416
opened May 18, 2025 by
jhaotingc
[TRTLLM-4932] Add CLI accuracy tests for Phi-4-mini-instruct
#4415
opened May 18, 2025 by
moraxu
Scaffoldingllm supports MCP
Community Engagement
Community want to contribute
#4410
opened May 17, 2025 by
wu1du2
test(perf): Extend the Llama-Nemotron-Nano-8B perf-integration-tests (pyt)
#4407
opened May 16, 2025 by
venkywonka
Draft
[nvbug/5285881][fix] Fix chunked prefill + overlap scheduler
#4402
opened May 16, 2025 by
mikeiovine
feature: align sample state with trtllm sampler sample state
#4401
opened May 16, 2025 by
netanel-haber
Draft
doc: [TRTLLM-325]Integrate the NGC image in Makefile automation and document
#4400
opened May 16, 2025 by
MartinMarciniszyn
fix: [nvbugs/5287097] Align PP layer distribution between pytorch and TRT flow.
#4399
opened May 16, 2025 by
yuxianq