-
-
Notifications
You must be signed in to change notification settings - Fork 5.7k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Model] Add support for GraniteMoeShared models
#13313
opened Feb 15, 2025 by
tjohnson31415
Loading…
[Bugfix] Massage MLA's usage of flash attn for RoCM
AMD GPU
bug
Something isn't working
ready
ONLY add when PR is ready to merge/full CI is needed
#13310
opened Feb 14, 2025 by
tlrmchlsmth
Loading…
[V1][Tests] Adding additional testing for multimodal models to V1
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#13308
opened Feb 14, 2025 by
andoorve
Loading…
Upstream prefix prefill speed up
ci/build
needs-rebase
v1
#13305
opened Feb 14, 2025 by
maleksan85
•
Draft
[Quant] Move to ONLY add when PR is ready to merge/full CI is needed
packed_modules_mapping
from class var to instance var
needs-rebase
quantization
ready
#13304
opened Feb 14, 2025 by
kylesayrs
Loading…
[BugFix] Don't scan entire cache dir when loading model
bug
Something isn't working
force-merge
ready
ONLY add when PR is ready to merge/full CI is needed
#13302
opened Feb 14, 2025 by
njhill
Loading…
[Frontend][Docs] Transcription API streaming
documentation
Improvements or additions to documentation
frontend
#13301
opened Feb 14, 2025 by
NickLucche
Loading…
[V1][WIP] 2nd try of Hybrid allocator for full attention & sliding window attention interleaved models
needs-rebase
v1
#13296
opened Feb 14, 2025 by
heheda12345
•
Draft
[Metrics] Add Improvements or additions to documentation
v1
--show-hidden-metrics-for-version
CLI arg
documentation
#13295
opened Feb 14, 2025 by
markmc
Loading…
[Misc] Prevent vLLM from searching for files on the hub when the model is a local path
#13292
opened Feb 14, 2025 by
maxdebayser
Loading…
Minor fix in documentation for tool_calling.md
documentation
Improvements or additions to documentation
#13291
opened Feb 14, 2025 by
tugot17
Loading…
Missing comment explaining VDR variable in GGUF kernels
#13290
opened Feb 14, 2025 by
SzymonOzog
Loading…
[V1][Metrics] Add iteration_tokens_total histogram from V0
v1
#13288
opened Feb 14, 2025 by
markmc
Loading…
[Bugfix] Fix qwen2.5-vl image processor
ready
ONLY add when PR is ready to merge/full CI is needed
#13286
opened Feb 14, 2025 by
Isotr0py
Loading…
[Bugfix]Fix search start_index of stop_checker
ready
ONLY add when PR is ready to merge/full CI is needed
#13280
opened Feb 14, 2025 by
xu-song
Loading…
[Misc] Make interval configurable for logging stats
v1
#13275
opened Feb 14, 2025 by
Sakalya
Loading…
[Bugfix][Docs] Fix offline Whisper
documentation
Improvements or additions to documentation
force-merge
frontend
ready
ONLY add when PR is ready to merge/full CI is needed
#13274
opened Feb 14, 2025 by
NickLucche
Loading…
Previous Next
ProTip!
Adding no:label will show everything without a label.