ROCm / vllm Public

forked from vllm-project/vllm

Notifications You must be signed in to change notification settings
Fork 41
Star 75

Code
Issues 9
Pull requests 29
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Pull requests: ROCm/vllm

Labels 12 Milestones 0

New pull request New

29 Open 483 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Updated README.md with April 29 results

#526 opened Apr 27, 2025 by Mcirino1

Loading…

fix total_num_seq

#525 opened Apr 27, 2025 by hliuca

Loading…

BF16 Skinny Optimization

#520 opened Apr 22, 2025 by amd-hhashemi

Loading…

integrate aiter

#516 opened Apr 18, 2025 by fsx950223

Loading…

Enable RPD Profiler in OpenAI server

#513 opened Apr 15, 2025 by rebklee

Loading…

removing quant and kv-cache fp8 from deepseek run instructions

#509 opened Apr 9, 2025 by arakowsk-amd

Loading…

creating 1 gpu agents on OCI cluster

#488 opened Mar 24, 2025 by dhonnappa-amd

Loading…

WIP: Fixes to kernel tests

#487 opened Mar 24, 2025 by hissu-hyvarinen

Loading…

Handling input dim size greater than 3 in tuned_gemm.py

#482 opened Mar 13, 2025 by charlifu

Loading…

EXPERIMENTING WITH K8S // NO NEED TO MERGE // Rocm vllm ci fix nd k8 osci

#477 opened Mar 12, 2025 by Alexei-V-Ivanov-AMD

Loading…

Rocm vllm ci fix

#468 opened Mar 10, 2025 by Alexei-V-Ivanov-AMD

Loading…

Test Queues

#456 opened Feb 28, 2025 by dhonnappa-amd • Draft

Enable custom paged attention kernel for Navi 3/4

#446 opened Feb 24, 2025 by hyoon1

Loading…

Dummy PR, no need to merge

#438 opened Feb 19, 2025 by hissu-hyvarinen

Loading…

updating dev-docker README 20250214

#426 opened Feb 14, 2025 by arakowsk-amd • Draft

Updating ISL and OSL to align with reported benchmark table

#424 opened Feb 14, 2025 by eduand-alvarez

Loading…

K8test baseline -> Testing a single MI300 8x GPU node for CI performance // no need to merge

#409 opened Feb 6, 2025 by Alexei-V-Ivanov-AMD

Loading…

K8 node testing // no need to merge

#404 opened Feb 4, 2025 by Alexei-V-Ivanov-AMD

Loading…

Fp8 header

#396 opened Jan 31, 2025 by gshtras

Loading…

Test queue with 8 gpu

#393 opened Jan 29, 2025 by dhonnappa-amd

Loading…

[Bugfix] Deepseek v3 fix max_num_batched_tokens

#386 opened Jan 24, 2025 by Concurrensee • Draft

Switching building to MI300. stale

#380 opened Jan 22, 2025 by Alexei-V-Ivanov-AMD

Loading…

Adding Deepseek instruct + update manifest stale

#379 opened Jan 22, 2025 by arakowsk-amd

Loading…

Add TritonScaledMMLinearKernel to fix broken support for int8 models stale

#377 opened Jan 21, 2025 by rasmith

Loading…

Trying to pass toml file as a parameter to codespell stale

#376 opened Jan 21, 2025 by gshtras

Loading…

Previous 1 2 Next

Previous Next

ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly