Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[ROCM][CI] Fix AMD Examples Test Group ci/build documentation Improvements or additions to documentation rocm Related to AMD ROCm
#30276 opened Dec 8, 2025 by Concurrensee Loading…
[AMD] Amd/deepseek aiter fusions deepseek Related to DeepSeek models needs-rebase rocm Related to AMD ROCm v1
#30274 opened Dec 8, 2025 by k50112113 Draft
[CI/Build] Use spawn subprocess for ROCm documentation Improvements or additions to documentation rocm Related to AMD ROCm
#30272 opened Dec 8, 2025 by rjrock Loading…
3 of 5 tasks
[ROCm][CI][Bugfix] Multi-Modal Model Support Fixes and Attention Backend Improvements ci/build multi-modality Related to multi-modality (#4194) qwen Related to Qwen models rocm Related to AMD ROCm
#30270 opened Dec 8, 2025 by AndreasKaratzas Loading…
[Bugfix] Fix DeepGEMM after #29546 ready ONLY add when PR is ready to merge/full CI is needed
#30267 opened Dec 8, 2025 by zhewenl Loading…
[Frontend] Fixes anthropic streaming message_start usage nesting frontend ready ONLY add when PR is ready to merge/full CI is needed
#30266 opened Dec 8, 2025 by bbartels Loading…
5 tasks
Multiple Hybrid KV Cache Coordinator v1
#30263 opened Dec 8, 2025 by roikoren755 Loading…
3 of 5 tasks
[Feature]: OpenTelemetry Metrics Support v1
#30258 opened Dec 8, 2025 by mladjan-gadzic Draft
3 of 5 tasks
[bugfix][quantization] Fix fp8 per_tensor scale shape ready ONLY add when PR is ready to merge/full CI is needed rocm Related to AMD ROCm v1
#30257 opened Dec 8, 2025 by haoyangli-amd Loading…
[ROCm] Use aiter.topk_sigmoid in llama4 llama Related to Llama models rocm Related to AMD ROCm
#30255 opened Dec 8, 2025 by tpopp Loading…
gptq marlin quantization support for fused moe with lora
#30254 opened Dec 8, 2025 by Bhanu068 Loading…
3 of 5 tasks
fix: DeepSeek-V3.2 DeepGEMM RuntimeError deepseek Related to DeepSeek models
#30251 opened Dec 8, 2025 by KeeProMise Loading…
5 tasks
[gpt-oss] Add model_identity to system message retrieval for harmony chat template frontend gpt-oss Related to GPT-OSS models
#30247 opened Dec 8, 2025 by lyuwen Loading…
5 tasks
[Bugfix] Fix fusion for VL models
#30244 opened Dec 8, 2025 by ElizaWszola Loading…
[Feature] skip language model in Encoder qwen Related to Qwen models
#30242 opened Dec 8, 2025 by Bounty-hunter Loading…
5 tasks
[Bugfix] fix streaming final output for non harmony frontend gpt-oss Related to GPT-OSS models
#30237 opened Dec 8, 2025 by penfree Loading…
Bump actions/stale from 10.1.0 to 10.1.1 ci/build dependencies Pull requests that update a dependency file github_actions Pull requests that update GitHub Actions code
#30234 opened Dec 8, 2025 by dependabot bot Loading…
Bump actions/checkout from 6.0.0 to 6.0.1 ci/build dependencies Pull requests that update a dependency file github_actions Pull requests that update GitHub Actions code
#30233 opened Dec 8, 2025 by dependabot bot Loading…
[responsesAPI][6] Fix multi turn MCP tokenization documentation Improvements or additions to documentation frontend gpt-oss Related to GPT-OSS models
#30230 opened Dec 8, 2025 by qandrew Loading…
Fix scheduler yield on arm
#30228 opened Dec 8, 2025 by wangxiyuan Loading…
5 tasks
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.