Pull requests: vllm-project/vllm

Pull requests list

Revert "[Fix]Load kv-cache dtype from hf_quant_config.json automatically"
Labels: ci-failure, ready
#30653 opened Dec 14, 2025 by robertgshaw2-redhat
Strengthen input validation and tests for 'parse_raw_prompts'.
#30652 opened Dec 14, 2025 by mivehk
3 of 5 tasks
[Bugfix] CustomAR + TritonAttn[AMPERE] + FULL_CG - gpt-oss
Labels: gpt-oss, nvidia
#30650 opened Dec 14, 2025 by bbrowning
additional protection for CVE-2025-62164
Labels: frontend, multi-modality
#30649 opened Dec 14, 2025 by wenqiglantz
fix: fix engine initialization fails with ValueError
#30645 opened Dec 14, 2025 by leejianwoo-collab
5 tasks done
[Bugfix] Fix RequestOutput miss lora_request
Labels: llama, ready, v1
#30636 opened Dec 14, 2025 by jeejeelee
5 tasks
Update docs README.md to add NVFP4 quantization support
Labels: documentation
#30634 opened Dec 14, 2025 by omrialmog
tuned fused configs for B300
#30629 opened Dec 14, 2025 by navmarri14
[MoE][Refactor 1/N] Separate Online Quantization
#30627 opened Dec 13, 2025 by robertgshaw2-redhat
5 tasks
[docker] Restructure Dockerfile for more efficient and cache-friendly builds
Labels: ci/build, documentation, ready
#30626 opened Dec 13, 2025 by amrmahdi
[CI/Build] Ignore max transformers version skipping for initialization tests
Labels: ready
#30619 opened Dec 13, 2025 by Isotr0py
1 of 5 tasks
[Docs] Add FlashInfer environment variables to env_vars documentation
Labels: documentation
#30616 opened Dec 13, 2025 by majiayu000
2 tasks done
[Bugfix] Add validation for tool requests when tool_parser is unavailable
Labels: documentation, frontend, ready
#30613 opened Dec 13, 2025 by majiayu000
2 tasks done
[ROCm][Perf] Replace cat to bmm's inplace write when aiter enabled
Labels: rocm, v1
#30611 opened Dec 13, 2025 by ganyi1996ppo
5 tasks
[FixBug]fix gpt-oss v1/completions response bug
Labels: frontend, gpt-oss, tool-calling
#30608 opened Dec 13, 2025 by princepride
3 of 5 tasks
[Bugfix] Improve DCP error hint in cp_utils
Labels: v1
#30607 opened Dec 13, 2025 by jliu9515
3 of 5 tasks