Pull requests: vllm-project/vllm
[LoRA] Add LoRA support for Qwen3OmniMoeThinkerForConditionalGeneration
qwen
Related to Qwen models
#37193
opened Mar 16, 2026 by
pratapyash
4 tasks done
[Performance] Enable Triton autotuning disk cache by default
#37188
opened Mar 16, 2026 by
arpera
2 of 5 tasks
[Tool Parser] Qwen3Coder: incremental string streaming, trailing newline fix, and whitespace content filter
qwen
Related to Qwen models
#37187
opened Mar 16, 2026 by
ec-jt
3 of 5 tasks
[Pixtral] Enable Pixtral language model support Eagle3
#37182
opened Mar 16, 2026 by
Flechman
Add ability to replace oot ops when using lora
#37181
opened Mar 16, 2026 by
kyuyeunk
5 tasks
[XPU] skip unsupported ut and update test_nixl_connector
ci/build
kv-connector
v1
#37179
opened Mar 16, 2026 by
zhenwei-intel
5 tasks
Bugfix for offloading+prefetch for GLM-4.7-FP8
bug
Something isn't working
#37178
opened Mar 16, 2026 by
sfbemerk
Fix KV cache memory estimation for hybrid Mamba/Attention models
v1
#37177
opened Mar 16, 2026 by
xueliangyang-oeuler
5 tasks
Fix KV cache size estimation regression in v0.17+
v1
#37172
opened Mar 16, 2026 by
xueliangyang-oeuler
5 tasks
[Frontend] feat: add streaming support for token generation endpoint
frontend
#37171
opened Mar 16, 2026 by
hhk7734
3 of 5 tasks
[Bugfix] Fix prompt_embeds precision divergence with MTP speculative …
bug
Something isn't working
speculative-decoding
v1
#37170
opened Mar 16, 2026 by
leihuang-sketch
5 tasks
[Bugfix] Fix "Already borrowed" tokenizer race in Hermes tool parser
bug
Something isn't working
#37169
opened Mar 16, 2026 by
stonelazy
Fix issue #37103: Remove shape mismatch warnings in FLA operations
#37166
opened Mar 16, 2026 by
xueliangyang-oeuler
5 tasks
[perf][connector] optimize build_connector_meta when host buffer transfer is not used
kv-connector
#37165
opened Mar 16, 2026 by
youkaichao
5 tasks
[Bugfix] Fix TOCTOU race in KV block allocator causing prefix-cache block theft
bug
Something isn't working
needs-rebase
v1
#37164
opened Mar 16, 2026 by
AbhiOnGithub
Fix issue #37103: Remove shape mismatch warnings in FLA operations
#37163
opened Mar 16, 2026 by
xueliangyang-oeuler
5 tasks
Fix issue #37103: Remove shape mismatch warnings in FLA operations
#37161
opened Mar 16, 2026 by
xueliangyang-oeuler
5 tasks
[Bugfix] Fix mock.patch resolution failure for standalone_compile.FakeTensorMode on Python <= 3.10
bug
Something isn't working
#37158
opened Mar 16, 2026 by
dbari
3 of 5 tasks
[openapi] remove redundant exception stack trace[4/N]
frontend
#37157
opened Mar 16, 2026 by
andyxning
5 tasks
Fix reasoning parser CI failure for seedoss and glm4 moe
#37154
opened Mar 16, 2026 by
xueliangyang-oeuler
5 tasks