Pull requests: vllm-project/tpu-inference
Pull requests list
Refactor moe codebase
ready
ONLY add when PR is ready to merge/full CI is needed
#1199
opened Nov 29, 2025 by
kyuyeunk
[Bugfix] Fix error when using trust remote code
ready
#1198
opened Nov 27, 2025 by
kyuyeunk
[CI] Improve the procedure of waiting vllm serve
#1196
opened Nov 27, 2025 by
dennisYehCienet
[Spec][Eagle3] Pass state through jit boundary to prevent long compilation time
#1192
opened Nov 27, 2025 by
py4
fix(rpa-v3): add sliding window mask to h64 kernel and attention_sink to h128
#1185
opened Nov 26, 2025 by
erfanzar
[do not merge] test status check POC
ready
#1168
opened Nov 25, 2025 by
khluu
[Feat][WIP][TPU Offload] KV cache offload to local cpu buffer
#1163
opened Nov 24, 2025 by
juncgu-google
[MISC] Removed problematic local path for CONFTEST_DIR
#1141
opened Nov 20, 2025 by
JiriesKaileh
[Misc] Fix model dtype not being configured correctly
ready
#1093
opened Nov 13, 2025 by
kyuyeunk
Enable Pipeline Parallelism on Jax models
ready
#1077
opened Nov 12, 2025 by
Chenyaaang
1 of 8 tasks
Enable Pipeline Parallelism on Jax runner
ready
#1053
opened Nov 8, 2025 by
Chenyaaang
1 of 8 tasks
[Docs] fix dead links in multiple documentation pages
#1027
opened Nov 6, 2025 by
mattheliu
3 tasks done