-
Notifications
You must be signed in to change notification settings - Fork 59
Pull requests: vllm-project/tpu-inference
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Unskip the lora column_parallel_packed test on v7x
#1314
opened Dec 15, 2025 by
vanbasten23
Loading…
[CI] This PR enhances testing of the CI procedures on both v6e and v7x.
#1311
opened Dec 15, 2025 by
dennisYehCienet
•
Draft
Refactor tuning for RPA HD64 kernel tuning to improve RPA kernel throughput
ready
ONLY add when PR is ready to merge/full CI is needed
#1308
opened Dec 14, 2025 by
helloworld1
Loading…
Add Quantized Weights Support for MoE Layers
ready
ONLY add when PR is ready to merge/full CI is needed
#1300
opened Dec 12, 2025 by
kyuyeunk
Loading…
[do not merge ]Get all change files instead of last commit when bootstrap.
ready
ONLY add when PR is ready to merge/full CI is needed
#1299
opened Dec 12, 2025 by
QiliangCui
Loading…
[test do not review]
ready
ONLY add when PR is ready to merge/full CI is needed
#1298
opened Dec 12, 2025 by
QiliangCui
Loading…
Add dummy placeholder for unsupported models in the support matrix
ready
ONLY add when PR is ready to merge/full CI is needed
#1291
opened Dec 12, 2025 by
boe20211
Loading…
[JAX][MoE] Integrate multiple MoE kernels in MoE modules
#1287
opened Dec 11, 2025 by
bzgoogle
Loading…
[multihost] Integrate expert parallelism to RayExecutor
#1282
opened Dec 10, 2025 by
Lumosis
Loading…
[multihost] Use make_array_from_process_local_data to create global array instead of device_put
#1281
opened Dec 10, 2025 by
Lumosis
Loading…
[do not review][do not submit]
ready
ONLY add when PR is ready to merge/full CI is needed
#1277
opened Dec 10, 2025 by
QiliangCui
Loading…
Fix TPU7x chip counting to account for chiplet architecture
#1266
opened Dec 8, 2025 by
burbajr
Loading…
Add workflow to build vLLM-TPU wheel using PyPI tpu-inference
ready
ONLY add when PR is ready to merge/full CI is needed
[DRAFT] Optimize Dockerfile to reduce image size and build time.
#1226
opened Dec 3, 2025 by
py4
Loading…
[CI] Fix awq dtype
ready
ONLY add when PR is ready to merge/full CI is needed
#1220
opened Dec 2, 2025 by
kyuyeunk
Loading…
Save size in scalar scratch for bo and bq
ready
ONLY add when PR is ready to merge/full CI is needed
#1201
opened Dec 1, 2025 by
rupengliu-meta
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-11-15.