[TPU] support disabling xla compilation cache #15567

yaochengji · 2025-03-26T19:15:41Z

The PyTorch/XLA compilation cache uses the Torch IR to generate keys. Consequently, changes in optimization flags, which affect compilation results, don't change the cache key. This can result in the wrong compilation being used. To prevent this, disabling the XLA compilation cache during development is recommended. We can disable it by export VLLM_XLA_CACHE_PATH=

Signed-off-by: Chengji Yao <chengjiyao@google.com>

github-actions · 2025-03-26T19:15:50Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

alexm-redhat

Good catch @yaochengji !

Signed-off-by: Chengji Yao <chengjiyao@google.com>

Signed-off-by: Chengji Yao <chengjiyao@google.com> Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

Signed-off-by: Chengji Yao <chengjiyao@google.com> Signed-off-by: xinyuxiao <xinyuxiao2024@gmail.com>

Signed-off-by: Chengji Yao <chengjiyao@google.com> Signed-off-by: Louis Ulmer <ulmerlouis@gmail.com>

Signed-off-by: Chengji Yao <chengjiyao@google.com>

[TPU] support disabling xla compilation cache

e0557b5

Signed-off-by: Chengji Yao <chengjiyao@google.com>

yaochengji requested review from WoosukKwon, robertgshaw2-redhat, njhill, ywang96, comaniac and alexm-redhat as code owners March 26, 2025 19:15

mergify bot added the v1 label Mar 26, 2025

alexm-redhat approved these changes Mar 26, 2025

View reviewed changes

alexm-redhat enabled auto-merge (squash) March 26, 2025 19:29

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Mar 26, 2025

mgoin approved these changes Mar 26, 2025

View reviewed changes

mgoin added the tpu Related to Google TPUs label Mar 26, 2025

alexm-redhat merged commit e74ff40 into vllm-project:main Mar 27, 2025
42 checks passed

lengrongfu pushed a commit to lengrongfu/vllm that referenced this pull request Apr 2, 2025

[TPU] support disabling xla compilation cache (vllm-project#15567)

3d49cb8

Signed-off-by: Chengji Yao <chengjiyao@google.com>

kylesayrs pushed a commit to neuralmagic/vllm that referenced this pull request Apr 2, 2025

[TPU] support disabling xla compilation cache (vllm-project#15567)

36865a7

Signed-off-by: Chengji Yao <chengjiyao@google.com> Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

Alex4210987 pushed a commit to LeiWang1999/vllm-bitblas that referenced this pull request Apr 5, 2025

[TPU] support disabling xla compilation cache (vllm-project#15567)

7dedc96

Signed-off-by: Chengji Yao <chengjiyao@google.com> Signed-off-by: xinyuxiao <xinyuxiao2024@gmail.com>

lulmer pushed a commit to lulmer/vllm that referenced this pull request Apr 7, 2025

[TPU] support disabling xla compilation cache (vllm-project#15567)

32bbe1d

Signed-off-by: Chengji Yao <chengjiyao@google.com> Signed-off-by: Louis Ulmer <ulmerlouis@gmail.com>

nishith-fujitsu pushed a commit to nishith-fujitsu/vllm that referenced this pull request Apr 9, 2025

[TPU] support disabling xla compilation cache (vllm-project#15567)

5ad5f71

Signed-off-by: Chengji Yao <chengjiyao@google.com>

ckhordiasma mentioned this pull request Apr 17, 2025

[do not merge] pr test for nm changes into 2.20 red-hat-data-services/vllm#107

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[TPU] support disabling xla compilation cache #15567

[TPU] support disabling xla compilation cache #15567

yaochengji commented Mar 26, 2025

github-actions bot commented Mar 26, 2025

alexm-redhat left a comment

[TPU] support disabling xla compilation cache #15567

[TPU] support disabling xla compilation cache #15567

Conversation

yaochengji commented Mar 26, 2025

github-actions bot commented Mar 26, 2025

alexm-redhat left a comment

Choose a reason for hiding this comment