[Fluidstack] sky launch can leak instances when instance creation times out #4392

Closed
Xe opened this issue Nov 21, 2024 · 2 comments

Comments

@Xe

Xe commented Nov 21, 2024

With a config like this:

resources:
  accelerators: [A100:1]
  cloud: fluidstack

setup: "echo hi"

run: "python -m http.server 8080"

Fluidstack instance creation fails, and the instances left behind in the cloud cannot be destroyed. The panel returns a 500 error ("try again in 60 seconds") with this JSON body:

{"message":"Unable to terminate instance. Please try again in 60 seconds."}
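For anyone stuck with leaked instances, here is a rough sketch of a retry loop around that error. It is not SkyPilot's or Fluidstack's actual code: `terminate` is a hypothetical callable that returns `(status_code, json_body)` and stands in for whatever client you use to call the provider, and the attempt count and wait time are assumptions based only on the message text above.

```python
import time
from typing import Callable, Tuple

# Substring of the error message quoted in this issue (compared lowercase).
RETRY_HINT = "try again in 60 seconds"


def terminate_with_retry(
    terminate: Callable[[str], Tuple[int, dict]],  # hypothetical client call
    instance_id: str,
    max_attempts: int = 5,        # assumed limit, not from the provider
    wait_seconds: float = 60.0,   # taken from the error message text
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Retry termination while the provider asks us to try again later.

    Returns True once a 2xx status is seen, False if every attempt hit
    the transient error. Any other failure raises RuntimeError.
    """
    for attempt in range(1, max_attempts + 1):
        status, body = terminate(instance_id)
        if 200 <= status < 300:
            return True
        message = str(body.get("message", "")).lower()
        # Only retry the specific 500 error reported here.
        if status != 500 or RETRY_HINT not in message:
            raise RuntimeError(f"terminate failed: {status} {body}")
        if attempt < max_attempts:
            sleep(wait_seconds)
    return False
```

Passing `sleep` as a parameter keeps the loop testable without waiting a full minute per attempt. If it returns False, the instance is still running and needs manual cleanup.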

Version & Commit info:

  • sky -v: skypilot, version 1.0.0.dev20241031
  • sky -c: skypilot, commit c4eeeb5fb3ef64be0f05a727e119ac9266f8940f
@mjibril
Contributor

mjibril commented Nov 22, 2024

@Xe there is a fix for this issue in this PR, which is already merged to master and available in the nightly build.

@Michaelvll
Collaborator

Closing this issue as it is now fixed.

@Michaelvll Michaelvll added the OSS label Dec 19, 2024 — with Linear
@Michaelvll Michaelvll removed the OSS label Dec 19, 2024
@Michaelvll Michaelvll added the OSS label Dec 19, 2024 — with Linear