Skip to content

Conversation

@windreamer
Copy link
Collaborator

Motivation

Flash Attention 2 does not provide pre-built wheels for CUDA 13.0. Installing FA2 on cu13 causes build failures, so we skip it for non-cu12 environments.

Modification

Conditionally install FA2 only when CUDA version matches cu12*, allowing cu13 to bypass the installation while preserving existing behavior for supported versions.

Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR updates the Docker installation script to avoid FlashAttention 2 (FA2) installation failures on CUDA 13 by conditionally installing FA2 only for CUDA 12.x environments.

Changes:

  • Add a CUDA-version guard around the FA2 wheel URL construction and pip install step so CUDA 13 images skip FA2 installation.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated 1 comment.


💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant