[TLE]: add triton language extension (tle) support#399
Open
small-cat wants to merge 13 commits intotriton_v3.2.xfrom
Open
[TLE]: add triton language extension (tle) support#399small-cat wants to merge 13 commits intotriton_v3.2.xfrom
small-cat wants to merge 13 commits intotriton_v3.2.xfrom
Conversation
* move tle.ascend to tle.dsa.ascend * move tle_ir to third_party/tle/dsa * reimplement alloc/to_tensor/to_buffer reference to buffer_ir in third_party/ascend * reimplement tle.dsa.ascend scope with address_space in ascend
* [FIX] remove memory_space_cast in dsa_to_tensor because the op removes the memory space attribute and result in compiling errors * [TESTING] add collect_single method in ascend/testing.py to preserve the original benchmark statistics
* decouple TleOps from TritonOps and mov to third_party/tle/dsa/dialect * implement the TleOp conversion in third_party/tle/dsa rather than in flir directly, flir just call the conversion in its pass
* backend/ascend/spec/triton/compiler/code_generator.py still use tle.dsa in its visitor to visit python ast
* fix tle.dsa.hint for nested usage, see python/test/tle/test_tle_with_hints.py * implement extract_tle in experimental/tle
…mpiler/code_generator.py and add sparse_flash_attn_tle.py
08c1606 to
b6fb294
Compare
zhzhcookie
reviewed
Mar 10, 2026
zhzhcookie
reviewed
Mar 10, 2026
third_party/ascend/backend/spec/include/triton/Dialect/Triton/IR/TritonOps.td
Show resolved
Hide resolved
Collaborator
|
Please add tle tests to .github/workflows/ascend-build-and-test.yml |
neoskypig
reviewed
Mar 10, 2026
third_party/tle/dsa/dialect/include/Conversion/TleToLinalg/DSACopyConverter.h
Outdated
Show resolved
Hide resolved
neoskypig
reviewed
Mar 10, 2026
third_party/tle/dsa/dialect/lib/Conversion/TleToLinalg/DSACopyConverter.cpp
Outdated
Show resolved
Hide resolved
neoskypig
reviewed
Mar 10, 2026
third_party/tle/dsa/dialect/lib/Conversion/TleToLinalg/MathConverter.cpp
Outdated
Show resolved
Hide resolved
0cb44f1 to
d8fb02d
Compare
zhzhcookie
reviewed
Mar 11, 2026
9f8793a to
bcd2520
Compare
426b4e5 to
860a1f8
Compare
sunnycase
reviewed
Mar 13, 2026
sunnycase
reviewed
Mar 13, 2026
860a1f8 to
980243b
Compare
980243b to
bdd0e47
Compare
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
This pull request introduces Triton Language Extension (TLE) support for Triton, enabling high-performance computing features such as on-chip memory management, pipeline compilation hints, and corresponding arithmetic operations. The extension is currently optimized for Ascend 910B devices mainly.
Key Operations in TLE
Memory & Tensor Operations
Arithmetic Operations
Pipeline & Parallelism
Examples
Example usage can be found under python/test/tle.