CollectivePermute based allgather in host for-loop#5963
CollectivePermute based allgather in host for-loop#5963
Conversation
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
|
!test |
Description
|
| Relevant files | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Enhancement | 7 files
| ||||||||||||||
| Tests | |||||||||||||||
| Additional files |
PR Reviewer Guide
Here are some key observations to aid the review process:
| 🧪 PR contains tests |
| ⚡ Recommended focus areas for review |
Potential division by zero
|
Test failures
-
(Medium, 3)
Missing NCCL broadcast event in tests.python.multidevice.test_overlap::test_column_parallel_linear_forwardTest Name A100 GB200 H100 Source tests.python.multidevice.test_overlap.test_column_parallel_linear_forward ❌ ❌ ❌ -
(Medium, 2)
Large numerical mismatches in multidevice column-parallel linear forward testTest Name A100 (dist.) H100 (dist.) Source tests.python.multidevice.test_overlap.test_column_parallel_linear_forward ❌ ❌
No description provided.