inf is returned by nn.TransformerEncoderLayer by Stonepia · Pull Request #3674 · intel/torch-xpu-ops

Stonepia · 2026-05-14T14:30:31Z

inf is returned by nn.TransformerEncoderLayer

Fixes #2015

Root Cause: The test_transformerencoderlayer test added XPU to fast_path_device check (commit 792afdf), causing the test to expect NaN output on XPU when a fully-masked row is passed. However, XPU's TransformerEncoderLayer does not implement the fast path (unlike CUDA/CPU), so the attention implementation produces inf/NaN due to float16 overflow or softmax over -inf values rather than the expected NaN pattern. For float16, large input values ([20., 30., 40., 50.]) cause overflow in the softmax/attention computation, producing NaN instead of the expected reference values.

Failed Tests:

test/test_nn.py::TestNNDeviceTypeXPU::test_transformerencoderlayer_gelu_xpu_float16
test/test_nn.py::TestNNDeviceTypeXPU::test_transformerencoderlayer_xpu_float16
test/test_nn.py::TestNNDeviceTypeXPU::test_transformerencoderlayer_xpu_float32
test/test_nn.py::TestNNDeviceTypeXPU::test_transformerencoderlayer_xpu_float64

Diff stat:

test/xpu/test_nn_xpu.py | 5 ++---
 1 file changed, 2 insertions(+), 3 deletions(-)

inf is returned by nn.TransformerEncoderLayer

chuanqi129 · 2026-05-14T14:31:52Z

@copilot The Python lint (flake8/ruff) check failed. Please fix the lint errors in this PR.

Lint job log: https://github.com/intel/torch-xpu-ops/actions/runs/25865859919

Instructions:

Review the lint failure log via the link above
Fix all lint errors in the relevant source files
Ensure both Python lint and Clang format checks pass

Agent-Logs-Url: https://github.com/intel/torch-xpu-ops/sessions/263ed202-9dbb-4bef-901e-8f32d916faa2 Co-authored-by: chuanqi129 <13608516+chuanqi129@users.noreply.github.com>

Copilot · 2026-05-14T14:33:47Z

@copilot The Python lint (flake8/ruff) check failed. Please fix the lint errors in this PR.

Lint job log: https://github.com/intel/torch-xpu-ops/actions/runs/25865859919...

Fixed in commit a5d61d7. The PYFMT formatter required collapsing the multi-line and (... or ...) condition into a single line: and ("cuda" in str(device) or "cpu" in str(device)).

Fix for #2015 (attempt 3)

73ff88b

inf is returned by nn.TransformerEncoderLayer

Stonepia added the disable_all Disable all ci test jobs for the PR, just keep basic lint check label May 14, 2026

Stonepia mentioned this pull request May 14, 2026

inf is returned by nn.TransformerEncoderLayer #2015

Closed

11 tasks

Copilot started work on behalf of chuanqi129 May 14, 2026 14:32 View session

Fix lint: collapse multi-line condition to single line per PYFMT

a5d61d7

Agent-Logs-Url: https://github.com/intel/torch-xpu-ops/sessions/263ed202-9dbb-4bef-901e-8f32d916faa2 Co-authored-by: chuanqi129 <13608516+chuanqi129@users.noreply.github.com>

Copilot finished work on behalf of chuanqi129 May 14, 2026 14:34

Copilot AI requested a review from chuanqi129 May 14, 2026 14:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

inf is returned by nn.TransformerEncoderLayer#3674

inf is returned by nn.TransformerEncoderLayer#3674
Stonepia wants to merge 2 commits into
mainfrom
agent/issue-2015

Stonepia commented May 14, 2026

Uh oh!

chuanqi129 commented May 14, 2026

Uh oh!

Copilot AI commented May 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Stonepia commented May 14, 2026