Int64 support for UpsampleNearest3d by kdrozd-dev · Pull Request #3737 · intel/torch-xpu-ops

kdrozd-dev · 2026-05-22T07:28:00Z

Fixes: #2510.

Aligns with the changes in pytorch/pytorch#144865: improves the error messages to match the CUDA ones and changes the relevant int32 variables to int64.
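To illustrate why the index variables need to be wider, here is a minimal sketch, assuming a contiguous NCDHW layout. The helper names `flat_offset_ncdhw` and `fits_in_int32` are hypothetical and are not taken from this PR or from PyTorch. It shows that the last element of a sufficiently large output tensor has a flat offset that no longer fits in a signed 32-bit integer.

```cpp
#include <cstdint>
#include <limits>

// Hypothetical helper (not from the PR): flat offset of element (n, c, d, h, w)
// in a contiguous NCDHW tensor, computed entirely in 64-bit arithmetic.
inline int64_t flat_offset_ncdhw(
    int64_t n, int64_t c, int64_t d, int64_t h, int64_t w,
    int64_t C, int64_t D, int64_t H, int64_t W) {
  return (((n * C + c) * D + d) * H + h) * W + w;
}

// Hypothetical helper: conservative check that every flat offset of a tensor
// with `numel` elements is representable as int32_t.
inline bool fits_in_int32(int64_t numel) {
  return numel <= static_cast<int64_t>(std::numeric_limits<int32_t>::max());
}
```

For example, an output of shape (1, 64, 128, 512, 1024) has 2^32 elements, so its last offset is 2^32 - 1, which is above the int32 maximum of 2^31 - 1.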

Co-authored-by: Slawomir Siwek <slawomir.siwek@intel.com>

pbielak

Looks good, but as with my changes to the MaxPool3D kernel (see PRs #3558 and #3632), we should check the impact of "hardcoding" int64_t. Maybe a dispatch over int32/int64 would give better perf results.

kdrozd-dev · 2026-05-22T08:49:08Z

> Looks good, but similar to my changes with the MaxPool3D kernel (see PR: #3558 and #3632 ), we should check the impact of the int64_t "hardcoding". Maybe a dispatch over int32/int64 would give better perf results.

Seems like a good idea. I will switch to the dispatch approach after a quick benchmark.
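The dispatch idea discussed above could be sketched roughly as follows. This is a CPU stand-in under stated assumptions, not the SYCL kernel from this PR: the names `upsample_nearest3d_ref`, `can_use_32bit_index`, and `upsample_nearest3d_dispatch` are hypothetical, and the nearest source index uses a simplified integer formula rather than PyTorch's scale-based computation. The body is templated on the index type, and the 32-bit instantiation is chosen only when both tensors are small enough.

```cpp
#include <cstdint>
#include <limits>
#include <vector>

// Hypothetical CPU stand-in for the kernel body, templated on the index type.
// Input and output are contiguous (NC, D, H, W) buffers.
template <typename index_t>
void upsample_nearest3d_ref(
    const float* in, float* out, int64_t nc,
    int64_t in_d64, int64_t in_h64, int64_t in_w64,
    int64_t out_d64, int64_t out_h64, int64_t out_w64) {
  const index_t in_d = static_cast<index_t>(in_d64);
  const index_t in_h = static_cast<index_t>(in_h64);
  const index_t in_w = static_cast<index_t>(in_w64);
  const index_t out_d = static_cast<index_t>(out_d64);
  const index_t out_h = static_cast<index_t>(out_h64);
  const index_t out_w = static_cast<index_t>(out_w64);
  const index_t out_numel =
      static_cast<index_t>(nc * out_d64 * out_h64 * out_w64);
  for (index_t i = 0; i < out_numel; ++i) {
    // Decompose the flat output index into (c, d, h, w).
    const index_t w = i % out_w;
    const index_t h = (i / out_w) % out_h;
    const index_t d = (i / (out_w * out_h)) % out_d;
    const index_t c = i / (out_w * out_h * out_d);
    // Simplified nearest source index: floor(dst * in_size / out_size).
    // The product is taken in 64 bits so it cannot overflow index_t.
    const index_t sd = static_cast<index_t>(static_cast<int64_t>(d) * in_d / out_d);
    const index_t sh = static_cast<index_t>(static_cast<int64_t>(h) * in_h / out_h);
    const index_t sw = static_cast<index_t>(static_cast<int64_t>(w) * in_w / out_w);
    out[i] = in[((c * in_d + sd) * in_h + sh) * in_w + sw];
  }
}

// Hypothetical check: both tensors must be addressable with int32_t.
inline bool can_use_32bit_index(int64_t in_numel, int64_t out_numel) {
  constexpr int64_t kMax = std::numeric_limits<int32_t>::max();
  return in_numel <= kMax && out_numel <= kMax;
}

// Picks the index width at run time; returns true if the 32-bit path ran.
inline bool upsample_nearest3d_dispatch(
    const std::vector<float>& in, std::vector<float>& out, int64_t nc,
    int64_t in_d, int64_t in_h, int64_t in_w,
    int64_t out_d, int64_t out_h, int64_t out_w) {
  const int64_t in_numel = nc * in_d * in_h * in_w;
  const int64_t out_numel = nc * out_d * out_h * out_w;
  out.assign(static_cast<std::size_t>(out_numel), 0.0f);
  const bool use32 = can_use_32bit_index(in_numel, out_numel);
  if (use32) {
    upsample_nearest3d_ref<int32_t>(
        in.data(), out.data(), nc, in_d, in_h, in_w, out_d, out_h, out_w);
  } else {
    upsample_nearest3d_ref<int64_t>(
        in.data(), out.data(), nc, in_d, in_h, in_w, out_d, out_h, out_w);
  }
  return use32;
}
```

The trade-off is two kernel instantiations instead of one, in exchange for keeping the cheaper 32-bit index arithmetic on tensors that do not need 64-bit offsets.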

kdrozd-dev · 2026-05-22T13:16:53Z

Benchmark: 3 runs, each with 20 executions per test case.

| Case | Shape | dtype | Output numel | Speedup (med) | Speedup (avg) | Stable |
|---|---|---|---|---|---|---|
| small-f32 | (2, 64, 16, 16, 16) | float32 | 4M | 0.869x | 0.899x | no |
| small-bf16 | (2, 64, 16, 16, 16) | bfloat16 | 4M | 1.044x | 1.062x | no |
| small-f16 | (2, 64, 16, 16, 16) | float16 | 4M | 1.482x | 1.287x | no |
| med-f32 | (4, 128, 32, 32, 32) | float32 | 134M | 1.034x | 0.869x | no |
| med-bf16 | (4, 128, 32, 32, 32) | bfloat16 | 134M | 0.986x | 1.089x | no |
| med-f16 | (4, 128, 32, 32, 32) | float16 | 134M | 0.925x | 0.848x | no |
| large-f32 | (1, 32, 64, 128, 128) | float32 | 268M | 1.053x | 1.053x | YES |
| large-bf16 | (1, 32, 64, 128, 128) | bfloat16 | 268M | 1.058x | 1.058x | YES |
| xl-bf16 | (1, 64, 64, 128, 256) | bfloat16 | 1.07B | 1.054x | 1.054x | YES |
| xl-f32 | (1, 64, 64, 128, 256) | float32 | 1.07B | 1.055x | 1.055x | YES |
| med-exact-f32 | (4, 128, 32, 32, 32) | float32 | 134M | 0.847x | 0.845x | no |
| med-scale3-f32 | (2, 64, 16, 16, 16) | float32 | 14M | 1.302x | 1.063x | no |

The benchmark shows that, for the stable test cases, the dispatch-based approach is about 5% faster. For the small kernels, the results were mostly noise and varied greatly.

Int64 support for UpsampleNearest3d

f656595

Silv3S reviewed May 22, 2026

Comment thread src/ATen/native/xpu/sycl/UpSampleNearest3dKernels.cpp Outdated

github-actions bot added the disable_e2e (Disable all e2e test jobs for the PR) and disable_distributed (Disable distributed UT test jobs for the PR) labels May 22, 2026

Silv3S reviewed May 22, 2026

Comment thread src/ATen/native/xpu/sycl/UpSampleNearest3dKernels.cpp Outdated

chuanqi129 marked this pull request as draft May 22, 2026 07:41

chuanqi129 marked this pull request as ready for review May 22, 2026 07:41

remove redundant cast

06d5d69

Co-authored-by: Slawomir Siwek <slawomir.siwek@intel.com>

Silv3S reviewed May 22, 2026

Comment thread src/ATen/native/xpu/sycl/UpSampleNearest3dKernels.cpp Outdated

ditto

cb34db6

Co-authored-by: Slawomir Siwek <slawomir.siwek@intel.com>

Silv3S approved these changes May 22, 2026

pbielak approved these changes May 22, 2026

Switch to dispatch based approach

12bc324

Lint

c43c880

kdrozd-dev wants to merge 5 commits into main from fix/upsample-nearest3d-int64-indexing

3 participants
