Apply constant folding to Q/DQ patterns on constants by pluralia · Pull Request #802 · Xilinx/onnx-mlir

pluralia · 2026-05-31T22:36:26Z

The update is required to remove Slice and Transpose ops when a MatMul (MM) is identified as act x weight instead of act x act.

Adds three new patterns to src/Dialect/ONNX/Transforms/ConstProp.cpp, all gated behind the existing enableQDQ flag and registered alongside the current RemoveQDQForConst. They target Q/DQ chains rooted at an ONNXConstantOp that the existing patterns leave behind:

BypassShapeOpThroughDQ<ONNXOp> — swaps Const → DQ → ShapeOp into Const → ShapeOp → DQ for Slice/Transpose/Reshape/Squeeze/Unsqueeze so the existing *OfConst folders can collapse the shape op on the integer constant.
DropIdempotentQDQOnConst — removes a Const → DQ(s,z) → Q(s,z) round-trip on an integer constant when the DQ and Q share the same scale, zero-point, and storage dtype
FoldRequantizeOnConst — materializes a new integer constant for a genuine Const → DQ(s1,z1) → Q(s2,z2,intT2) requantize by recomputing each element with round-to-nearest-even and saturation
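For reviewers, the per-element arithmetic behind FoldRequantizeOnConst can be sketched in a few lines of Python. This is an illustration only, not the code in ConstProp.cpp: the function name `requantize` and its parameters are hypothetical, scalar per-tensor scale and zero-point are assumed, and Python doubles stand in for whatever float type the pass actually uses. It follows the ONNX definitions of DequantizeLinear and QuantizeLinear (round half to even, then saturate).

```python
def requantize(q1, s1, z1, s2, z2, qmin, qmax):
    """Re-express one integer element quantized with (s1, z1) under (s2, z2).

    Hypothetical helper for illustration; assumes per-tensor (scalar) parameters.
    """
    real = (q1 - z1) * s1              # DequantizeLinear: (x - zero_point) * scale
    q2 = round(real / s2) + z2         # QuantizeLinear; Python's round() is half-to-even
    return max(qmin, min(qmax, q2))    # saturate to the target integer range


# uint8 -> uint8 with a finer target scale: 5.0 / 0.25 = 20
print(requantize(10, 0.5, 0, 0.25, 0, 0, 255))   # 20
# Out-of-range results are clamped rather than wrapped: 400 saturates to 255
print(requantize(200, 0.5, 0, 0.25, 0, 0, 255))  # 255
# Ties go to the nearest even integer: 0.5 -> 0, 1.5 -> 2, 2.5 -> 2
print([requantize(q, 0.5, 0, 1.0, 0, 0, 255) for q in (1, 3, 5)])  # [0, 2, 2]
```

When both sides use the same scale, zero-point, and integer range, this sketch returns its input unchanged, which corresponds to the round-trip that DropIdempotentQDQOnConst removes without recomputing any element.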

Copilot

Pull request overview

This PR extends ONNX constant propagation to further fold/remove Q/DQ patterns rooted at constants (under the existing enableQDQ gate), helping eliminate leftover Slice/Transpose/etc. that can block later optimizations.

Changes:

Adds BypassShapeOpThroughDQ<...> to commute shape-only ops ahead of DequantizeLinear for per-tensor quantization on constants.
Adds DropIdempotentQDQOnConst to remove no-op Const -> DQ -> Q round-trips on integer constants when parameters are identical.
Adds FoldRequantizeOnConst to materialize a new integer constant for genuine Const -> DQ -> Q requantization.
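The reason a shape-only op can be moved ahead of DequantizeLinear under per-tensor quantization can be checked with a small Python sketch. The helpers `dequantize` and `transpose` below are hypothetical stand-ins on nested lists, not onnx-mlir APIs, and the weights, scale, and zero-point are made-up values. Because a per-tensor DQ applies the same scalar formula to every element, rearranging elements before or after it gives the same result.

```python
def dequantize(rows, scale, zero_point):
    """Per-tensor DequantizeLinear on a 2-D list: one scalar scale and zero-point."""
    return [[(q - zero_point) * scale for q in row] for row in rows]


def transpose(rows):
    """Shape-only op: moves elements without changing their values."""
    return [list(col) for col in zip(*rows)]


weights = [[0, 64, 128], [192, 255, 7]]   # made-up uint8 constant
scale, zero_point = 0.02, 128             # made-up per-tensor parameters

dq_then_transpose = transpose(dequantize(weights, scale, zero_point))
transpose_then_dq = dequantize(transpose(weights), scale, zero_point)

# Const -> DQ -> Transpose and Const -> Transpose -> DQ agree element for element.
print(dq_then_transpose == transpose_then_dq)  # True
```

In the second ordering the Transpose acts directly on the integer constant, so a constant folder can evaluate it ahead of time. With per-axis scales the quantization axis would also have to follow the elements, which this sketch does not cover.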

p-lanza

please add unit tests

guwacAMD · 2026-06-01T19:09:24Z

DMAC FE Regression Summary

151 models submitted, 145 pass / 6 fail on both sides — pass/fail set identical.
No new failures, no recovered passes.
6 failures pre-exist on nightly (unrelated to the patch):
- exit 137 (OOM): fara-qwen-vl-1, fara-qwen-vl-512, phi-v-next-7b-psu0, phi-v-next-7b-psu1
- exit 134 (abort): mako-encoder-fp16, psr

Per-model frontend diff

143 / 145 identical between CHK and REF.
2 / 145 show real diffs: psx0-qdq (target) and microsoft-infoxlm-large (side effect).

psx0-qdq — net op-type deltas (REF → CHK)

| op_type | REF | CHK | Δ |
|---|---:|---:|---:|
| `MatMul_qdq_actxact_uint16xuint16xuint16` | 120 | 0 | −120 |
| `MatMul_qdq_uint16xuint8xuint16` | 0 | 120 | +120 |
| `MatMul_qdq_bias_uint16xuint8xuint16` | 0 | 2 | +2 |
| `Slice_qdq_uint8xbfloat16` | 120 | 0 | −120 |
| `Transpose_qdq_bfloat16xuint16` | 120 | 0 | −120 |
| `Transpose_qdq_uint16xuint16` | 3 | 1 | −2 |
| `Transpose_qdq_uint8xuint8` | 4 | 0 | −4 |
| `Dequant_uint16xbfloat16` | 4 | 0 | −4 |
| `Dequant_int16xbfloat16` | 2 | 0 | −2 |
| `Quant_bfloat16xuint16` | 2 | 0 | −2 |

The 120 act x act uint16×uint16 MatMuls are reclassified as AIE-runnable act x weight uint16×uint8 MatMuls; the 120 paired Slice and 120 Transpose ops on weights are collapsed; and two 1×1 Convs (AIESW-34318) are now fused as `MatMul_qdq_bias_uint16xuint8xuint16`.

microsoft-infoxlm-large — side effect

1 Conversion_uint8xuint16 removed (lm_head.dense.bias_DequantizeLinear)
1 LayerNormalization_qdq weight retyped from uint16 → uint8 (B_dtype, wgt1_bytes 2→1)
Net: −1 op, small improvement

pluralia added 3 commits May 29, 2026 07:31

Bypass shape-only ONNX ops through DequantizeLinear to enable const-f…

e924fc0

…olding

Drop idempotent and widening Q/DQ pairs on integer constants

30a04a4

Replace widening requantize drop with general FoldRequantizeOnConst

18974c3

pluralia requested review from jorickert and p-lanza as code owners May 31, 2026 22:36

akj10 requested a review from Copilot May 31, 2026 23:25

Copilot started reviewing on behalf of akj10 May 31, 2026 23:25

Copilot AI reviewed May 31, 2026

jorickert reviewed Jun 1, 2026

Comment thread src/Dialect/ONNX/Transforms/ConstProp.cpp

jorickert requested a review from ilango100 June 1, 2026 08:42

p-lanza reviewed Jun 1, 2026

pluralia and others added 5 commits June 1, 2026 06:28

Fix comments & formatting

3523f43

Guard BypassShapeOpThroughDQ against non-dense const

000499d

Merge branch 'release_rai_1_8' into ai.bypass-shape-ops-through-dq

5a2e460

Add tests

789024f

Fix previous constant folding tests

293c30c

guwacAMD approved these changes Jun 1, 2026

ilango100 requested changes Jun 1, 2026

Comment thread src/Dialect/ONNX/Transforms/ConstProp.cpp

ilango100 approved these changes Jun 1, 2026

Add DropWideningRequantizeOnConst

35620a8

pluralia force-pushed the ai.bypass-shape-ops-through-dq branch from 54f8e3e to 35620a8 on June 1, 2026 22:06

pluralia added 3 commits June 1, 2026 16:12

Add DropWideningRequantizeOnConst x2

578cfa4

Fix formatting

80ff03d

Fix tests

95f4c5e

pluralia wants to merge 12 commits into Xilinx:release_rai_1_8 from pluralia:ai.bypass-shape-ops-through-dq

6 participants
