
Average pooling clamped divisor should be applied in all conditions where the kernel can go out of bounds #4144


Closed

Conversation

@ivangarcia44 (Contributor) commented Apr 17, 2025

In this pull request I added various E2E tests to cover edge cases in the average pooling torch-to-linalg lowering algorithm. These new tests uncovered several numerical issues, which are addressed in this same PR.

One of these issues is the IREE test failure reported in #4079, which triggered this work.

Background:

In the most common case, the divisor for average pooling is just the product of the kernel dimensions. But with the padding and ceil-mode options, some elements need to be discounted from the divisor computation (see the sketch after this list). This change fixes two things:

  1. Fix the condition that determines whether the divisor is just the product of kernel dimensions or the clamped divisor computation.
  2. Add the missing isCountIncludePad logic to the divisor computation algorithm, and reverse the kernel/stride/padding parameter element order. Both were missing from the first generalization change.
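To make the interplay concrete, here is a minimal PyTorch sketch (an editorial illustration, not code from this PR) of how count_include_pad changes the effective divisor at a padded corner:

```python
import torch
import torch.nn.functional as F

# All-ones input so the averages expose the divisor directly.
x = torch.ones(1, 1, 4, 4)

# count_include_pad=True: padded zeros count toward the divisor, so the
# corner window averages 4 real ones over a full 3x3 kernel -> 4/9.
y_incl = F.avg_pool2d(x, kernel_size=3, stride=2, padding=1,
                      count_include_pad=True)

# count_include_pad=False: the divisor clamps to the 2x2 overlap with the
# input, so the corner average is 4/4 -> 1.0.
y_excl = F.avg_pool2d(x, kernel_size=3, stride=2, padding=1,
                      count_include_pad=False)

print(y_incl[0, 0, 0, 0].item())  # ~0.4444
print(y_excl[0, 0, 0, 0].item())  # 1.0
```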

@vivekkhandelwal1
@silvasean
@zjgarvey
@ramiro050
@JianzheXiao
@AmosLewis
@rsuderman
@nirvedhmeshram
@sahas3
@Hanumanth04
@dixinzhou
@rafaelubalmw

@ivangarcia44 marked this pull request as draft April 24, 2025 15:26
…rnel/stride/padding elements have to be processed in reversed order relative to the spatial dimensions.
@ivangarcia44 marked this pull request as ready for review April 24, 2025 17:11
@vivekkhandelwal1 (Collaborator)

@ivangarcia44 It is not good practice to close a PR and open a new one just because conflicts have arisen in the branch. Ideally, you should have resolved the conflicts and updated this PR, just like other contributors do. At the very least, it creates confusion and makes it hard to follow the discussion across two different PRs for the same changes.

Also, I would expect that this PR (or its new replacement) is not merged until a thorough review of the patch is done. The last PR had already made some changes which should not have been done, so I expect everyone to be more careful this time.

@ivangarcia44 reopened this Apr 29, 2025
@ivangarcia44 (Contributor, Author)

I will do the merge in this PR and close the new one.

@ivangarcia44 marked this pull request as ready for review April 29, 2025 21:59
@ivangarcia44 (Contributor, Author)

Hi all, I updated the changes after resolving the merge conflict with @vivekkhandelwal1's changes. Please review when you get a chance. Thank you!

@vivekkhandelwal1
@JianzheXiao
@AmosLewis
@rsuderman
@nirvedhmeshram
@sahas3
@Hanumanth04
@dixinzhou
@rafaelubalmw

Comment on lines 922 to 931
//
// indexStartOffset = ceil((kernelSize - 1)/2) - padding
//
// clampedKernelSize =
// min(outIntIndex * stride + indexStartOffset + floor((kernelSize - 1)/2)
// + 1,
// InputSpatialDimSize + padding) -
// max(outIntIndex * stride + indexStartOffset - ceil((kernelSize - 1)/2),
// -padding)
//
@vivekkhandelwal1 (Collaborator)

Hi @ivangarcia44, can you please add a link to the code/implementation in PyTorch from which you adopted this?

@ivangarcia44 (Contributor, Author)

Hi Vivek, the original author of the torch-linalg avg_pool2d lowering (@AmosLewis) based the 2D implementation of the divisor computation on this PyTorch code:

https://github.com/pytorch/pytorch/blob/4a6dfbe4806b361c43210dfd56db64c4097c66bb/aten/src/ATen/native/cpu/AvgPoolKernel.cpp#L78

In the new E2E test suite I added, the AvgPool2dCeilPaddingStridedIncludePadding test discovered a numerical mismatch caused by the divisor computation. Because of this I wrote a new average pooling divisor formula based on the PyTorch avg_pool2d documentation and the numerical values I observed. I tried to make the comments, variable names, and code clearer than in the previous version, which helped me correct the numerical issues I found and make all E2E tests pass.
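The clamped-kernel-size formula quoted above can be sanity-checked by hand. Below is a minimal sketch (not the PR's code) for one spatial dimension, assuming kernelSize=3, stride=2, padding=1, inputSpatialDimSize=4, and ceil mode enabled (which yields three output positions):

```python
import math

# Standalone variables mirroring the names in the quoted comment.
kernelSize, stride, padding, inputSpatialDimSize = 3, 2, 1, 4
indexStartOffset = math.ceil((kernelSize - 1) / 2) - padding  # 1 - 1 = 0

for outIntIndex in range(3):  # ceil mode yields 3 output positions here
    end = min(outIntIndex * stride + indexStartOffset
              + (kernelSize - 1) // 2 + 1,
              inputSpatialDimSize + padding)
    start = max(outIntIndex * stride + indexStartOffset
                - math.ceil((kernelSize - 1) / 2),
                -padding)
    print(outIntIndex, end - start)
# 0 -> 3: window [-1, 2) is clamped only by -padding, so padding counts.
# 1 -> 3: window [1, 4) lies fully inside the input.
# 2 -> 2: window [3, 6) is clamped to [3, 5) by inputSpatialDimSize+padding,
#         discounting the ceil-mode overhang beyond the padded input.
```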

Taking a second look at the PyTorch code and @AmosLewis's code, I noticed I had missed two details in the 1D/3D generalization of the algorithm: 1) the count_include_pad parameter used in the divisor computation, and 2) the reversal of the kernel/stride/pad parameter element order (although the second issue could have been present in the original code). The second issue was found by three new E2E tests in this PR that use non-uniform values for these parameters (see the sketch after this comment).

I updated the PR to use the PyTorch-based algorithm with these two fixes, and all E2E tests are passing now.
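To see why non-uniform parameters expose this class of bug, consider a rough sketch (illustrative only, not one of the PR's actual E2E tests): feeding the per-dimension values in reversed order changes even the output shape, so a reversed-order lowering cannot silently match.

```python
import torch
import torch.nn.functional as F

x = torch.arange(48, dtype=torch.float32).reshape(1, 1, 6, 8)

# Per-dimension (H, W) parameters in the intended order...
correct = F.avg_pool2d(x, kernel_size=(3, 2), stride=(2, 1), padding=(1, 0))

# ...and the same values consumed in reversed order, as a buggy
# generalization would effectively do.
reversed_params = F.avg_pool2d(x, kernel_size=(2, 3), stride=(1, 2),
                               padding=(0, 1))

# With non-uniform values even the shapes disagree; with uniform values
# (e.g. kernel_size=3 everywhere) the bug would be invisible.
print(correct.shape)          # torch.Size([1, 1, 3, 7])
print(reversed_params.shape)  # torch.Size([1, 1, 5, 4])
```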

@ivangarcia44 (Contributor, Author)

Hi all, this is a reminder to review when you get a chance. Thank you

@vivekkhandelwal1
@JianzheXiao
@AmosLewis
@rsuderman
@nirvedhmeshram
@sahas3
@Hanumanth04
@dixinzhou
@rafaelubalmw

@ivangarcia44 (Contributor, Author) commented May 16, 2025

Hi @vivekkhandelwal1,

Could you please take a look at this PR when you have a chance? I’ve addressed all the previous feedback, and GitHub requires your approval before it can be merged. This PR includes three numerical correctness bug fixes for the average pooling operator, which are important for our stakeholders working on safety-critical applications.

It’s been open for a month, so I wanted to check in and see if there’s anything else needed from my side to move this forward.

Thank you very much!

@silvasean
@zjgarvey
@ramiro050
@JianzheXiao
@AmosLewis
@rsuderman
@nirvedhmeshram
@sahas3
@Hanumanth04
@dixinzhou
@rafaelubalmw

@vivekkhandelwal1 (Collaborator)

> Hi @vivekkhandelwal1,
>
> Could you please take a look at this PR when you have a chance? I’ve addressed all the previous feedback, and GitHub requires your approval before it can be merged. This PR includes three numerical correctness bug fixes for the average pooling operator, which are important for our stakeholders working on safety-critical applications.
>
> It’s been open for a month, so I wanted to check in and see if there’s anything else needed from my side to move this forward.
>
> Thank you very much!

Hi @ivangarcia44, I'm sorry for the delay. I got caught up in some other work. Can I review it in a couple of days?

@ivangarcia44 (Contributor, Author)

>> Hi @vivekkhandelwal1,
>> Could you please take a look at this PR when you have a chance? I’ve addressed all the previous feedback, and GitHub requires your approval before it can be merged. This PR includes three numerical correctness bug fixes for the average pooling operator, which are important for our stakeholders working on safety-critical applications.
>> It’s been open for a month, so I wanted to check in and see if there’s anything else needed from my side to move this forward.
>> Thank you very much!
>
> Hi @ivangarcia44, I'm sorry for the delay. I got caught up in some other work. Can I review it in a couple of days?

It’s ok, next week is perfectly fine. Thank you for following up!

@vivekkhandelwal1 (Collaborator) left a comment

@ivangarcia44, thanks for the changes and for adding the clarifications. The PR looks good. If you want, you may incorporate the comment suggestion before merging the PR.

I have 2 doubts:

  1. I am unable to understand the need for reversing the kernel/stride/padding element order.
  2. In response to this comment of mine, you said:

> Because of this I wrote a new average pooling divisor formula based on the PyTorch avg_pool2d doc and numerical values I observed.

Honestly, in my opinion, doing something based on observation won't be correct. There must be some rationale/reference for it. Also, now I don't see that comment in the code.

Committing Vivek's comment update suggestion.

Co-authored-by: Vivek Khandelwal <[email protected]>
@ivangarcia44 (Contributor, Author) commented May 19, 2025

> @ivangarcia44, thanks for the changes and for adding the clarifications. The PR looks good. If you want, you may incorporate the comment suggestion before merging the PR.
>
> I have 2 doubts:
>
>   1. I am unable to understand the need for reversing the kernel/stride/padding element order.
>   2. In response to this comment of mine, you said:
>
> > Because of this I wrote a new average pooling divisor formula based on the PyTorch avg_pool2d doc and numerical values I observed.
>
> Honestly, in my opinion, doing something based on observation won't be correct. There must be some rationale/reference for it. Also, now I don't see that comment in the code.

Hi Vivek, I will integrate your requested comment change. Here are my answers to the two questions above:

  1. In the average pooling ND generalization, the spatial dimension order was reversed in PoolSizeCalculator's constructor, so the kernel/stride/padding elements had to compensate. I will update the order in PoolSizeCalculator's constructor to remove the need for the kernel/stride/padding element order reversal (a toy sketch of the pitfall follows this comment).
  2. The latest version of the divisor computation algorithm has a link to the PyTorch algorithm it is based on. I created a new divisor algorithm after finding numerical errors in the original, but it turned out that a piece of the algorithm had been omitted in the ND generalization. After adding that piece back, I could bring back the PyTorch-based algorithm, with a reference to it in the comments.

I will submit a commit for point number 1 above. Please let me know if I missed anything. Thanks,
Ivan
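A toy sketch of the ordering pitfall described in point 1, using hypothetical names rather than the actual PoolSizeCalculator code:

```python
# If the spatial dimensions are iterated in reversed order but the
# per-dimension parameter lists are indexed forward, H parameters get
# applied to W and vice versa.
spatial_sizes = [6, 8]   # (H, W) input sizes
kernel = [3, 2]          # (H, W) kernel sizes

# Buggy pairing: dimension order reversed, parameter order not.
for i, size in enumerate(reversed(spatial_sizes)):
    print(size, kernel[i])           # pairs 8 with 3 and 6 with 2: wrong

# Compensated pairing: reverse the parameters too (or, as in the final fix,
# keep the dimension iteration order consistent in the first place).
for size, k in zip(reversed(spatial_sizes), reversed(kernel)):
    print(size, k)                   # pairs 8 with 2 and 6 with 3: correct
```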
