coll: make bcast ring unsigned-safe #13287

hppritcha · 2025-06-02T16:39:20Z

In the conversion to support big count, several signed ints were replaced by unsigned types (size_t). Unfortunately, a few places relied on signedness, and these need to be refactored.

To find these places, the GCC -Wtype-limits compile option was used. This PR also adds that option to the --enable-picky compile option list.
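To illustrate the class of bug described above, here is a minimal sketch. The function names are hypothetical and are not taken from the Open MPI sources. It contrasts a signed computation that relies on a negative result, a naive port to size_t where the check silently becomes dead code, and an unsigned-safe rewrite.

```c
#include <stddef.h>

/* Signed original: a negative difference means "nothing left". */
static int remaining_signed(int count, int offset)
{
    int rem = count - offset;
    return rem < 0 ? 0 : rem;
}

/* Naive port to size_t (hypothetical): the subtraction wraps around
 * modulo SIZE_MAX + 1, and "rem < 0" can never be true for an
 * unsigned type. This is the kind of comparison that GCC's
 * -Wtype-limits reports. */
static size_t remaining_unsigned_buggy(size_t count, size_t offset)
{
    size_t rem = count - offset;
    return rem < 0 ? 0 : rem;
}

/* Unsigned-safe rewrite: compare before subtracting. */
static size_t remaining_unsigned_fixed(size_t count, size_t offset)
{
    return count > offset ? count - offset : 0;
}
```

With count = 4 and offset = 10, the signed and fixed versions return 0, while the naive port returns a very large wrapped value. Compiling a file like this with `gcc -Wtype-limits` should flag the always-false comparison in the naive port.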

devreal

A suggestion (feel free to ignore): this pattern occurs in multiple places here and could occur in other places. We could extract it into an inline function:

static inline size_t rectified_diff(size_t a, size_t b)
{
    return a > b ? a - b : 0;
}

ompi/mca/coll/base/coll_base_bcast.c

hppritcha · 2025-06-03T20:08:19Z

Good idea. I kept a helper routine inside this file as part of this PR.

hppritcha · 2025-06-04T19:36:36Z

@devreal please review again when you have a chance

devreal

Thanks @hppritcha! Looks good, just one nit

devreal · 2025-06-04T07:43:41Z

ompi/mca/coll/base/coll_base_bcast.c

@@ -850,10 +860,8 @@ int ompi_coll_base_bcast_intra_scatter_allgather(
     * Allgather by recursive doubling
     * Each process has the curr_count elems in the buf[vrank * scatter_count, ...]
     */
-    size_t rem_count = count - vrank * scatter_count;
+    size_t rem_count = (count > vrank * scatter_count) ? count - vrank * scatter_count : 0;


Can use rectify_diff here too:

Suggested change

-    size_t rem_count = (count > vrank * scatter_count) ? count - vrank * scatter_count : 0;
+    size_t rem_count = rectify_diff(count, (size_t)(vrank * scatter_count));

good catch!
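As a worked illustration of why the clamp matters for rem_count, the sketch below reimplements the helper locally and assumes the per-rank block size is ceil(count / comm_size). That formula and the function remaining_count are assumptions made for this example, not code from coll_base_bcast.c.

```c
#include <stddef.h>

/* Local stand-in for the helper discussed in this thread. */
static inline size_t rectify_diff(size_t a, size_t b)
{
    return a > b ? a - b : 0;
}

/* Hypothetical: number of elements from vrank's block to the end of the
 * buffer, assuming blocks of ceil(count / comm_size) elements each. */
static size_t remaining_count(size_t count, size_t comm_size, size_t vrank)
{
    size_t scatter_count = (count + comm_size - 1) / comm_size;
    return rectify_diff(count, vrank * scatter_count);
}
```

For count = 5 and comm_size = 4, the block size is 2, so vrank 3 starts at offset 6, past the end of the buffer. The clamped helper yields 0 there, whereas the plain unsigned subtraction `count - vrank * scatter_count` would wrap around to a huge value.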

hppritcha · 2025-06-09T14:05:21Z

@devreal recheck when you have a chance

hppritcha · 2025-06-09T15:28:11Z

@dalcinl would you have suggestions on debugging what's going on with mpi4py? It looks like some kind of problem with generating the mpi4py internal docs.

devreal

Thanks @hppritcha!

hppritcha · 2025-06-10T18:24:01Z

@dalcinl never mind, we are just going to disable test_doc.py in our CI.

In the conversion to support big count, several signed ints were replaced by unsigned types (size_t). Unfortunately, a few places relied on signedness, and these need to be refactored.

To find these places, the GCC -Wtype-limits compile option was used. This PR also adds that option to the --enable-picky compile option list.

Signed-off-by: Howard Pritchard <[email protected]>

github-actions bot added the Target: main label Jun 2, 2025

hppritcha requested a review from devreal June 2, 2025 16:39

devreal requested changes Jun 2, 2025

hppritcha force-pushed the add_some_unsigned_safe_code branch from 1861ca1 to 5686600 Compare June 3, 2025 19:56

hppritcha requested a review from devreal June 3, 2025 20:24

devreal previously approved these changes Jun 5, 2025

hppritcha dismissed devreal’s stale review via 9b2ff4a June 9, 2025 14:04

hppritcha force-pushed the add_some_unsigned_safe_code branch from 5686600 to 9b2ff4a Compare June 9, 2025 14:04

hppritcha requested a review from devreal June 9, 2025 14:05

devreal approved these changes Jun 9, 2025

hppritcha force-pushed the add_some_unsigned_safe_code branch from 9b2ff4a to 0f34c42 Compare June 10, 2025 18:32

hppritcha merged commit c859bfe into open-mpi:main Jun 10, 2025
15 checks passed
