refactor[cartesian]: Dace backend: expose control flow #1894

romanc · 2025-03-03T09:54:39Z

Description

This PR refactors the GT4Py/DaCe bridge to expose control flow elements (if statements and while loops) to DaCe. Previously, the whole contents of a vertical loop was put in one big Tasklet. With this PR, that Tasklet is broken apart in case control flow is found such that control flow is visible in the SDFG. This allows DaCe to better analyze code and will be crucial in future (within the current milestone) performance optimization work.

The main ideas in this PR are the following

Introduce oir.CodeBlock to recursively break down oir.HorizontalExecutions into smaller pieces that are either code flow or evaluated in (smaller) Tasklets.
Introduce dcir.Conditionand dcir.WhileLoop to represent if statements and while loops that are translated into SDFG states. We keep the current dcir.MaskStmt / dcir.While for if statements / while loops inside horizontal regions, which aren't yet exposed to DaCe (see cartesian: expose HorizontalRegions to DaCe #1900).
Add support for if statements and while loops in the state machine of sdfg_builder.py
We are breaking up vertical loops inside stencils in multiple Tasklets. It might thus happen that we write a "local" scalar in one Tasklet and read it in another Tasklet (downstream). We thus create output connectors for all scalar writes in a Tasklet and input connectors for all reads (unless previously written in the same Tasklet).
Memlets can't be generated per horizontal execution anymore and need to be more fine grained. TaskletAccessInfoCollector does this work for us, duplicating some logic in AccessInfoCollector. A refactor task has been logged to fix/re-evaluate this later.

This PR depends on the following (downstream) DaCe fixes

which have been merged by now.

Follow-up issues

Related issue: GEOS-ESM/NDSL#53

Requirements

All fixes and/or new features come with corresponding tests.
Added new tests and increased coverage of horizontal regions with PRs test[cartesian]: Increased coverage for horizontal regions #1807 and tests[cartesian]: Increase horizontal region test coverage #1851.
Important design decisions have been documented in the appropriate ADR inside the docs/development/ADRs/ folder.
Docs are in our knowledge base for now. Will be ported.

## Description In preparation for PR #1894, pull out some refactors and cleanups. Notable in this PR are the changes to `src/gt4py/cartesian/gtc/dace/oir_to_dace.py` - visit `stencil.vertical_loops` directly instead of calling `generic_visit` (simplification since there's nothing else to visit) - rename library nodes from `f"{sdfg_name}_computation_{id(node)}"` to `f"{sdfg_name}_vloop_{counter}_{node.loop_order}_{id(node)}"`. This adds a bit more information (because `sdfg_name` is the same for all library nodes) and thus simplifies debugging workflows. Related issue: GEOS-ESM/NDSL#53 ## Requirements - [x] All fixes and/or new features come with corresponding tests. covered by existing test suite - [ ] Important design decisions have been documented in the appropriate ADR inside the [docs/development/ADRs/](docs/development/ADRs/Index.md) folder. N/A --------- Co-authored-by: Roman Cattaneo <1116746+romanc@users.noreply.github.com>

We don't need to replace visit_MaskStmt and visit_While because we won't have `if` statements or `while` loops in tasklet code anymore. These constructs are represented directly in DaCe (allowing DaCe to do more of its magic).

This will be used in the gt4py/dace bridge only (for now). We'll have to move this concept up and have all the backends work with it.

Add support for adding conditions and while loops to the `SDFGContext`, a helper class managing state transitions when building the SDFG from the dace IR.

Add support for generating sdfg nodes from dcir.Condition and dcir.WhileLoop nodes.

Some of them might be fixed "the wrong way" ...

we should fix this at another level in the future

This seems to fix the variable shadowing problem in the newer version (without all the nexted sdfgs). This also seems to add support for variables declared inside if/else statements.

This still seems to fail consistently for while loops because they try to connect to the (duplicated) entry map. No clue where this is comming from (again now ...).

While loops can contain other while loops. This was forgotten and tests complained about it.

not needed anymore since we now properly separate read and write memlets not working either. needs fixes to how we setup the "conditional evaluation tasklet", i.e. we don't go through the "normal" ComputeState transformations and thus lack the updated node-ctx information. This is why it tries to connect to outer maps.

This still ends in a segfault - for whatever reason ... This seems to work now ... let's just see in ndsl.

if input and output connectors are called the same, e.g. in code like ```python level = 1 condition = True while condition: condition = level < 10 level = level + 1 ``` which translates to ```none Tasklet1: level = 1 condtion = True While condition: Tasklet2: condition = level < 10 level = level + 1 ``` then the sdfg complains because DaCe can't handle reading level and writing to it in the same tasklet. This creates and input connector for level and an output connector for level, which is ambiguous.

the ideas this time is to have different connector names and the "same name" outside in the things that are passed around.

FlorianDeconinck

LGTM - will require a re-read with the documentation you are writing for a go-ahead

FlorianDeconinck · 2025-03-10T13:46:48Z

src/gt4py/cartesian/gtc/dace/daceir.py

@@ -730,7 +727,8 @@ class Literal(common.Literal, Expr):


 class ScalarAccess(common.ScalarAccess, Expr):
-    pass
+    is_target: bool
+    original_name: Optional[str] = None


Out of scope for the PR: this kinds of plead for more information on the access. Especially how and where they apply. Of course IR are suppose to be atomic information, but I wonder if a type defining the context of the access: temporary, parameter...

src/gt4py/cartesian/gtc/dace/expansion/daceir_builder.py

- rename constants -> prefix - add dace's passthrough prefixes - update import style

romanc · 2025-03-11T17:06:02Z

Two things for reviewers

There's technical docs available in https://geos-esm.github.io/SMT-Nebulae/technical/backend/dace-bridge/. Feedback welcome if you have any.
I pushed a small PR today to re-organize all the prefix strings that we use in the bridge.

philip-paul-mueller

There are some general comment or suggestions.
But I have some suggestions regarding DaCe, especially the usage of newer API.

src/gt4py/cartesian/gtc/dace/daceir.py

src/gt4py/cartesian/gtc/dace/expansion/daceir_builder.py

src/gt4py/cartesian/gtc/dace/daceir.py

src/gt4py/cartesian/gtc/dace/expansion/daceir_builder.py

src/gt4py/cartesian/gtc/dace/utils.py

src/gt4py/cartesian/gtc/dace/expansion/sdfg_builder.py

edopao

My 2 cents review.

edopao · 2025-03-10T17:12:42Z

src/gt4py/cartesian/gtc/dace/expansion/sdfg_builder.py

+            assert isinstance(node.condition.stmts[0].left, dcir.ScalarAccess)
+            condition_name = node.condition.stmts[0].left.original_name
+
+            after_state = self.sdfg.add_state("while_after")


Just an idea, but please do as you prefer. Already on dace 1.0 it is possible to instantiate LoopRegion constructs, although it is not possible to apply SDFG transformations nor codegen from it (you need dace from main branch, for that). However, after the SDFG is built, it is possible to call dace.sdfg.utils.inline_loop_blocks(sdfg) that will turn the LoopRegion nodes into the equivalent state machines. In this way, you can prepare the SDFG for the upgrade to dace main.

Thanks, Eduardo. That is valuable feedback. I left a note in #1898 such that we don't forget it in the future.

edopao · 2025-03-10T17:41:04Z

src/gt4py/cartesian/gtc/dace/expansion/daceir_builder.py

+        inside_horizontal_region: bool = False,
+        **kwargs: Any,
+    ) -> dcir.MaskStmt | dcir.Condition:
+        if inside_horizontal_region:


This is just a question, not a comment. I do not know this code base, I've only worked on gt4py-next. I've learned from my mistakes that lowering if-statements inside tasklets does not ensure exclusive branch execution: all inputs of a tasklet are evaluated, no matter the value of the if-condition inside the tasklet node. Randomly, this can result in correct code or not.
I guess the gt4py user code mostly uses dynamic K-offsets, which limits the vertical domain of intermediate fields. On the horizontal region, fields are always defined on the full domain. Is this the motivation behind treating the vertical region differently?
If what I wrote so far makes sense, my question is the following. Is it theoretically possible to write cartesian programs that use horizontal dynamic offsets, and would also require dcir.Condition in the horizontal region?

I've learned from my mistakes that lowering if-statements inside tasklets does not ensure exclusive branch execution: all inputs of a tasklet are evaluated, no matter the value of the if-condition inside the tasklet node. Randomly, this can result in correct code or not.

I don't quite follow. Let's assume we have the following stencil

def small_conditional(in_field: FloatField, out_field: FloatField): with computation(PARALLEL), interval(...): if in_field < 4: tmp = in_field else: tmp = in_field + 1

Are you saying both branches, if and else, are evaluated? That is not what we are observing and looking at the generated code, this translates to real if statements in cpp code that work as I would expect them to. Can you elaborate on this?

Background: Before this PR, gt4py-cartesian would would have all conditionals inside Tasklets. I'm surprised to learn that this might pose a problem.

I guess the gt4py user code mostly uses dynamic K-offsets, which limits the vertical domain of intermediate fields. On the horizontal region, fields are always defined on the full domain. Is this the motivation behind treating the vertical region differently?

@FlorianDeconinck / @twicki can you make sense of that? I'm just reading words here without understanding ...

If what I wrote so far makes sense, my question is the following. Is it theoretically possible to write cartesian programs that use horizontal dynamic offsets, and would also require dcir.Condition in the horizontal region?

What I can say is that it is not possible to have dcir.Conditions (or dcir.WhileLoops) inside horizontal regions. There might be dcir.MaskStmts (or dcir.While loops) inside horizontal regions. In that case, we won't translate them to dcir.Conditions / dcir.WhileLoops and codegen if/while statements inside the Tasklet (without exposing them to DaCe) just as we'd do before this PR.

No, only one of the if-banchesis is executed. What I meant is the following, consider:

def small_conditional(in_field1: FloatField, in_field2: FloatField, k: IntField): with computation(PARALLEL), interval(...): if k > 10: tmp = in_field1 else: tmp = in_field2

Both inputs to the tasklet (in_field1 and in_field2) have to be defined for all k values [0:N]. That because the two dataflows that compute in_field1 and in_field2 are executed before evaluating the if-expression.

However, this case maybe is handled differently in cartesian.

K-domain validation will indeed take care of that.

romanc · 2025-03-13T16:07:00Z

Thanks for all the feedback. I have addressed and/or answered all review comments. Please have a second look if you have strong opinions.

@FlorianDeconinck you wanted to do a re-read once the docs are done. The docs are here.

This reverts commit d3a1211.

FlorianDeconinck · 2025-03-14T16:26:12Z

@edopao / @philip-paul-mueller : Thank you for engaging here despite this part of the code being further away than your current focus. Very much appreciated.

FlorianDeconinck

LGTM - one ticket to log

FlorianDeconinck · 2025-03-17T12:36:07Z

src/gt4py/cartesian/gtc/dace/daceir.py

@@ -730,7 +727,8 @@ class Literal(common.Literal, Expr):


 class ScalarAccess(common.ScalarAccess, Expr):
-    pass
+    is_target: bool
+    original_name: Optional[str] = None


romanc mentioned this pull request Mar 3, 2025

refactor[cartesian]: gt4py/dace bridge cleanup #1895

Merged

2 tasks

romanc force-pushed the romanc/bridge-explicit-indexing-with-linting branch from 31f1890 to 947c1a1 Compare March 4, 2025 07:55

This was referenced Mar 4, 2025

cartesian: refactor oportunities after dace bridge refactor #1898

Open

cartesian: expose HorizontalRegions to DaCe #1900

Open

Roman Cattaneo added 25 commits March 4, 2025 17:22

WIP: Updated daceir with Condition & WhileLoop

7a9b083

WIP: tasklet_codegen typehints & cleanups

885537d

We don't need to replace visit_MaskStmt and visit_While because we won't have `if` statements or `while` loops in tasklet code anymore. These constructs are represented directly in DaCe (allowing DaCe to do more of its magic).

Add CodeBlock to OIR-level

8de07f3

This will be used in the gt4py/dace bridge only (for now). We'll have to move this concept up and have all the backends work with it.

WIP: Update daceir to avoid unecessary nested SDFG

d09a11e

sdfg context: add support for condition and while

79dab2f

Add support for adding conditions and while loops to the `SDFGContext`, a helper class managing state transitions when building the SDFG from the dace IR.

sdfg builder: add visit_{Condition, WhileLoop}

c303d36

Add support for generating sdfg nodes from dcir.Condition and dcir.WhileLoop nodes.

WIP: Fix the obivious issues

c8efbd3

Some of them might be fixed "the wrong way" ...

WIP: fix node_ctx issue in nested sdfgs

1780f9f

we should fix this at another level in the future

WIP: ... aaaand we are back to variable shadowing

24c4a53

WIP: we don't (shouldn't) have local scalars

5c86ad4

Fixed variable shadowing in the newer version

a06c2d7

This seems to fix the variable shadowing problem in the newer version (without all the nexted sdfgs). This also seems to add support for variables declared inside if/else statements.

Fix typing issue highlighted by tests

1ac6fc4

WIP: Move condition evaluation tasklet to daceir builder

9883755

This still seems to fail consistently for while loops because they try to connect to the (duplicated) entry map. No clue where this is comming from (again now ...).

Fix issue with extra map entry nodes

759a742

Fix type issue raised in gt4py tests

74c83ae

While loops can contain other while loops. This was forgotten and tests complained about it.

No Tasklet without ComputationState in DomainMap

484c23b

WIP: separate node context for eval Tasklet

33527be

This still ends in a segfault - for whatever reason ... This seems to work now ... let's just see in ndsl.

This seems to fix the if-only k-offset write

51df134

WIP: working on export / import of local scalars

3e8afa1

the ideas this time is to have different connector names and the "same name" outside in the things that are passed around.

This seems to fix the column_physics_conditional \o/

5746e31

This seems to fix the read after write issue

f0b2948

visit node.data_index for oir.FieldAccess nodes

feea69f

For testing: let's see if this fixes k offset write

12a1247

romanc added 2 commits March 7, 2025 18:25

WIP: More DaCe backend tests

1d216f7

Add tests for daceir and sdfg builder

d25ccca

romanc requested a review from FlorianDeconinck March 10, 2025 10:23

FlorianDeconinck reviewed Mar 10, 2025

View reviewed changes

havogt requested a review from philip-paul-mueller March 10, 2025 15:02

reorganize all the prefix strings

98296e6

- rename constants -> prefix - add dace's passthrough prefixes - update import style

philip-paul-mueller reviewed Mar 12, 2025

View reviewed changes

edopao reviewed Mar 12, 2025

View reviewed changes

romanc added 2 commits March 12, 2025 17:27

Review: use elif to reduce indent level

1c6721e

Review: rename Condition.{true,false}_states

83cb69d

romanc mentioned this pull request Mar 13, 2025

[build] Pending enhancements to development infrastructure #1829

Open

15 tasks

romanc added 10 commits March 13, 2025 09:50

Review: leverage eve's built-in filter function

679184c

Review: removed wrong return type, added doc string

3189955

Review: make _global_grid_subset() a regular class method

d13799a

Review: simplify code

2e48c20

Review: remove extra space in test

9a39394

Review: add comment to clarify usage of is_write

bc9cd41

Review: Move comment to docstring

8534c7c

Review: list() -> []

69b7a36

Review: remove intermediate list

617290d

Review: simplify defined_symbol in sdfg_builder

5ed4568

romanc added 3 commits March 13, 2025 17:53

Review: pull names out of mapped_access_iterator

d3a1211

Revert "Review: pull names out of mapped_access_iterator"

e8b229f

This reverts commit d3a1211.

Review: pull names out of mapped_access_iterator (2)

c928df6

FlorianDeconinck approved these changes Mar 17, 2025

View reviewed changes

romanc merged commit e6b9398 into GridTools:main Mar 18, 2025
25 checks passed

romanc mentioned this pull request Mar 24, 2025

Refactor of GT4Py-DaCe bridge to expose all control-flow to Dace GEOS-ESM/NDSL#53

Closed

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor[cartesian]: Dace backend: expose control flow #1894

refactor[cartesian]: Dace backend: expose control flow #1894

romanc commented Mar 3, 2025 •

edited

Loading

FlorianDeconinck left a comment

FlorianDeconinck Mar 10, 2025

romanc commented Mar 11, 2025

philip-paul-mueller left a comment

edopao left a comment

edopao Mar 10, 2025

romanc Mar 13, 2025

edopao Mar 10, 2025

romanc Mar 13, 2025 •

edited

Loading

edopao Mar 14, 2025

FlorianDeconinck Mar 14, 2025

romanc commented Mar 13, 2025

FlorianDeconinck commented Mar 14, 2025

FlorianDeconinck left a comment

FlorianDeconinck Mar 17, 2025

refactor[cartesian]: Dace backend: expose control flow #1894

refactor[cartesian]: Dace backend: expose control flow #1894

Conversation

romanc commented Mar 3, 2025 • edited Loading

Description

Requirements

FlorianDeconinck left a comment

Choose a reason for hiding this comment

FlorianDeconinck Mar 10, 2025

Choose a reason for hiding this comment

romanc commented Mar 11, 2025

philip-paul-mueller left a comment

Choose a reason for hiding this comment

edopao left a comment

Choose a reason for hiding this comment

edopao Mar 10, 2025

Choose a reason for hiding this comment

romanc Mar 13, 2025

Choose a reason for hiding this comment

edopao Mar 10, 2025

Choose a reason for hiding this comment

romanc Mar 13, 2025 • edited Loading

Choose a reason for hiding this comment

edopao Mar 14, 2025

Choose a reason for hiding this comment

FlorianDeconinck Mar 14, 2025

Choose a reason for hiding this comment

romanc commented Mar 13, 2025

FlorianDeconinck commented Mar 14, 2025

FlorianDeconinck left a comment

Choose a reason for hiding this comment

FlorianDeconinck Mar 17, 2025

Choose a reason for hiding this comment

romanc commented Mar 3, 2025 •

edited

Loading

romanc Mar 13, 2025 •

edited

Loading