What's the issue?
When fetching upstream data versions (for composing the output data version), the data versions of all consumed upstream partitions are hashed together into one combined value. This makes sense when Downstream Bar depends on more than one partition of Upstream Foo, because the group of partitions has to be represented by a single data version.
However, a pretty standard use case is Foo and Bar sharing the same partitions definition with 1:1 partition dependencies. In that case the hash is still applied, so the input data version recorded on Bar does not match the data version shown on Foo. Tracing lineage would be much easier if the two values matched.
You can get this easy-trace behavior by adding these two lines
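As a rough illustration of the idea only (this is not Dagster's code and not the two-line patch referred to above), the proposed behavior amounts to a single-partition shortcut in front of the hashing step. The function name `combine_upstream_versions` and the use of SHA-256 are assumptions made for this sketch.

```python
import hashlib
from typing import Sequence


def combine_upstream_versions(versions: Sequence[str]) -> str:
    """Hypothetical helper: collapse upstream partition data versions into one.

    Not a Dagster API. It only mirrors the behavior proposed in this issue.
    """
    # Proposed shortcut: with exactly one upstream partition, pass its data
    # version through unchanged so it can be matched against the upstream asset.
    if len(versions) == 1:
        return versions[0]

    # Otherwise, represent the group with a single hash. Sorting makes the
    # result independent of the order in which partitions are listed.
    digest = hashlib.sha256()
    for version in sorted(versions):
        digest.update(version.encode("utf-8"))
        digest.update(b"\0")  # separator so "ab" + "c" differs from "a" + "bc"
    return digest.hexdigest()


# 1:1 partition dependency: Bar records exactly the version Foo reported.
single = combine_upstream_versions(["foo_version_for_2024-01-01"])

# Many-to-one dependency: the group is still summarized by one hashed value.
combined = combine_upstream_versions(["foo_version_a", "foo_version_b"])
```

With the shortcut in place, the value stored for a 1:1 dependency can be compared directly against the upstream partition's data version, while multi-partition dependencies keep the existing hashed behavior.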
What did you expect to happen?
When looking at input_data_version/foo in the Bar materialization information, the version is the same as the data version listed in Foo for the consumed partition.
How to reproduce?
No response
Dagster version
1.9.2
Deployment type
Local
Deployment details
No response
Additional information
I checked the original PR #14265 by @smackesey and didn't see any specific reason why single partitions also must be hashed.
I thought about submitting this as a PR, but this is a change to a private function and I couldn't find the best place to write the tests. I'd be happy to give it a go if you tell me the best place to put them.