Major features and improvements
- Added Data Set transformer support in the form of AbstractTransformer and DataCatalog.add_transformer.
Breaking changes to the API
- Merged the ExistsMixin into AbstractDataSet.
- Pipeline.node_dependencies returns a dictionary keyed by node, with sets of parent nodes as values; Pipeline and ParallelRunner were refactored to make use of this for topological sort for node dependency resolution and running pipelines respectively.
- Pipeline.grouped_nodes returns a list of sets, rather than a list of lists.
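
To illustrate how a dictionary of node-to-parents sets can drive a topological sort that yields a list of sets, here is a minimal standalone sketch. It is not Kedro's implementation: the function name group_by_dependencies and the string node names are hypothetical, and it assumes every node appears as a key in the dictionary.

```python
from typing import Dict, Hashable, List, Set


def group_by_dependencies(
    dependencies: Dict[Hashable, Set[Hashable]]
) -> List[Set[Hashable]]:
    """Group nodes into sets that can run once all earlier sets are done.

    Hypothetical helper for illustration only. `dependencies` maps each
    node to the set of its parent nodes; every node must be a key.
    """
    # Copy the parent sets so the caller's dictionary is not mutated.
    remaining = {node: set(parents) for node, parents in dependencies.items()}
    groups: List[Set[Hashable]] = []

    while remaining:
        # Nodes with no unresolved parents are ready to run.
        ready = {node for node, parents in remaining.items() if not parents}
        if not ready:
            raise ValueError("Circular dependencies detected")
        groups.append(ready)
        # Drop the ready nodes and remove them from the other parent sets.
        remaining = {
            node: parents - ready
            for node, parents in remaining.items()
            if node not in ready
        }

    return groups


# Example with plain strings standing in for pipeline nodes.
example_dependencies = {
    "split": set(),
    "clean": set(),
    "train": {"split"},
    "evaluate": {"train", "split"},
    "report": {"evaluate"},
}
example_groups = group_by_dependencies(example_dependencies)
```

Each set in the result holds nodes with no ordering constraint among themselves, which is why a set is a more natural container than a list here; a parallel runner could, under these assumptions, schedule the members of one set concurrently.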
Thanks for supporting contributions
Darren Gallagher, Zain Patel