Release Notes

Added support for MultiIndex column and index validation
DataFrameSchema can validate head, tail, or a random sample of dataframe
Checks and Hypothesis checks now support dataframe-level (wide) data validation

10 Jun 13:56

cosmicBboy

v0.1.4

988b14b

v0.1.4

Added testing support for Python 3.7 and improved the documentation

10 Jun 03:29

cosmicBboy

v0.1.3

6cfa2a7

Pandera v0.1.3

This release adds a few nifty features to pandera, special thanks to @mastersplinter and @ralbertazzi:

We now have official documentation! Thanks to @mastersplinter on the work here.
The Check class now has a groupby argument, which lets the user assert properties on subsets of the Column of interest. This makes it possible to compare the values, or aggregates of values, of subsets of a column #42.
Hypothesis tests are introduced through the Hypothesis class, which is a subclass of the Check class. This lets the user run hypothesis tests on their dataframe as part of a DataFrameSchema definition. Refer to the documentation for more info #43.
Columns now have a required argument (default = True), where required=False means that the column is optional #23.
SeriesSchemaBase now has an allow_duplicates argument (default = True) #24.
Added informative errors to the check_input and check_output decorators 902f199.
DataFrameSchema(..., strict=True) means that all columns in the dataframe must be specified in the schema columns #34.
Improved error messaging in general.
Improved CI (code coverage).

29 Dec 19:35

cosmicBboy

0.1.1

6421d85

Improve error reporting, add coerce option

This release adds two new features to pandera.

Improved error reporting

Now failure cases in column checks are displayed in a much more compact format,
where the failure cases, the index of the dataframe where those failures occur, and the
count of failure cases are shown to the user, e.g.

# failure cases:
#              index  count
# failure_case
# foo1           [0]      1
# foo2           [1]      1
# foo3           [2]      1

Coerce option in `DataFrameSchema` and `Column`

Now the user can coerce the dataframe when calling schema.validate so that
the columns are cast into the expected data-type before performing Checks.

16 Dec 04:05

cosmicBboy

0.1.0

dcb0c9c

New DataFrameSchema API

Release Notes

Major change: This release changes the API of the DataFrameSchema object.
Instead of passing a list of Columns, you now pass a dictionary where the keys are column names and the
values are Column objects. This makes the API feel a lot more familiar for pandas users, who may
often define DataFrames in a similar way (see README for details).
Renamed Validator to Check for brevity and clarity (accordingly renamed validator_{input, output}
to check_{input, output}).
Created convenience variables for PandasDtype so they can be accessed in the pandera namespace:
Bool, DateTime, Category, Float, Int, Object, String, Timedelta