You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I run kedro viz and as soon as I press onto one of the datasets persisted in memory, the application fails with the TypeError: 'float' object cannot be interpreted as an integer
Context
This issue makes all the consecutive datasets not available for a preview. Apart from that, there is no direct way to understand which exact column of the dataset persisted as parquet doesn't fit.
Steps to Reproduce
Create a pipeline of one node producing a dataframe. Dataframe may contain questionable pureness of the columns but definitely serialisable to parquet. kedro run pipeline doesn't throw any errors and the dataset actually exists in the data folder and easily readable using catalog.load
So I'm not sure if this is the same issue, but I remember once that infinity is a valid float in python but not JSON, it's possible something related is going on
@datajoely thank you very much for your answer! I've investigated the dataset I had and actually found that the issue happens with the timestamp of type dtype='datetime64[ns]' containing 'NaT' values.
Here is the minimal dataframe which reproduces the error on my side:
@datajoely, yes. Kedro pipeline does this without any changes. I've also tried to save such a dataframe using pyre pandas from the example above - both work. I haven't specified any configuration for the parquet, but I have fastparquet and pyarrow installed in the venv. So should use pyarrow with snappy compression by default, but I haven't checked into the depth if this may cause any issues
Description
I run kedro viz and as soon as I press onto one of the datasets persisted in memory, the application fails with the
TypeError: 'float' object cannot be interpreted as an integer
Context
This issue makes all the consecutive datasets not available for a preview. Apart from that, there is no direct way to understand which exact column of the dataset persisted as parquet doesn't fit.
Steps to Reproduce
catalog.load
Expected Result
Preview is available
Actual Result
Long error starting with
and ending with
Your Environment
Include as many relevant details as possible about the environment you experienced the bug in:
Checklist
The text was updated successfully, but these errors were encountered: