ENH: add pandera #472

Merged
merged 1 commit into from
Mar 5, 2025

dschmitz89
Contributor

@dschmitz89 dschmitz89 commented Mar 4, 2025

This adds pandera, a popular data validation library for dataframe libraries (pandas, polars, ...). pandera is roughly the dataframe analogue of pydantic and is very useful for data pipelines. Thanks in advance.

If I understand the README correctly, the layers would automatically become available on the second of the month after merging?

Owner

@keithrozario keithrozario left a comment


LGTM

@keithrozario keithrozario merged commit 4153623 into keithrozario:master Mar 5, 2025
3 checks passed
@keithrozario
Owner

I merged them. It should build almost immediately, usually less than 10 minutes after I merge. If you don't see it, there's usually an issue with the build.

I'm out right now; if you don't see the layer, just @ me here and I'll see what I can troubleshoot.

Sometimes the layer becomes too large and can't be published.

@dschmitz89
Contributor Author

This was quick, thank you so much! I checked for the ARN at https://api.klayers.cloud/api/v2/p3.11/layers/latest/eu-central-1/csv, as that is what I understood from the README. pandera is not listed there yet; am I missing something, or did the build go wrong? pandera might also try to pull pandas into the layer automatically, which is quite heavy and might cause the build to fail.
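
The check described in this comment can be scripted. The following is a minimal sketch, not part of the Klayers project: it assumes only that the CSV contains layer ARNs of the same shape as the one posted in this thread (`...:layer:Klayers-p312-pandera:1`) somewhere in its fields. The column layout is not assumed, and the helper names `find_layer_arns` and `fetch_csv` are hypothetical.

```python
import csv
import io
import urllib.request

# URL quoted in this thread; the CSV column layout is NOT assumed here.
KLAYERS_CSV_URL = (
    "https://api.klayers.cloud/api/v2/p3.11/layers/latest/eu-central-1/csv"
)


def find_layer_arns(csv_text: str, package: str) -> list[str]:
    """Return every ARN-looking field whose layer name ends with '-<package>'."""
    suffix = "-" + package.lower()
    matches = []
    for row in csv.reader(io.StringIO(csv_text)):
        for field in row:
            value = field.strip()
            if not value.startswith("arn:aws:lambda:"):
                continue
            # Layer version ARN: arn:aws:lambda:<region>:<account>:layer:<name>:<version>
            parts = value.split(":")
            if (
                len(parts) == 8
                and parts[5] == "layer"
                and parts[6].lower().endswith(suffix)
            ):
                matches.append(value)
    return matches


def fetch_csv(url: str = KLAYERS_CSV_URL, timeout: float = 10.0) -> str:
    """Download the layer listing as text (hypothetical helper, needs network)."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")
```

Calling `find_layer_arns(fetch_csv(), "pandera")` returns an empty list when no matching layer has been published, which is the situation described here. Scanning every field rather than a named column keeps the sketch independent of the actual CSV header.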

@keithrozario
Owner

There was a problem with Pandera :(

#473

I manually tweaked it for Python3.12, and you should have a layer for that.

arn:aws:lambda::770693421928:layer:Klayers-p312-pandera:1

@dschmitz89
Contributor Author

We are actually even more interested in the 3.11 layer, thanks though! But don't worry, take your time. That license text issue looks like something related to escape characters?

@keithrozario
Owner

Yeah ... will have to investigate.

In the meantime, I've manually fixed this for 3.11 as well.


2 participants