Skip to content
View pjbull's full-sized avatar
πŸ₯¦
πŸ₯¦

Organizations

@drivendataorg

Block or report pjbull

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
pjbull/README.md

Hello, friends πŸ‘‹

I'm Peter. Nice to meet ya.

πŸ“ˆ Data science + machine learning πŸ“Š

I largely help social sector organizations get their data into a shape where machine learning can be valuable. Much of this work ends up on drivendata.org, where you can join a competition to help these organizations, learn from interesting data, try new methods, and make friends that care about impact. Here are some cool recent ones:

Competitions are great, but not every problem is a good fit, so our team of data scientists and software engineers also works with organizations directly to analyze data, build data systems, setup pipelines, train machine learning models, and design and deploy solutions. Check out DrivenData Labs to learn more. There I write case studies, publish on our blog, and maintain our open source work.

✨ Open source πŸ“¦

You can find me working on open source projects that are tools for data scientists and engineers using Python. I particularly care about reproducible data science and machine learning and AI ethics.

See below for the projects I regularly contribute to!

Pinned Loading

  1. drivendataorg/cookiecutter-data-science drivendataorg/cookiecutter-data-science Public

    A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.

    Python 8.4k 2.5k

  2. drivendataorg/deon drivendataorg/deon Public

    A command line tool to easily add an ethics checklist to your data science projects.

    Python 290 52

  3. drivendataorg/cloudpathlib drivendataorg/cloudpathlib Public

    Python pathlib-style classes for cloud storage services such as Amazon S3, Azure Blob Storage, and Google Cloud Storage.

    Python 483 63

  4. drivendataorg/nbautoexport drivendataorg/nbautoexport Public

    Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save.

    Python 74 9

  5. drivendataorg/zamba drivendataorg/zamba Public

    A Python package for identifying 42 kinds of animals, training custom models, and estimating distance from camera trap videos

    Python 119 27

  6. drivendataorg/pandas-path drivendataorg/pandas-path Public

    Use pathlib syntax to easily work with Pandas series containing file paths.

    Python 69 4