- Washington, D.C.
Stars
Vizro is a low-code toolkit for building high-quality data visualization apps.
Python Cookbook, 3rd Edition (translation)
First-party plugins maintained by the Kedro team.
Source code accompanying O'Reilly book: Machine Learning Design Patterns
Kedro gRPC Server is a Kedro plugin that creates a server for triggering and monitoring pipeline runs using gRPC, a general-purpose RPC framework.
A Singer.io tap for extracting data from the JIRA API
Python app that uses the Jira API for aggregate and complex tasks and reports
Atlassian Python REST API wrapper
Python library to interface with Gerrit's REST API
Extract Gerrit data to build up a database for analysis
Examples of data science projects created with Kedro.
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable,…
An open source Python library for automated feature engineering
Create Sankey diagrams with Matplotlib
Create HTML profiling reports from Apache Spark DataFrames
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
A two-page cheatsheet for reStructuredText
Utils for streaming large files (S3, HDFS, gzip, bz2...)
Repo to migrate the old wiki to, especially for developers and code examples
Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors
💡 News, full-text, and article metadata extraction in Python 3
Collection of examples and notes on Hadoop and MapReduce
A project for NYU course CSCI-GA.3033-001: Realtime & Big Data Analytics
A single file that links up all the local geographies in LA County
A list of awesome interactive journalism projects.
An Atom package with D3v5 snippets. Accelerate your graphics!