This repo contains the data analysis and subsequent reports done by the data science team on 350 Seattle's ⚡Electrify Seattle⚡ campaign during Winter-Spring 2023.
- data: contains raw and intermediate data.
- experiments: notebooks for initial exploration.
- utils: contains extracted preprocessing, plotting, and other commonly-used functions. To have access to all of these utility functions, run
pip install .
in root.
- Github repo: stores experiments and experiment results, as well as code used to update/clean data.
- Big Query: stores data in a SQL database. Reachable by API.
- Google Sheets: displays current Big Query data in a spreadsheet, pulled in automatically. This is available for organizers who want to see the data but are not comfortable with SQL.
- Sign up for a Google Cloud account
- Ask Isaac to add you as an editor to the 350 Seattle project in Google Cloud.
- Install the Pandas Big Query SDK, which allows you to access Big Query directly from Pandas. You will need to use Python3 and pip3 for this library. Note: this is not the same as the Big Query Python API.
- To authenticate, you have two choices:
- Use Google Cloud authorization already cached on your machine
- The first time you run a query with the library, you'll be prompted to log in on a pop up window
- You should now be able to run the code in the API example successfully.
- If you see an error about the
tdqm
library, runpip install tdqm
and restart your iPython kernel.
- If you see an error about the
- Use the Pandas Big Query library to read and write data to the Source of Truth dataset in Big Query. Be sure to log all changes in our change log. Any changes you make will be visible in the Google Sheets display of the data.