Skip to content

linnealovespie/BPS

Repository files navigation

Electrify Seattle: Building Performance Data Analysis

This repo contains the data analysis and subsequent reports done by the data science team on 350 Seattle's ⚡Electrify Seattle⚡ campaign during Winter-Spring 2023.

Repo Structure:

  • data: contains raw and intermediate data.
  • experiments: notebooks for initial exploration.
  • utils: contains extracted preprocessing, plotting, and other commonly-used functions. To have access to all of these utility functions, run pip install . in root.

Data Pipeline Structure

  • Github repo: stores experiments and experiment results, as well as code used to update/clean data.
  • Big Query: stores data in a SQL database. Reachable by API.
  • Google Sheets: displays current Big Query data in a spreadsheet, pulled in automatically. This is available for organizers who want to see the data but are not comfortable with SQL.

Getting Up and Running With the Pipeline

  • Sign up for a Google Cloud account
  • Ask Isaac to add you as an editor to the 350 Seattle project in Google Cloud.
  • Install the Pandas Big Query SDK, which allows you to access Big Query directly from Pandas. You will need to use Python3 and pip3 for this library. Note: this is not the same as the Big Query Python API.
  • To authenticate, you have two choices:
    • Use Google Cloud authorization already cached on your machine
    • The first time you run a query with the library, you'll be prompted to log in on a pop up window
  • You should now be able to run the code in the API example successfully.
    • If you see an error about the tdqm library, run pip install tdqm and restart your iPython kernel.
  • Use the Pandas Big Query library to read and write data to the Source of Truth dataset in Big Query. Be sure to log all changes in our change log. Any changes you make will be visible in the Google Sheets display of the data.

About

Analyze building performance data in Seattle.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published