This Public Utility Data Liberation (PUDL) toolkit depends on a variety of
other open source tools and public data. We use the
Anaconda Python 3 distribution, and
manage package dependencies using conda environments. See the PUDL
environment.yml
file at the top level of the repository
for the most up to date list of required Python packages and their versions.
PUDL is currently designed to populate a local Postgresql relational database, so you'll need to install the database server software:
- Postgres, version 9.6 or later.
All that software isn't any good without some data! The raw data comes from the US government. The update_datastore.py script efficiently downloads and organizes this data for you, so that PUDL knows where to find it. Currently the data PUDL can ingest includes: