Transform vceregen renewable generation profiles (#3898)
* Add source metadata for vceregen
* Add profiles to vceregen dataset name
* Remove blank line in description
* Add blank Data Source template for vceregen
* Add links to download docs section
* Add availability section
* Add respondents section
* Add original data section
* Stash WIP of extraction
* Extract VCE tables to raw dask dfs
* Clean up warnings and restore EIA 176
* Revert to pandas concatenation
* Add latlonfips
* Add blank transform module for vceregen
* Fill out the basic vceregen transforms
* Add underscores back to function names
* Update time col calculation
* Update docstrings and comments to reflect new time cols
* Change merge to concat
* Remove dask, coerce dtypes on read-in
* override load_column_maps behavior
* Update addition of county and state name fields
* Add vceregen to init files and metadata so that it will run on dagster, update column names, and add new fields
* Add resource metadata for vcregen
* Clean county strings more
* Add release notes
* Add function to validate state_county_names and improve performance of add_time_cols function
* make for loops into dict comp, update loggers, and improve regex
* Add asset checks and remove inline checks
* Change hour_utc to datetime_utc
* Remove incorrect docstring
* Update dataset and field metadata
* Rename county col to county_or_subregion
* Update data_source docs page
* change axis=1 to axis=columns
* Update DOI to sandbox and temporarily xfail DOI test
* Change county_or_subregion to county_or_lake_name
* Change county_or_subregion to county_or_lake_name
* Update docs to explain solar cap fac
* Update regen to rare
* [pre-commit.ci] auto fixes from pre-commit.com hooks
  For more information, see https://pre-commit.ci
* Update gsutil in zenodo-cache-sync
* Rename vceregen to vcerare
* Add back user project
* Update project path
* Update project to billing project
* Update dockerfile to replace gsutil with gcloud storage
* Update docs/release_notes.rst
  Co-authored-by: E. Belfer <[email protected]>
* Update docs/release_notes.rst
  Co-authored-by: E. Belfer <[email protected]>
* Update docs/templates/vcerare_child.rst.jinja
  Co-authored-by: E. Belfer <[email protected]>
* First batch of little docs fixes
  Co-authored-by: E. Belfer <[email protected]>
* Restructure _combine_city_county_records function
* Add link to zenodo archive to data source page
* Clarify 1 vs. 100 in data source page
* Spread out comments in the _prep_lat_long_fips_df function
* Update docstring for _prep_lat_long_fips_df
* Switch order of add_time_cols and make_cap_frac functions
* Update _combine_city_county_records and move assertion to asset checks
* Change all().all() to any().any()
* Add validations to merges
* docs cleanup tidbits
* Turn _combine_city_county_records function into _drop_city_records and a few other tweaks
* Make fips columns categorical and narrow scope of regex
* data source docs updates
* Add downloadable docs to vcerare data source and fix data source file name to vcerare from vceregen
* Remove 1.34 from field description for capacity_factor_solar_pv
* Add some logs and a function to null county_id_fips values from lakes and an asset check to match
* Update solar_pv metadata
* Rename RARE dataset in the release notes
* Add issue number to release notes
* Update field description for county_or_lake_name
* Update docstring for transform module
* Make all references to FIPS uppercase in notes and comments
* Correct inline comment in _null_non_county_fips_rows
* Fix asset check
* Minor late-night PR fixes
  - Fix failing asset check.
  - Change dtype for `hour_of_year` from float to int.
  - Clarify some release notes / data source docs.
* Log during VCE RARE asset checks to see what's slow.
* Add simple notebook for processing vcerare data
* Re-enable Zenodo DOI validation unit test.
* Update docs to use gcloud storage not gsutil
* Try to reduce memory use & concurrency for VCE RARE dataset
* Retry policy for VCE + highmem use for VCE asset check.
* Bump VM RAM and remove very-high memory tag & retry
* Bump vCPUs to 16
* Add fancy charts to notebook
* Add link to VCE data in nightly build outputs. Other docs tweaks.

---------

Co-authored-by: e-belfer <[email protected]>
Co-authored-by: E. Belfer <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Zane Selvans <[email protected]>