Map PHMSA Natural Gas Transmission Part L columns #3254
Codecov Report
@@          Coverage Diff          @@
##            main   #3254   +/-  ##
=====================================
  Coverage   92.7%   92.7%
=====================================
  Files        144     144
  Lines      13087   13091    +4
=====================================
+ Hits       12128   12134    +6
+ Misses       959     957    -2
One non-blocking question about standardizing our column name orders.
commodity_group,,,,,,,,,,,,,,,,,,,,,parta5commodity,parta5commodity,parta5commodity,parta5commodity,parta5commodity,parta5commodity,parta5commodity,parta5commodity,parta5commodity,parta5commodity,parta5commodity,parta5commodity,parta5commodity
interstate_or_intrastate,,,,,,,,,,,,,,,,,,,,,inter_intra,inter_intra,inter_intra,inter_intra,inter_intra,inter_intra,inter_intra,inter_intra,inter_intra,inter_intra,inter_intra,inter_intra,inter_intra
report_state,,,,,,,,,,,,stop,stop,stop,stop,stop,stop,stop,stop,stop,state_name,state_name,state_name,state_name,state_name,state_name,state_name,state_name,state_name,state_name,state_name,state_name,state_name
onshore_transmission_pipe_class_1_miles,,,,,,,,,,,,b4ton_1,b4ton_1,b4ton_1,b4ton_1,b4ton_1,b4ton_1,b4ton_1,b4ton_1,b4ton_1,partltonc1,partltonc1,partltonc1,partltonc1,partltonc1,partltonc1,partltonc1,partltonc1,partltonc1,partltonc1,partltonc1,partltonc1,partltonc1
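For readers unfamiliar with this kind of column map, here is a minimal sketch of how rows like the ones above could be turned into a per-year rename dictionary. It assumes the first field of each row is the standardized name and each later field is the raw column name for one reporting year, with an empty field meaning the column is absent that year. The `year_index` header, the three years, and the helper name `rename_map_for_year` are hypothetical and chosen only for illustration; this is not necessarily how PUDL reads these files.

```python
import csv
import io

# Toy column map: the header and year labels are hypothetical,
# the row values are shortened from the rows quoted in this thread.
TOY_COLUMN_MAP = """\
year_index,2008,2009,2010
report_state,stop,stop,state_name
onshore_transmission_pipe_class_1_miles,b4ton_1,b4ton_1,partltonc1
commodity_group,,,parta5commodity
"""


def rename_map_for_year(column_map_csv: str, year: int) -> dict[str, str]:
    """Return {raw_name: standardized_name} for one year of a column map."""
    reader = csv.DictReader(io.StringIO(column_map_csv))
    index_col = reader.fieldnames[0]  # column holding the standardized names
    mapping = {}
    for row in reader:
        raw_name = row[str(year)]
        if raw_name:  # empty field: column not reported in that year
            mapping[raw_name] = row[index_col]
    return mapping


if __name__ == "__main__":
    print(rename_map_for_year(TOY_COLUMN_MAP, 2009))
    print(rename_map_for_year(TOY_COLUMN_MAP, 2010))
```

Inverting the map this way (raw name to standardized name) makes it directly usable as the argument to a rename step on a raw table.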
I think we are a little inconsistent about the ordering of onshore/offshore and transmission/gathering:
> ag ^onshore_ -c
yearly_miles_of_pipe_by_class_location.csv:25
yearly_miles_of_transmission_pipe_by_nps.csv:35
yearly_miles_of_gathering_pipe_by_nps.csv:95
> ag ^transmission_onshore -c
yearly_transmission_gathering_summary_by_commodity.csv:1
> ag ^gathering_onshore -c
yearly_transmission_gathering_summary_by_commodity.csv:4
It appears that "shoreyness" comes before "transmissioniness" in most cases. Should we standardize that everywhere?
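The same per-file count can be reproduced without `ag`. The sketch below roughly mirrors `ag ^prefix -c` for standardized column names that have already been loaded into memory. The function name `count_by_prefix` and the toy file names and column names are hypothetical, not taken from the repository.

```python
from collections import Counter


def count_by_prefix(names_by_file: dict[str, list[str]], prefix: str) -> Counter:
    """Count, per file, the standardized names that start with `prefix`."""
    counts = Counter()
    for filename, names in names_by_file.items():
        n = sum(1 for name in names if name.startswith(prefix))
        if n:  # like `ag -c`, skip files with no matches
            counts[filename] = n
    return counts


if __name__ == "__main__":
    # Hypothetical stand-ins for the first column of each column-map CSV.
    toy = {
        "class_location.csv": [
            "onshore_transmission_pipe_class_1_miles",
            "report_state",
        ],
        "summary_by_commodity.csv": [
            "transmission_onshore_miles",
            "gathering_onshore_miles",
        ],
    }
    for prefix in ("onshore_", "transmission_onshore", "gathering_onshore"):
        print(prefix, dict(count_by_prefix(toy, prefix)))
```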
I think in transmission_pipe_by_nps and gathering_pipe_by_nps, transmission/gathering are implied, so they're not explicitly spelled out. I can flip the summary_by_commodity usage to more closely reflect the one in class_location though.
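One way to picture the proposed flip is a small normalizer that moves the onshore/offshore token ahead of the transmission/gathering token when they appear in the other order at the start of a name. This is only a sketch of the naming convention under discussion. The function `shore_first` and the example name `transmission_onshore_miles` are hypothetical; the actual change in the PR may simply have been a manual edit of the CSV.

```python
SHORE = {"onshore", "offshore"}
PIPE_TYPE = {"transmission", "gathering"}


def shore_first(name: str) -> str:
    """Put the onshore/offshore token before transmission/gathering.

    Only the first two underscore-separated tokens are inspected, so
    names that already follow the convention are returned unchanged.
    """
    tokens = name.split("_")
    if len(tokens) >= 2 and tokens[0] in PIPE_TYPE and tokens[1] in SHORE:
        tokens[0], tokens[1] = tokens[1], tokens[0]
    return "_".join(tokens)


if __name__ == "__main__":
    for name in (
        "transmission_onshore_miles",  # hypothetical name, gets flipped
        "onshore_transmission_pipe_class_1_miles",  # already shore-first
        "report_state",  # unrelated column, untouched
    ):
        print(name, "->", shore_first(name))
```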
Overview
Closes #3253.
What problem does this address?
Extracts Part L in a raw format for PHMSA transmission data into one table, raw_phmsagas__yearly_miles_of_pipe_by_class_location. This table includes data from 2001-2022.
What did you change?
Mapped columns for the relevant table, and updated the remaining CSVs in src/pudl/package_data/phmsagas.
Testing
How did you make sure this worked? How can a reviewer verify this?
Materialize the raw asset in Dagster. For additional peace of mind, also run the table through the notebook and verify that the columns are correctly mapped.
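As a rough illustration of what "verify columns are correctly mapped" could mean in practice, the sketch below compares one year's raw column names against a rename map and reports three kinds of problems. The function `check_mapping` and the toy inputs are hypothetical; this is not the check performed by the notebook mentioned above.

```python
def check_mapping(raw_columns: list[str], rename_map: dict[str, str]) -> dict[str, list[str]]:
    """Compare raw column names against a {raw_name: standardized_name} map."""
    raw = set(raw_columns)
    mapped = set(rename_map)
    targets = list(rename_map.values())
    return {
        # Map entries that point at a column the raw file does not have.
        "missing_from_raw": sorted(mapped - raw),
        # Raw columns that no map entry covers.
        "unmapped_raw": sorted(raw - mapped),
        # Standardized names that two or more raw columns map to.
        "duplicate_targets": sorted({t for t in targets if targets.count(t) > 1}),
    }


if __name__ == "__main__":
    # Toy inputs: raw names borrowed from the rows quoted in this thread,
    # plus one made-up column ("extra_col") to trigger a finding.
    raw_columns = ["state_name", "partltonc1", "extra_col"]
    rename_map = {
        "state_name": "report_state",
        "partltonc1": "onshore_transmission_pipe_class_1_miles",
        "inter_intra": "interstate_or_intrastate",
    }
    print(check_mapping(raw_columns, rename_map))
```

An all-empty result means every raw column is mapped exactly once and every map entry matches a real column for that year.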
To-do list