-
-
Notifications
You must be signed in to change notification settings - Fork 110
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feature branch: Rename core + output assets to match new naming protocols #2818
Conversation
Thanks @e-belfer! I feel pretty good about all of the proposed asset names in the naming convention sheet except for the assets currently in the
|
I think the alternative to
Is there a reason to keep the |
Check out this pull request on See visual diffs & provide feedback on Jupyter Notebooks. Powered by ReviewNB |
Updated decisions: keep "pudl" as datasource, and leave duplicated PUDL as is. |
Codecov ReportAttention:
Additional details and impacted files@@ Coverage Diff @@
## dev #2818 +/- ##
=======================================
- Coverage 92.7% 92.6% -0.0%
=======================================
Files 134 134
Lines 12597 12611 +14
=======================================
+ Hits 11672 11682 +10
- Misses 925 929 +4 ☔ View full report in Codecov by Sentry. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good to me! Just a couple of assets in notebooks that might need to be renamed.
I'm thinking we should use this as the main feature branch for the core/output rename. What do you think @e-belfer?
" utilities_entity_eia: pd.DataFrame,\n", | ||
" utilities_eia860: pd.DataFrame,\n", | ||
" core_eia860__scd_utilities: pd.DataFrame,\n", | ||
" utilities_eia: pd.DataFrame,\n", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should utilities_entity_eia
be core_eia__entity_utilities
and utilities_eia
be core_pudl__assn_utilities_eia
?
Hmm I'll take a look and update this branch with the changes from dev again. |
It looks like |
…pudl into rename-core-assets
For more information, see https://pre-commit.ci
devtools/data-release.sh
Outdated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This file has been deleted from dev
. Do we know why it's showing up here now?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah that is strange. I'll look into it.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This file has been deleted on dev
src/pudl/etl/analysis_assets.py
Outdated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This file has been deleted on dev
src/pudl/metadata/resources/mcoe.py
Outdated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If nothing inside this module is going to be called MCOE maybe we should change the name of the module?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I agree! I decided to punt updating references to MCOE everwhere because it's a large change. #2910
The CI and full build passed... should we merge this thing in? |
I think merging it in and then using it ourselves for a while to iron out the kinks before we cut a new release with the new names is probably a good idea. I'm sure there will be issues and inconsistencies that we only discover when using it day-to-day. |
Should we point datasette at |
You mean just stop updating it? |
I dunno. The docs say that Datasette reflects the nightly builds, so we would need to change that. And the If we can get on an actual monthly release schedule, then having the public Kaggle and Datasette outputs track the |
Having datasette and docs point at |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm sure we will find things to fix and improve as we use it, but I think that's only going to happen through actual usage.
See #2812 for issue description and progress tracking. This will collect naming PRs, which will all need to be merged in together to avoid major headaches for downstream users.
The PR includes renamed assets for:
It also does a few other things:
Not converted:
entity_types_eia
, which is currently not loaded into the PUDL DB.clean_xbrl_metadata_json
)Steps for merging
dev
, create a PR fordev -> main
.pre-name-change
. Question: Should we do a final full software release or just let people install from git tab?dev
dev -> main
and tag a release using our desired release naming convention:vYYYY.MM.DD
PR Checklist
dev
).