v0.5 changes: logging, computational improvements, and more #84

Merged 63 commits on Dec 14, 2023


mmcdermott
Owner

No description provided.

Replaced the old incrementing calls that assigned event IDs with hashes of subject IDs and timestamps, which can be computed lazily.
Removed the `with_row_count` call used to create new event IDs; this speeds up dataset creation and lets lazy frames be used longer.
Added logging to other aspects of ESGPT.
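
To illustrate the event ID change, here is a minimal standard-library sketch, not the code in this PR (which operates on polars frames). It assumes a hypothetical row shape of `(subject_id, timestamp)`; the names `event_id` and `with_event_ids` are made up for the example. The point is that a hash depends only on a row's own fields, so it needs no running counter and can be evaluated lazily and in any order.

```python
import hashlib
from datetime import datetime
from typing import Iterable, Iterator, Tuple

# Hypothetical row shape for this sketch: (subject_id, timestamp).
Row = Tuple[int, datetime]


def event_id(subject_id: int, timestamp: datetime) -> int:
    """Derive a deterministic 64-bit ID from one row's own fields."""
    key = f"{subject_id}|{timestamp.isoformat()}".encode("utf-8")
    digest = hashlib.blake2b(key, digest_size=8).digest()
    return int.from_bytes(digest, "big")


def with_event_ids(rows: Iterable[Row]) -> Iterator[Tuple[int, int, datetime]]:
    """Lazily attach an event ID to each row; nothing runs until iterated."""
    for subject_id, timestamp in rows:
        yield event_id(subject_id, timestamp), subject_id, timestamp


rows = [
    (1, datetime(2020, 1, 1, 8, 0)),
    (1, datetime(2020, 1, 2, 9, 30)),
    (2, datetime(2020, 1, 1, 8, 0)),
]

# Unlike a row counter, the IDs do not depend on the order rows arrive in.
ids_forward = [eid for eid, _, _ in with_event_ids(rows)]
ids_reversed = [eid for eid, _, _ in with_event_ids(reversed(rows))]
```

One trade-off worth keeping in mind: hashed IDs can collide in principle, whereas counter-based IDs cannot, so a uniqueness check may still be useful on large datasets.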

codecov bot commented Dec 3, 2023

Codecov Report

Attention: 72 lines in your changes are missing coverage. Please review.

Comparison is base (e944f14) 86.08% compared to head (8e38596) 88.23%.

❗ Current head 8e38596 differs from pull request most recent head c017794. Consider uploading reports for the commit c017794 to get more accurate results

| Files | Patch % | Lines |
|---|---|---|
| EventStream/data/pytorch_dataset.py | 93.01% | 16 Missing ⚠️ |
| EventStream/data/dataset_polars.py | 91.25% | 14 Missing ⚠️ |
| EventStream/data/dataset_base.py | 73.80% | 11 Missing ⚠️ |
| EventStream/transformer/model_output.py | 11.11% | 8 Missing ⚠️ |
| EventStream/transformer/config.py | 45.45% | 6 Missing ⚠️ |
| EventStream/data/config.py | 94.11% | 5 Missing ⚠️ |
| EventStream/data/visualize.py | 0.00% | 3 Missing ⚠️ |
| ...tream/transformer/lightning_modules/fine_tuning.py | 75.00% | 3 Missing ⚠️ |
| EventStream/transformer/transformer.py | 88.00% | 3 Missing ⚠️ |
| ...tStream/transformer/lightning_modules/embedding.py | 66.66% | 1 Missing ⚠️ |

... and 2 more
Additional details and impacted files
@@            Coverage Diff             @@
##              dev      #84      +/-   ##
==========================================
+ Coverage   86.08%   88.23%   +2.15%     
==========================================
  Files          34       33       -1     
  Lines        6401     6569     +168     
==========================================
+ Hits         5510     5796     +286     
+ Misses        891      773     -118     
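
As a quick sanity check on the diff above, the percentages can be reproduced from the hit and miss counts, since lines = hits + misses. This sketch assumes the report rounds to two decimal places; the helper name `coverage_pct` is made up for the example.

```python
# Figures copied from the Coverage Diff table: (hits, misses) per commit.
base_hits, base_misses = 5510, 891
head_hits, head_misses = 5796, 773


def coverage_pct(hits: int, misses: int) -> float:
    """Coverage as a percentage of total lines, rounded to two decimals."""
    return round(100 * hits / (hits + misses), 2)


base_pct = coverage_pct(base_hits, base_misses)
head_pct = coverage_pct(head_hits, head_misses)
delta = round(head_pct - base_pct, 2)
print(base_pct, head_pct, delta)
```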


mmcdermott and others added 23 commits December 3, 2023 12:46
…n tests and a reference to deleted generative evaluation code.
Adds support and a script for exporting an ESDS dataset from an ESGPT dataset (with support for modifier columns).
…d elsewhere). tests failing due to data_stats.json errors.
Updated some deprecated polars functions.
mmcdermott merged commit dd81924 into dev on Dec 14, 2023
1 of 2 checks passed
mmcdermott deleted the v0.5 branch on December 14, 2023 19:08