Pytest with f2s ops #3099

Closed
wants to merge 85 commits into from

@rousik (Collaborator) commented Nov 30, 2023

Dummy PR that combines the pytest refactor with the break-up of the monolithic ferc_to_sqlite ops; it exists so that the pytest suite can be run automatically and evaluated for speedups.

rousik and others added 30 commits September 2, 2023 13:34
Additionally, switch integration tests to use tox under the hood.
rousik and others added 23 commits November 6, 2023 17:35
This will result in nicer, more compact output.
Instead of using a single monolithic op that loops over all forms,
we can use an op factory and the XbrlRuntimeSettings resource to simplify
how settings are passed in.

This way, a single runtime settings resource exists for all xbrl ops and
one op is generated for each form. This should allow for better
parallelism, even though it might interfere with num_workers
being set to the number of CPUs by default (i.e. this way we will
oversubscribe the available cores twice: once for dagster workers, and
once for xbrl num_workers). This, however, should be an easy fix.
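
The factory idea described above can be sketched in plain Python, without dagster itself. This is an illustration of the pattern only, not the code from this PR: `RuntimeSettings`, `make_form_op`, and the form names in `FORMS` are hypothetical stand-ins, and in the real pipeline each generated function would presumably be registered as a dagster op that receives the shared settings resource.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class RuntimeSettings:
    """Hypothetical stand-in for the shared XBRL runtime settings resource."""

    num_workers: int
    batch_size: int


# Illustrative form names only; the real list lives in the project settings.
FORMS: List[str] = ["ferc1", "ferc2", "ferc6"]


def make_form_op(form: str) -> Callable[[RuntimeSettings], str]:
    """Build one independent unit of work for a single form."""

    def convert_form(settings: RuntimeSettings) -> str:
        # Real code would run the extraction for `form` here; this sketch
        # only reports which settings the op would have used.
        return f"{form}: workers={settings.num_workers}, batch={settings.batch_size}"

    # Give each generated function a distinct, per-form name.
    convert_form.__name__ = f"{form}_xbrl"
    return convert_form


# One settings object is shared by every generated op.
settings = RuntimeSettings(num_workers=2, batch_size=50)
form_ops: Dict[str, Callable[[RuntimeSettings], str]] = {
    form: make_form_op(form) for form in FORMS
}
results = [form_op(settings) for form_op in form_ops.values()]
```

Because every generated function depends only on its own form and the shared settings, a scheduler is free to run them concurrently, which is the parallelism the commit message is after.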
Refactor the monolithic dbf2sqlite and xbrl2sqlite methods into smaller
per-dataset ops that are invoked within the graphs. This should allow us to
make better use of dagster parallelism and speed up ferc_to_sqlite
processing.

It seems that the current unit/integration tests only use FERC1 raw data,
so I've modified the fixtures to run only the relevant pieces of processing.
@rousik rousik changed the base branch from main to dev November 30, 2023 07:39
rousik and others added 4 commits November 30, 2023 13:02
add tuning for xbrl workers, set it to 2 here
To test the speed-up characteristics of different tuning params, and
to determine what fits on the worker.
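
One way to reason about this tuning: if several ops run concurrently and each spawns its own pool of workers, the total process count is roughly the product of the two. The sketch below caps the per-op worker count so that the product stays within the available cores. The helper `workers_per_op` is hypothetical and is not taken from this PR, which simply sets the value to 2 for testing.

```python
import os
from typing import Optional


def workers_per_op(concurrent_ops: int, total_cores: Optional[int] = None) -> int:
    """Split the available cores evenly across concurrently running ops.

    Always returns at least 1 so that every op can make progress.
    """
    cores = total_cores if total_cores is not None else (os.cpu_count() or 1)
    return max(1, cores // max(1, concurrent_ops))


# Example: 4 concurrent ops on an 8-core machine get 2 workers each.
per_op = workers_per_op(concurrent_ops=4, total_cores=8)
```

An even split is only a starting point; memory use per worker may force a lower value than the core count alone suggests, which is presumably part of "what fits on the worker".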
Base automatically changed from dev to main January 5, 2024 04:14
@rousik rousik closed this Jan 18, 2024
2 participants