-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Docs #36
Docs #36
Changes from 6 commits
fa228da
27cba9b
adf93b0
e39b934
aeadfde
2f72bb8
b0365a6
b2655a6
0bcde72
1fe9186
14bad2b
5f75c2f
0daab40
7929e5a
85bfd5e
f6e229a
0227900
16325b2
3e81f60
1ab0405
46fc6aa
eb77da1
877cfd3
dd53a9f
caf203f
ceda62b
a934edb
0b6b840
ed6b1d0
8008824
215ab04
ae42305
88320ea
4a57e11
a374b26
72dc060
6eddea4
ba90809
f3d615f
c6cc059
d1ac6b2
c82703b
1da8e6d
40e85ce
0d148d7
11e4623
a7cb194
7fe635a
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
@@ -46,15 +46,15 @@ This repository consists of two key pieces: | |||||||||||||
what is more advanced is the efficient construction, storage, and loading of tabular features for the | ||||||||||||||
candidate AutoML models, enabling a far more extensive search over different featurization strategies. | ||||||||||||||
|
||||||||||||||
### Scripts and Examples | ||||||||||||||
## Scripts and Examples | ||||||||||||||
|
||||||||||||||
See `tests/test_integration.py` for an example of the end-to-end pipeline being run on synthetic data. This | ||||||||||||||
script is a functional test that is also run with `pytest` to verify the correctness of the algorithm. | ||||||||||||||
|
||||||||||||||
For an end to end example over MIMIC-IV, see the [companion repository](https://github.com/mmcdermott/MEDS_TAB_MIMIC_IV) | ||||||||||||||
For an end to end example over Philips eICU, see the [eICU companion repository](https://github.com/mmcdermott/MEDS_TAB_EICU). | ||||||||||||||
|
||||||||||||||
### Core CLI Scripts Overview | ||||||||||||||
## Core CLI Scripts Overview | ||||||||||||||
|
||||||||||||||
1. **`meds-tab-describe`**: This command processes MEDS data shards to compute the frequencies of different code-types | ||||||||||||||
|
||||||||||||||
|
@@ -124,7 +124,7 @@ For an end to end example over Philips eICU, see the [eICU companion repository] | |||||||||||||
|
||||||||||||||
6. **`meds-tab-xgboost-sweep`**: Conducts an Optuna hyperparameter sweep to optimize over `window_sizes`, `aggregations`, and `min_code_inclusion_frequency`, aiming to enhance model performance and adaptability. | ||||||||||||||
|
||||||||||||||
### Additional CLI Scripts | ||||||||||||||
## Additional CLI Scripts | ||||||||||||||
|
||||||||||||||
1. **`generate-permutations`**: Generates and prints a sorted list of all permutations from a comma separated input. This is provided for the convenience of sweeping over all possible combinations of window sizes and aggregations. | ||||||||||||||
Comment on lines
+140
to
142
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Ensure clarity in the command's functionality. Clarify the description of the - 1. **`generate-permutations`**: Generates and prints a sorted list of all permutations from a comma separated input. This is provided for the convenience of sweeping over all possible combinations of window sizes and aggregations.
+ 1. **`generate-permutations`**: Generates and prints all permutations from a comma-separated input, aiding in sweeping over combinations of window sizes and aggregations. Committable suggestion
Suggested change
ToolsLanguageTool
|
||||||||||||||
|
||||||||||||||
|
@@ -149,7 +149,7 @@ For an end to end example over Philips eICU, see the [eICU companion repository] | |||||||||||||
|
||||||||||||||
# How does MEDS-Tab Work? | ||||||||||||||
|
||||||||||||||
#### What do you mean "tabular pipelines"? Isn't _all_ structured EHR data already tabular? | ||||||||||||||
## What do you mean "tabular pipelines"? Isn't _all_ structured EHR data already tabular? | ||||||||||||||
|
||||||||||||||
This is a common misconception. _Tabular_ data refers to data that can be organized in a consistent, logical | ||||||||||||||
set of rows/columns such that the entirety of a "sample" or "instance" for modeling or analysis is contained | ||||||||||||||
|
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,15 +1,14 @@ | ||
sphinx==7.1.2 | ||
sphinx-rtd-theme==1.3.0rc1 | ||
sphinx-collections | ||
recommonmark | ||
piccolo_theme | ||
sphinx_immaterial | ||
sphinx_subfigure | ||
nbsphinx | ||
myst_parser | ||
pypandoc | ||
linkify-it-py | ||
ipykernel | ||
omegaconf | ||
ipywidgets | ||
ipykernel | ||
ipython | ||
esgpt @ git+https://github.com/mmcdermott/EventStreamGPT |
This file was deleted.
This file was deleted.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please review the punctuation and grammar in the descriptions of the CLI scripts to ensure clarity and professionalism.
Also applies to: 72-72, 84-84, 99-99, 112-112, 125-125, 129-129
Tools
LanguageTool