-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add module to create the expected_clusters file for gas-call #11
Conversation
…l levels for the gas-call process
… 3 address in expected_clusters file
This PR is against the
|
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks so much for this @kylacochrane . This is amazing 😄
In addition to the comments I gave, I'm wondering if you could add process test cases for the CLUSTER_FILE
process (https://www.nf-test.com/docs/testcases/nextflow_process/) so that we can more easily test error conditions (such as different number of levels in some addresses)?
I have added the Currently, it only includes a test case where one input sample address is at a different level. Other test cases have been addressed by the gm_delimiter pattern in the nextflow_schema. These test cases will be provided in a separate PR. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good, possibly just one very small thing.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks so much @kylacochrane for addressing my comments 😄 . I have a few additional comments.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Amazing. Thanks so much Kyla for your great work 😄
This PR eliminates the necessity of the
--ref_clusters
input parameter file ingasnomenclature
, which is currently used to fulfill the--rclusters
parameter for thegas_call
module. Instead, it introduces a module within the pipeline that utilizes the provided metadata address to dynamically generate the required--rclusters
information in the correct format forgas_call
.The CLUSTER_FILE module now generates a tab-separated values (TSV) file called expected_clusters.txt. This file organizes sample addresses into columns based on the maximum number of hierarchical levels found among all reference samples.
NOTE: Previously the expected_clusters.txt file that was provided externally used the incorrect address for sample3. Due to the change, the tests/data/call/expected_results.txt file was updated.
PR checklist
nf-core lint
).nextflow run . -profile test,docker --outdir <OUTDIR>
).