-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Revise 'input_check' to 'input_assure'; enforce JSON key alteration to match the sample ID if a mismatch is detected #13
Conversation
…o match the sample ID if a mismatch is detected
|
workflows/gas_nomenclature.nf
Outdated
throw new RuntimeException("Pipeline exiting: sample with ID ${meta.id} does not have matching MLST JSON file.") | ||
} | ||
} | ||
match.view() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do we want to leave the .view()
in or was this for debugging?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Haha - I had used it for debugging.
Although highlighting this also made me realize that we no longer need to store the id_match
boolean in the meta
data as we won't be removing any samples from the analysis.
I have updated the python code, the workflow, and the input_assure
process to simplify this here: 95e40f6
workflows/gas_nomenclature.nf
Outdated
profiles = match.branch { | ||
query: !it[0].address | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is this for only querying profiles that have don't already have an address?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes exactly 😄
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is amazing @kylacochrane 😄 . Thanks so much.
In addition to the in-line comments, could you fix the failing test prior to us merging the PR.
…n provided MLST JSON file(s)
The following tests have been replaced or added:
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks so much Kyla for all your hard work and the updates you've made 😄
I have just one additional comment.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks so much for all the changes Kyla 😄 . Just one more comment.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is amazing @kylacochrane . Thanks so much for all your hard work 😄
This PR resolves an issue that arises when a
.mlst.json
file, generated byLocidex
, retains the original IRIDA Next sample identifier after a sample is cloned into a new project, leading to a mismatch in identifiers.To address this, the PR alters the previous
input_check
process, which now reads the.mlst.json
file fromLocidex
and if the sample identifier does not match the JSON key, it is overwritten to ensure consistency.The process has been renamed to
input_assure
for clarity.An
error_report.csv
is generated to identify any samples where the JSON key has been forcefully altered and discloses whether they are a query or reference sample in the pipeline.Info on added tests can be seen below in a separate comment.