Result Retrieval from FastMatch Pipeline #3

Merged: 12 commits into dev on Dec 12, 2024

@emarinier (Member) commented Dec 11, 2024

Description:

As a Data Analyst, I want the FastMatch pipeline to return a file with results that meet my specified similarity threshold, and include the similarity scores for each match, so that I can effectively analyze and compare the results.

Acceptance Criteria:

  1. The FastMatch pipeline generates a linelist Excel file with the query results.
    • Columns: Query ID, Query Sample Name, Reference ID, Reference Sample Name, Distance, Reference Metadata 1, Reference Metadata 2, ...
  2. The results in the file meet the user-specified similarity threshold.
  3. The file includes the similarity scores for each match.
  4. The user can analyze and compare the matches based on the provided similarity scores.

samplesheet.csv

sample,sample_name,mlst_alleles,metadata_1,metadata_2,metadata_3,metadata_4,metadata_5,metadata_6,metadata_7,metadata_8
sample1,sample_name1,https://raw.githubusercontent.com/phac-nml/gasclustering/dev/tests/data/reports/sample1.mlst.json,1.1,1.2,1.3,1.4,1.5,1.6,1.7,1.8
sample2,sample_name2,https://raw.githubusercontent.com/phac-nml/gasclustering/dev/tests/data/reports/sample2.mlst.json,2.1,2.2,2.3,2.4,2.5,2.6,2.7,2.8
sample3,sample_name3,https://raw.githubusercontent.com/phac-nml/gasclustering/dev/tests/data/reports/sample3.mlst.json,3.1,3.2,3.3,3.4,3.5,3.6,3.7,3.8

distances/profile_dists.results.text

query_id	ref_id	dist
sample_name1	sample_name1	0
sample_name1	sample_name2	0
sample_name1	sample_name3	1
sample_name2	sample_name1	0
sample_name2	sample_name2	0
sample_name2	sample_name3	1
sample_name3	sample_name3	0
sample_name3	sample_name1	1
sample_name3	sample_name2	1

append/distances_and_metadata.tsv

Query ID	Query Sample Name	Reference ID	Reference Sample Name	Distance	metadata_1	metadata_2	metadata_3	metadata_4	metadata_5	metadata_6	metadata_7	metadata_8
sample1	sample_name1	sample1	sample_name1	0	1.1	1.2	1.3	1.4	1.5	1.6	1.7	1.8
sample1	sample_name1	sample2	sample_name2	0	2.1	2.2	2.3	2.4	2.5	2.6	2.7	2.8
sample1	sample_name1	sample3	sample_name3	1	3.1	3.2	3.3	3.4	3.5	3.6	3.7	3.8
sample2	sample_name2	sample1	sample_name1	0	1.1	1.2	1.3	1.4	1.5	1.6	1.7	1.8
sample2	sample_name2	sample2	sample_name2	0	2.1	2.2	2.3	2.4	2.5	2.6	2.7	2.8
sample2	sample_name2	sample3	sample_name3	1	3.1	3.2	3.3	3.4	3.5	3.6	3.7	3.8
sample3	sample_name3	sample3	sample_name3	0	3.1	3.2	3.3	3.4	3.5	3.6	3.7	3.8
sample3	sample_name3	sample1	sample_name1	1	1.1	1.2	1.3	1.4	1.5	1.6	1.7	1.8
sample3	sample_name3	sample2	sample_name2	1	2.1	2.2	2.3	2.4	2.5	2.6	2.7	2.8
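
As a rough illustration of the append step shown above, the following Python sketch joins each distance row to the samplesheet by sample name and attaches the reference sample's metadata. It is not the pipeline's actual implementation: the function name `append_metadata` is hypothetical, and the inputs are trimmed to two metadata columns and three distance rows for brevity.

```python
import csv
import io

# Trimmed copies of the example inputs (assumption: two metadata columns only).
SAMPLESHEET = """\
sample,sample_name,mlst_alleles,metadata_1,metadata_2
sample1,sample_name1,sample1.mlst.json,1.1,1.2
sample2,sample_name2,sample2.mlst.json,2.1,2.2
sample3,sample_name3,sample3.mlst.json,3.1,3.2
"""

DISTANCES = (
    "query_id\tref_id\tdist\n"
    "sample_name1\tsample_name1\t0\n"
    "sample_name1\tsample_name2\t0\n"
    "sample_name1\tsample_name3\t1\n"
)


def append_metadata(samplesheet_text, distances_text):
    """Join distance rows to samplesheet rows (hypothetical helper).

    The distance table identifies samples by sample name, so the lookup
    is keyed on the samplesheet's sample_name column.
    """
    samples = list(csv.DictReader(io.StringIO(samplesheet_text)))
    by_name = {row["sample_name"]: row for row in samples}
    metadata_columns = [c for c in samples[0] if c.startswith("metadata_")]

    rows = []
    for dist in csv.DictReader(io.StringIO(distances_text), delimiter="\t"):
        query = by_name[dist["query_id"]]
        ref = by_name[dist["ref_id"]]
        row = {
            "Query ID": query["sample"],
            "Query Sample Name": query["sample_name"],
            "Reference ID": ref["sample"],
            "Reference Sample Name": ref["sample_name"],
            "Distance": dist["dist"],
        }
        # Metadata columns describe the reference sample, not the query.
        row.update({c: ref[c] for c in metadata_columns})
        rows.append(row)
    return rows


rows = append_metadata(SAMPLESHEET, DISTANCES)
print("\t".join(rows[0].keys()))
for row in rows:
    print("\t".join(row.values()))
```

Keying the lookup on `sample_name` mirrors the example data, where `profile_dists.results.text` uses sample names while the appended table reports both the ID and the name.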

process/results.tsv

Query ID	Query Sample Name	Reference ID	Reference Sample Name	Distance	metadata_1	metadata_2	metadata_3	metadata_4	metadata_5	metadata_6	metadata_7	metadata_8
sample1	sample_name1	sample1	sample_name1	0	1.1	1.2	1.3	1.4	1.5	1.6	1.7	1.8
sample1	sample_name1	sample2	sample_name2	0	2.1	2.2	2.3	2.4	2.5	2.6	2.7	2.8
sample2	sample_name2	sample1	sample_name1	0	1.1	1.2	1.3	1.4	1.5	1.6	1.7	1.8
sample2	sample_name2	sample2	sample_name2	0	2.1	2.2	2.3	2.4	2.5	2.6	2.7	2.8
sample3	sample_name3	sample3	sample_name3	0	3.1	3.2	3.3	3.4	3.5	3.6	3.7	3.8
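
The filtering step that turns the appended table into `process/results.tsv` can be sketched as follows. This is only an illustration: the PR does not state the threshold value or whether the comparison is inclusive, so `threshold=0` with an inclusive `<=` is an assumption chosen because it reproduces the five rows shown above. The name `filter_by_threshold` is hypothetical.

```python
# (query ID, reference ID, distance) triples taken from the appended table.
APPENDED = [
    ("sample1", "sample1", 0),
    ("sample1", "sample2", 0),
    ("sample1", "sample3", 1),
    ("sample2", "sample1", 0),
    ("sample2", "sample2", 0),
    ("sample2", "sample3", 1),
    ("sample3", "sample3", 0),
    ("sample3", "sample1", 1),
    ("sample3", "sample2", 1),
]


def filter_by_threshold(rows, threshold):
    """Keep rows whose distance is at or below the threshold.

    Assumption: the comparison is inclusive; the real pipeline may differ.
    """
    return [row for row in rows if row[2] <= threshold]


results = filter_by_threshold(APPENDED, threshold=0)
for query_id, ref_id, distance in results:
    print(f"{query_id}\t{ref_id}\t{distance}")
```

Row order is preserved, so the filtered output keeps the same ordering as the appended table.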

@emarinier emarinier self-assigned this Dec 11, 2024
@emarinier emarinier marked this pull request as draft December 11, 2024 18:12

github-actions bot commented Dec 11, 2024

nf-core pipelines lint overall result: Passed ✅ ⚠️

Posted for pipeline commit 4c18be3

  • ✅ 145 tests passed
  • ❔ 28 tests were ignored
  • ❗ 4 tests had warnings

❗ Test warnings:

❔ Tests ignored:

  • files_exist - File is ignored: assets/nf-core-fastmatchirida_logo_light.png
  • files_exist - File is ignored: docs/images/nf-core-fastmatchirida_logo_light.png
  • files_exist - File is ignored: docs/images/nf-core-fastmatchirida_logo_dark.png
  • files_exist - File is ignored: .github/workflows/awstest.yml
  • files_exist - File is ignored: .github/workflows/awsfulltest.yml
  • files_exist - File is ignored: lib/Utils.groovy
  • files_exist - File is ignored: lib/WorkflowMain.groovy
  • files_exist - File is ignored: lib/NfcoreTemplate.groovy
  • files_exist - File is ignored: lib/Workflowfastmatchirida.groovy
  • nextflow_config - Config variable ignored: manifest.name
  • nextflow_config - Config variable ignored: manifest.homePage
  • nextflow_config - Config variable ignored: params.max_cpus
  • files_unchanged - File ignored due to lint config: LICENSE or LICENSE.md or LICENCE or LICENCE.md
  • files_unchanged - File ignored due to lint config: .github/CONTRIBUTING.md
  • files_unchanged - File ignored due to lint config: .github/ISSUE_TEMPLATE/bug_report.yml
  • files_unchanged - File ignored due to lint config: .github/PULL_REQUEST_TEMPLATE.md
  • files_unchanged - File ignored due to lint config: .github/workflows/branch.yml
  • files_unchanged - File ignored due to lint config: assets/email_template.html
  • files_unchanged - File ignored due to lint config: assets/email_template.txt
  • files_unchanged - File ignored due to lint config: assets/sendmail_template.txt
  • files_unchanged - File does not exist: assets/nf-core-fastmatchirida_logo_light.png
  • files_unchanged - File does not exist: docs/images/nf-core-fastmatchirida_logo_light.png
  • files_unchanged - File does not exist: docs/images/nf-core-fastmatchirida_logo_dark.png
  • files_unchanged - File ignored due to lint config: docs/README.md
  • files_unchanged - File ignored due to lint config: .gitignore or .prettierignore
  • actions_awstest - 'awstest.yml' workflow not found: /home/runner/work/fastmatchirida/fastmatchirida/.github/workflows/awstest.yml
  • actions_awsfulltest - actions_awsfulltest
  • pipeline_name_conventions - pipeline_name_conventions

✅ Tests passed:

Run details

  • nf-core/tools version 3.0.1
  • Run at 2024-12-12 17:59:32

@emarinier emarinier changed the title from "WIP: Result Retrieval from FastMatch Pipeline" to "Result Retrieval from FastMatch Pipeline" Dec 11, 2024
@emarinier emarinier marked this pull request as ready for review December 11, 2024 22:34
@apetkau (Member) left a comment

This looks amazing. Thanks so much @emarinier 😄

Some in-line comments below.

@sgsutcliffe (Collaborator) left a comment

Looks good to me! I didn't run it myself, but I figure I can take a closer look when we merge all three stories.

@apetkau (Member) left a comment

Everything looks great to me. Thanks so much Eric 😄

@emarinier emarinier merged commit 70962d5 into dev Dec 12, 2024
4 checks passed
@emarinier emarinier deleted the output-stub branch December 13, 2024 17:07