Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bouncy basenji pre-release PR #519

Merged
merged 135 commits into from
Sep 11, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
135 commits
Select commit Hold shift + click to select a range
a58388c
Install module
jfy133 Nov 9, 2023
b25350c
Merge pull request #416 from nf-core/dev
jfy133 Nov 10, 2023
69e0d09
Add get it working, add to tests, update docs/images/citations
jfy133 Nov 10, 2023
2c24711
Fix groovy semantic errors
jfy133 Nov 13, 2023
1a37251
update gitignore to ignore nf-test
jfy133 Nov 13, 2023
240634c
Revert "update gitignore to ignore nf-test"
jfy133 Nov 13, 2023
5e853b0
Fix nonpareil args across all tests
jfy133 Nov 15, 2023
a050658
Add nonpareil
jfy133 Nov 30, 2023
ff16394
Minor linting
jfy133 Nov 30, 2023
46bf2b2
Fix channel emission
jfy133 Nov 30, 2023
c69b00e
Fix module linting
jfy133 Nov 30, 2023
a0d5821
Add nonpareil to valid MultiQC entries
jfy133 Nov 30, 2023
c1d12d9
Final linting
jfy133 Nov 30, 2023
88b8e1a
[automated] Fix linting with Prettier
nf-core-bot Nov 30, 2023
40d1bfa
Apply suggestions from code review
jfy133 Nov 30, 2023
ffff2e6
Merge pull request #417 from nf-core/add-nonpareil
jfy133 Nov 30, 2023
5ea76e8
Merge branch 'dev' into bouncy-basenji
jfy133 Jan 12, 2024
f4dd39b
Merge dev into bouncy-basenji
sofstam Jan 25, 2024
f00bd54
Fix prettier
jfy133 Jan 25, 2024
45dd3c9
Remove merge marker
jfy133 Jan 25, 2024
d9de044
Merge branch 'bouncy-basenji' into fix-merge-conflict
jfy133 Feb 8, 2024
aa6d088
Merge pull request #448 from nf-core/fix-merge-conflict
jfy133 Feb 8, 2024
c22f6e9
Merge dev into bouncy-basenji
LilyAnderssonLee Mar 14, 2024
faf93e9
Merge pull request #455 from LilyAnderssonLee/bouncy-basenji
LilyAnderssonLee Mar 15, 2024
9373a96
update the nanopore and fasta input channel
LilyAnderssonLee Mar 18, 2024
b0d2ad3
Merge branch 'nf-core:master' into fix-validation
LilyAnderssonLee Mar 19, 2024
ff977f6
add fallback to download pipeline workflow
LilyAnderssonLee Mar 19, 2024
b89aba6
update nf-validation
LilyAnderssonLee Mar 19, 2024
d0a58e2
Update workflows/taxprofiler.nf
LilyAnderssonLee Mar 20, 2024
dabc253
Update taxprofiler.nf
LilyAnderssonLee Mar 20, 2024
d5762a7
Disable the save of kaiju2krona txt output
LilyAnderssonLee Mar 20, 2024
a878b74
Merge pull request #458 from LilyAnderssonLee/fix-validation
LilyAnderssonLee Mar 21, 2024
5e8cf10
add db_type feature into database
LilyAnderssonLee Apr 8, 2024
0b4ee8e
Merge branch 'merge_dev_into_bouncy' of github.com:LilyAnderssonLee/t…
jfy133 Apr 8, 2024
1f1ff56
Merge branch 'bouncy-basenji' into LilyAnderssonLee-merge_dev_into_bo…
jfy133 Apr 8, 2024
c4faa69
Merge pull request #467 from nf-core/LilyAnderssonLee-merge_dev_into_…
jfy133 Apr 11, 2024
ece7d5a
update bouncy-basenji branch
LilyAnderssonLee Apr 11, 2024
226eb7b
fix the merge conflicts
LilyAnderssonLee Apr 11, 2024
9211308
Add missing information after fixing the merge conflicts.
LilyAnderssonLee Apr 11, 2024
ba2a2ed
format config files
LilyAnderssonLee Apr 11, 2024
a5f7c78
update CHANGELOG.md
LilyAnderssonLee Apr 11, 2024
b5c76f7
Enable downloading the results of failed tests for debugging
LilyAnderssonLee Apr 11, 2024
abe18e5
Update ci.yml
LilyAnderssonLee Apr 11, 2024
60087e7
Update ci.yml
LilyAnderssonLee Apr 11, 2024
f8bd172
Update CHANGELOG.md
LilyAnderssonLee Apr 17, 2024
e529430
Merge branch 'dev' into bouncy-basenji
jfy133 Jun 11, 2024
88d2b23
Merge branch 'bouncy-basenji' of github.com:nf-core/taxprofiler into …
jfy133 Jun 11, 2024
6b47739
Add nonpareil to testing
jfy133 Jun 11, 2024
1f5d5ea
combine reads and db by db_type
LilyAnderssonLee Jun 13, 2024
547a0aa
Merge branch 'bouncy-basenji' into add_db_type
LilyAnderssonLee Jun 13, 2024
301119a
Update ci.yml
LilyAnderssonLee Jun 13, 2024
749cd28
Update profiling.nf
LilyAnderssonLee Jun 14, 2024
862937b
seperate db_type short,long(both) by semicolon
LilyAnderssonLee Jun 18, 2024
f8e1e4f
replace comma as semicolon in db_type
LilyAnderssonLee Jun 18, 2024
58ef246
Make database splitting and merging more nextflow-y
jfy133 Jun 20, 2024
5d3ee55
Merge pull request #499 from nf-core/dev
jfy133 Jun 20, 2024
2534c08
remove dump from profiling.nf
LilyAnderssonLee Jun 25, 2024
2a679b6
Merge pull request #466 from LilyAnderssonLee/add_db_type
LilyAnderssonLee Jun 25, 2024
c5f47bb
add output files to tower.yml
LilyAnderssonLee Jul 2, 2024
e65ccd2
update changelog
LilyAnderssonLee Jul 2, 2024
f91562e
delete non-supported files in tower.yml
LilyAnderssonLee Jul 3, 2024
72978f3
delete big txt or tsv files from tower.yml
LilyAnderssonLee Jul 3, 2024
3202ea5
update changelog
LilyAnderssonLee Jul 3, 2024
fce5e6f
update tower.yml
LilyAnderssonLee Jul 3, 2024
96dc9ec
Merge pull request #505 from LilyAnderssonLee/add_files_tower
jfy133 Jul 3, 2024
bab16d7
Add nanoq to the pipeline
LilyAnderssonLee Jul 11, 2024
9bba191
Merge branch 'bouncy-basenji' into nanoq
LilyAnderssonLee Jul 11, 2024
58f57b0
update the module nanoq
LilyAnderssonLee Jul 11, 2024
1189411
update modules.config
LilyAnderssonLee Jul 11, 2024
957cc0e
Update nextflow_schema.json
LilyAnderssonLee Jul 12, 2024
fdd851f
Update subworkflows/local/longread_filtering.nf
LilyAnderssonLee Jul 12, 2024
3245430
Update subworkflows/local/longread_preprocessing.nf
LilyAnderssonLee Jul 12, 2024
39541d4
Add nanoq to docs
LilyAnderssonLee Jul 12, 2024
fac95e3
Merge branch 'dev' into bouncy-basenji
jfy133 Jul 18, 2024
656613e
Merge branch 'bouncy-basenji' of github.com:nf-core/taxprofiler into …
jfy133 Jul 18, 2024
5132c21
add help text to longread_filter_tool
LilyAnderssonLee Jul 18, 2024
841d481
default of longread_filter_tool into nanoq in nextflow.config
LilyAnderssonLee Jul 18, 2024
dcc109c
Update nextflow_schema.json
LilyAnderssonLee Jul 18, 2024
5d5edd1
Merge branch 'bouncy-basenji' into nanoq
LilyAnderssonLee Jul 26, 2024
97c4c2b
remove nanoq from multiqc_config.yml
LilyAnderssonLee Jul 26, 2024
e1b367f
Merge pull request #508 from genomic-medicine-sweden/nanoq
LilyAnderssonLee Jul 26, 2024
007f2ab
Add porechop_abi to the pipeline
LilyAnderssonLee Jul 31, 2024
cb9d743
update changelog.md
LilyAnderssonLee Jul 31, 2024
d2435b3
Update docs/output.md
LilyAnderssonLee Jul 31, 2024
9ab08d1
add porechop_abi to multiqc report
LilyAnderssonLee Jul 31, 2024
b178c6a
update comments in longread_preprocessing.nf
LilyAnderssonLee Jul 31, 2024
273ce7f
modify porecho_abi module and add patch file
LilyAnderssonLee Aug 1, 2024
103ea44
run lint
LilyAnderssonLee Aug 1, 2024
b6b5fc0
Update multiqc_config.yml
LilyAnderssonLee Aug 2, 2024
cba2332
update the multiqc_config.yml format
LilyAnderssonLee Aug 2, 2024
aaea700
update modules
LilyAnderssonLee Aug 2, 2024
7707280
update ganon malt motus
LilyAnderssonLee Aug 2, 2024
ac04c4a
Update assets/multiqc_config.yml
LilyAnderssonLee Aug 6, 2024
c779e73
Merge pull request #511 from genomic-medicine-sweden/porechop_abi
jfy133 Aug 7, 2024
e263afe
update filtlong,ganon,krakentools
LilyAnderssonLee Aug 7, 2024
c5a7932
update bracken
LilyAnderssonLee Aug 8, 2024
65899de
add diamond warning for paired-end reads
LilyAnderssonLee Aug 8, 2024
c9f28dc
Remove and consolidate duplicated emit
jfy133 Aug 9, 2024
b42798e
Merge pull request #514 from nf-core/fix-dup-emit
jfy133 Aug 9, 2024
833f72b
update krona,nonpareil and krakentools
LilyAnderssonLee Aug 9, 2024
7cb4f72
update CHANGELOG.md
LilyAnderssonLee Aug 9, 2024
32d4faf
update megan/rma2info
LilyAnderssonLee Aug 9, 2024
0cdc36c
Update Nonpareil to latest, add native MultiQC support, update output…
jfy133 Aug 11, 2024
e0094a6
Fix rendering
jfy133 Aug 11, 2024
02567b3
Apply suggestions from code review
jfy133 Aug 13, 2024
6204b1a
Merge pull request #515 from nf-core/update-nonpareil
jfy133 Aug 13, 2024
d2bc3aa
update metaphlan to the lastest
LilyAnderssonLee Aug 27, 2024
65dbe75
Merge branch 'bouncy-basenji' into update_tools
LilyAnderssonLee Aug 27, 2024
2b1afc4
Update Multiqc to the latest
LilyAnderssonLee Aug 28, 2024
05db993
Add nanoq in the multiqc report
LilyAnderssonLee Aug 30, 2024
a90456d
Publish the ganon classify log file in the output
LilyAnderssonLee Sep 4, 2024
c4d5ed7
update motus/profile
LilyAnderssonLee Sep 4, 2024
81902b3
update modules, most changes happened in nf-test
LilyAnderssonLee Sep 4, 2024
d429463
update utils_nextflow_pipeline and utils_nfcore_pipeline
LilyAnderssonLee Sep 4, 2024
8a42385
Merge branch 'nf-core:master' into update_tools
LilyAnderssonLee Sep 5, 2024
75efdb7
merge dev to the current branch
LilyAnderssonLee Sep 5, 2024
4e2b85f
separate fasta into fasta_shrt and fasta_long
LilyAnderssonLee Sep 5, 2024
2f3ea43
Update changelog
LilyAnderssonLee Sep 5, 2024
bd66c35
add nanoq is default option to the usage
LilyAnderssonLee Sep 5, 2024
4ef2579
unique check for fasta file names
LilyAnderssonLee Sep 6, 2024
54cb683
state that the pipeline is built on nextflow to README.md
LilyAnderssonLee Sep 6, 2024
ef5db33
Update CHANGELOG.md
LilyAnderssonLee Sep 6, 2024
9a23069
Update CHANGELOG.md
LilyAnderssonLee Sep 6, 2024
df25a1b
update modules
LilyAnderssonLee Sep 7, 2024
c6dfb84
Merge pull request #512 from genomic-medicine-sweden/update_tools
LilyAnderssonLee Sep 9, 2024
8b5b635
Remove porechop_abi patch
jfy133 Sep 10, 2024
0774258
Fix patches on PORECHOP modules
jfy133 Sep 10, 2024
8ea3e7c
Fix formatting
jfy133 Sep 10, 2024
b22bb1d
fix pre-commit linting
LilyAnderssonLee Sep 10, 2024
8949ad4
Merge pull request #522 from nf-core/remove-nanopore-patch
jfy133 Sep 10, 2024
af8bc72
replace output-lca into -m lca in ganon classify
LilyAnderssonLee Sep 10, 2024
9500726
Drop unnecesary ganon parameter (should come via db_params) and patte…
jfy133 Sep 11, 2024
5bc5716
Update output docs
jfy133 Sep 11, 2024
8bf5833
Update parameter docs
jfy133 Sep 11, 2024
cc34d41
Merge pull request #523 from nf-core/ganon-classy-lca
jfy133 Sep 11, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
6 changes: 4 additions & 2 deletions .github/workflows/ci.yml
Original file line number Diff line number Diff line change
Expand Up @@ -65,8 +65,10 @@ jobs:
if [[ "${{ matrix.tags }}" == "test_motus" ]]; then
wget https://raw.githubusercontent.com/motu-tool/mOTUs/master/motus/downloadDB.py
python downloadDB.py --no-download-progress
echo 'tool,db_name,db_params,db_path' > 'database_motus.csv'
echo "motus,db_mOTU,,db_mOTU" >> 'database_motus.csv'
echo 'tool,db_name,db_params,db_type,db_path' > 'database_motus.csv'
echo "motus,db1_mOTU,,short,db_mOTU" >> 'database_motus.csv'
echo "motus,db2_mOTU,,long,db_mOTU" >> 'database_motus.csv'
echo "motus,db3_mOTU,,short;long,db_mOTU" >> 'database_motus.csv'
nextflow run ${GITHUB_WORKSPACE} -profile docker,${{ matrix.tags }} --databases ./database_motus.csv --outdir ./results_${{ matrix.tags }};
else
nextflow run ${GITHUB_WORKSPACE} -profile docker,${{ matrix.tags }} --outdir ./results_${{ matrix.tags }};
Expand Down
28 changes: 27 additions & 1 deletion CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -3,16 +3,42 @@
The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/)
and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).

## dev - [unreleased]
## v1.2dev - Bouncy Basenji [unreleased]

### `Added`

- [#417](https://github.com/nf-core/taxprofiler/pull/417) - Added reference-free metagenome estimation with Nonpareil (added by @jfy133)
- [#466](https://github.com/nf-core/taxprofiler/pull/466) - Input database sheets now require a `db_type` column to distinguish between short- and long-read databases (added by @LilyAnderssonLee)
- [#505](https://github.com/nf-core/taxprofiler/pull/505) - Add small files to the file `tower.yml` (added by @LilyAnderssonLee)
- [#508](https://github.com/nf-core/taxprofiler/pull/508) - Add `nanoq` as a filtering tool for nanopore reads (added by @LilyAnderssonLee)
- [#511](https://github.com/nf-core/taxprofiler/pull/511) - Add `porechop_abi` as an alternative adapter removal tool for long reads nanopore data (added by @LilyAnderssonLee)
- [#512](https://github.com/nf-core/taxprofiler/pull/512) - Update all tools to the latest version and include nf-test (Updated by @LilyAnderssonLee & @jfy133)

### `Fixed`

- [#518](https://github.com/nf-core/taxprofiler/pull/518) Fixed a bug where Oxford Nanopore FASTA input files would not be processed (❤️ to @ikarls for reporting, fixed by @jfy133)

### `Dependencies`

| Tool | Previous version | New version |
| ------------- | ---------------- | ----------- |
| bbmap | 39.01 | 39.06 |
| bowtie2 | 2.4.4 | 2.5.2 |
| bracken | 2.7 | 2.9 |
| cat/fastq | 8.30 |
| diamond | 2.0.15 | 2.1.8 |
| ganon | 1.5.1 | 2.0.0 |
| kraken2 | 2.1.2 | 2.1.3 |
| krona | 2.8 | 2.8.1 |
| megan | 6.24.20 | 6.25.9 |
| metaphlan | 4.0.6 | 4.1.1 |
| minimap2 | 2.24 | 2.28 |
| motus/profile | 3.0.3 | 3.1.0 |
| multiqc | 1.21 | 1.24.1 |
| nanoq | | 0.10.0 |
| samtools | 1.17 | 1.20 |
| untar | 4.7 | 4.8 |

### `Deprecated`

## v1.1.8 - Augmented Akita Patch [2024-06-20]
Expand Down
12 changes: 12 additions & 0 deletions CITATIONS.md
Original file line number Diff line number Diff line change
Expand Up @@ -30,14 +30,26 @@

> Schubert, M., Lindgreen, S., & Orlando, L. (2016). AdapterRemoval v2: rapid adapter trimming, identification, and read merging. BMC Research Notes, 9, 88. https://doi.org/10.1186/s13104-016-1900-2
- [Nonpareil](https://doi.org/10.1128/mSystems.00039-18)

- Rodriguez-R, L. M., Gunturu, S., Tiedje, J. M., Cole, J. R., & Konstantinidis, K. T. (2018). Nonpareil 3: Fast Estimation of Metagenomic Coverage and Sequence Diversity. mSystems, 3(3). https://doi.org/10.1128/mSystems.00039-18

- [Porechop](https://github.com/rrwick/Porechop)

> Wick, R. R., Judd, L. M., Gorrie, C. L., & Holt, K. E. (2017). Completing bacterial genome assemblies with multiplex MinION sequencing. Microbial Genomics, 3(10), e000132. https://doi.org/10.1099/mgen.0.000132
- [Porechop_ABI](https://github.com/bonsai-team/Porechop_ABI)

> Bonenfant, Q., Noé, L., & Touzet, H. (2023). Porechop_ABI: discovering unknown adapters in Oxford Nanopore Technology sequencing reads for downstream trimming. Bioinformatics Advances, 3(1):vbac085. https://10.1093/bioadv/vbac085
- [Filtlong](https://github.com/rrwick/Filtlong)

> Wick R (2021) Filtlong, URL: https://github.com/rrwick/Filtlong
- [nanoq](https://github.com/esteinig/nanoq)

> Steinig, E., & Coin, L. (2022). Nanoq: ultra-fast quality control for nanopore reads. Journal of Open Source Software, 7(69). https://doi.org/10.21105/joss.02991
- [BBTools](http://sourceforge.net/projects/bbmap/)

> Bushnell B. (2022) BBMap, URL: http://sourceforge.net/projects/bbmap/
Expand Down
10 changes: 7 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -23,17 +23,21 @@

**nf-core/taxprofiler** is a bioinformatics best-practice analysis pipeline for taxonomic classification and profiling of shotgun short- and long-read metagenomic data. It allows for in-parallel taxonomic identification of reads or taxonomic abundance estimation with multiple classification and profiling tools against multiple databases, and produces standardised output tables for facilitating results comparison between different tools and databases.

The pipeline is built using [Nextflow](https://www.nextflow.io), a workflow tool to run tasks across multiple compute infrastructures in a very portable manner. It uses Docker/Singularity containers making installation trivial and results highly reproducible. The [Nextflow DSL2](https://www.nextflow.io/docs/latest/dsl2.html) implementation of this pipeline uses one container per process which makes it much easier to maintain and update software dependencies. Where possible, these processes have been submitted to and installed from [nf-core/modules](https://github.com/nf-core/modules) in order to make them available to all nf-core pipelines, and to everyone within the Nextflow community!

On release, automated continuous integration tests run the pipeline on a full-sized dataset on the AWS cloud infrastructure. This ensures that the pipeline runs on AWS, has sensible resource allocation defaults set to run on real-world datasets, and permits the persistent storage of results to benchmark between pipeline releases and other analysis sources. The results obtained from the full-sized test can be viewed on the [nf-core website](https://nf-co.re/scnanoseq/results).

## Pipeline summary

![](docs/images/taxprofiler_tube.png)

1. Read QC ([`FastQC`](https://www.bioinformatics.babraham.ac.uk/projects/fastqc/) or [`falco`](https://github.com/smithlabcode/falco) as an alternative option)
2. Performs optional read pre-processing
- Adapter clipping and merging (short-read: [fastp](https://github.com/OpenGene/fastp), [AdapterRemoval2](https://github.com/MikkelSchubert/adapterremoval); long-read: [porechop](https://github.com/rrwick/Porechop))
- Low complexity and quality filtering (short-read: [bbduk](https://jgi.doe.gov/data-and-tools/software-tools/bbtools/), [PRINSEQ++](https://github.com/Adrian-Cantu/PRINSEQ-plus-plus); long-read: [Filtlong](https://github.com/rrwick/Filtlong))
- Adapter clipping and merging (short-read: [fastp](https://github.com/OpenGene/fastp), [AdapterRemoval2](https://github.com/MikkelSchubert/adapterremoval); long-read: [porechop](https://github.com/rrwick/Porechop), [Porechop_ABI](https://github.com/bonsai-team/Porechop_ABI))
- Low complexity and quality filtering (short-read: [bbduk](https://jgi.doe.gov/data-and-tools/software-tools/bbtools/), [PRINSEQ++](https://github.com/Adrian-Cantu/PRINSEQ-plus-plus); long-read: [Filtlong](https://github.com/rrwick/Filtlong)), [Nanoq](https://github.com/esteinig/nanoq)
- Host-read removal (short-read: [BowTie2](http://bowtie-bio.sourceforge.net/bowtie2/); long-read: [Minimap2](https://github.com/lh3/minimap2))
- Run merging
3. Supports statistics for host-read removal ([Samtools](http://www.htslib.org/))
3. Supports statistics metagenome coverage estimation ([Nonpareil](https://nonpareil.readthedocs.io/en/latest/)) and for host-read removal ([Samtools](http://www.htslib.org/))
4. Performs taxonomic classification and/or profiling using one or more of:
- [Kraken2](https://ccb.jhu.edu/software/kraken2/)
- [MetaPhlAn](https://huttenhower.sph.harvard.edu/metaphlan/)
Expand Down
148 changes: 119 additions & 29 deletions assets/multiqc_config.yml
Original file line number Diff line number Diff line change
Expand Up @@ -11,6 +11,48 @@ report_section_order:
order: -1001
"nf-core-taxprofiler-summary":
order: -1002
general_stats":
order: 1000
fastqc:
order: 900
fastqc-1:
order: 800
fastp:
order: 700
adapterRemoval:
order: 600
nonpareil:
order: 500
porechop:
order: 400
porechop_abi:
order: 450
bbduk:
order: 300
prinseqplusplus:
order: 200
filtlong:
order: 100
nanoq:
order: 95
bowtie2:
order: 90
samtools:
order: 80
kraken:
order: 70
bracken:
order: 60
centrifuge:
order: 50
malt:
order: 40
diamond:
order: 30
kaiju:
order: 20
motus:
order: 10

export_plots: true

Expand All @@ -22,11 +64,13 @@ custom_logo_title: "nf-core/taxprofiler"
run_modules:
- fastqc
- adapterRemoval
- fastp
- fastp
- nonpareil
- bbduk
- prinseqplusplus
- porechop
- filtlong
- nanoq
- bowtie2
- minimap2
- samtools
Expand All @@ -44,6 +88,8 @@ sp:
fn_re: ".*(fastqc|falco)_data.txt$"
fastqc/zip:
fn: "*_fastqc.zip"
nonpareil:
fn: "nonpareil_all_samples.json"

top_modules:
- "fastqc":
Expand All @@ -60,13 +106,23 @@ top_modules:
path_filters_exclude:
- "*raw*"
extra: "If used in this run, Falco is a drop-in replacement for FastQC producing the same output, written by Guilherme de Sena Brandine and Andrew D. Smith."
- "fastp"
- "adapterRemoval"
- nonpareil
- "porechop":
name: "Porechop"
anchor: "porechop"
target: "Porechop"
path_filters:
- "*porechop.log"
extra: "ℹ️: if you get the error message 'Error - was not able to plot data.' this means that porechop did not detect any adapters and therefore no statistics generated."
- "bbduk"
- "prinseqplusplus"
- "filtlong"
- "porechop":
name: "Porechop_ABI"
anchor: "porechop_abi"
target: "Porechop_ABI"
doi: "10.1093/bioadv/vbac085"
info: "find and remove adapters from Oxford Nanopore reads."
path_filters:
- "*porechop_abi.log"
extra: "ℹ️: if you get the error message 'Error - was not able to plot data.' this means that porechop_abi did not detect any adapters and therefore no statistics generated."
- "bowtie2":
name: "bowtie2"
- "samtools":
Expand Down Expand Up @@ -95,12 +151,11 @@ top_modules:
- "*.centrifuge.txt"
- "malt":
name: "MALT"
- "diamond"
- "kaiju":
name: "Kaiju"
- "motus"

#It is not possible to set placement for custom kraken and centrifuge columns.
# It is not possible to set placement for custom kraken
# and centrifuge columns.

table_columns_placement:
FastQC / Falco (pre-Trimming):
Expand Down Expand Up @@ -130,16 +185,32 @@ table_columns_placement:
percent_aligned: 370
percent_collapsed: 380
percent_discarded: 390
nonpareil:
nonpareil_R: 400
nonpareil_LR: 410
nonpareil_kappa: 420
nonpareil_C: 430
nonpareil_diversity: 440
Porechop:
Input Reads: 400
Start Trimmed: 410
Start Trimmed Percent: 420
End Trimmed: 430
End Trimmed Percent: 440
Middle Split: 450
Middle Split Percent: 460
Input Reads: 500
Start Trimmed: 510
Start Trimmed Percent: 520
End Trimmed: 530
End Trimmed Percent: 540
Middle Split: 550
Middle Split Percent: 560
Porechop_ABI:
Input Reads: 500
Start Trimmed: 510
Start Trimmed Percent: 520
End Trimmed: 530
End Trimmed Percent: 540
Middle Split: 550
Middle Split Percent: 560
Filtlong:
Target bases: 500
Target bases: 600
nanoq:
Read N50: 700
BBDuk:
Input reads: 800
Total Removed bases percent: 810
Expand Down Expand Up @@ -203,6 +274,24 @@ table_columns_visible:
percent_duplicates: False
percent_gc: False
percent_fails: False
Adapter Removal:
aligned_total: True
percent_aligned: True
percent_collapsed: True
percent_discarded: False
fastp:
pct_adapter: True
pct_surviving: True
pct_duplication: False
after_filtering_gc_content: False
after_filtering_q30_rate: False
after_filtering_q30_bases: False
nonpareil:
nonpareil_R: false
nonpareil_LR: false
nonpareil_kappa: true
nonpareil_C: true
nonpareil_diversity: true
porechop:
Input reads: False
Start Trimmed:
Expand All @@ -211,20 +300,18 @@ table_columns_visible:
End Trimmed Percent: True
Middle Split: False
Middle Split Percent: True
fastp:
pct_adapter: True
pct_surviving: True
pct_duplication: False
after_filtering_gc_content: False
after_filtering_q30_rate: False
after_filtering_q30_bases: False
porechop_abi:
Input reads: False
Start Trimmed:
Start Trimmed Percent: True
End Trimmed: False
End Trimmed Percent: True
Middle Split: False
Middle Split Percent: True
Filtlong:
Target bases: True
Adapter Removal:
aligned_total: True
percent_aligned: True
percent_collapsed: True
percent_discarded: False
nanoq:
ReadN50: True
BBDuk:
Input reads: False
Total Removed bases Percent: False
Expand Down Expand Up @@ -276,6 +363,9 @@ extra_fn_clean_exts:
- ".bbduk"
- ".unmapped"
- "_filtered"
- "porechop"
- "porechop_abi"
- "_processed"
- type: remove
pattern: "_falco"

Expand Down
6 changes: 6 additions & 0 deletions assets/schema_database.json
Original file line number Diff line number Diff line change
Expand Up @@ -39,6 +39,12 @@
"errorMessage": "Invalid database db_params entry. No quotes allowed.",
"meta": ["db_params"]
},
"db_type": {
"type": "string",
"enum": ["short", "long", "short;long"],
"default": "short;long",
"meta": ["db_type"]
},
"db_path": {
"type": "string",
"exists": true,
Expand Down
3 changes: 3 additions & 0 deletions assets/schema_input.json
Original file line number Diff line number Diff line change
Expand Up @@ -38,18 +38,21 @@
"type": "string",
"format": "file-path",
"pattern": "^\\S+\\.f(ast)?q\\.gz$",
"unique": true,
"errorMessage": "FastQ file for reads 1 must be provided, cannot contain spaces and must have extension '.fq.gz' or '.fastq.gz'"
},
"fastq_2": {
"type": "string",
"format": "file-path",
"pattern": "^\\S+\\.f(ast)?q\\.gz$",
"unique": true,
"errorMessage": "FastQ file for reads 2 cannot contain spaces and must have extension '.fq.gz' or '.fastq.gz'. If not applicable, leave it empty."
},
"fasta": {
"type": "string",
"format": "file-path",
"pattern": "^\\S+\\.(f(ast)?q|fa(sta)?)\\.gz$",
"unique": true,
"errorMessage": "FastA file must be provided, cannot contain spaces and must have extension '.fa.gz' or '.fasta.gz'. If not applicable, leave it empty."
}
},
Expand Down
Loading
Loading