You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In this kmermaid PR (nf-core/kmermaid#131) I'm using some of @luizirber's kindly contributed Sourmash Rust code (luizirber/2021-01-27-olga-remove-protein#3) to remove k-mer hashes from constitutive genes in single cells. For each single cell, two signatures are created: one for aligned reads, the other for unaligned reads. Then, the aligned/unaligned signatures are merged (nf-core/kmermaid#117, nf-core/kmermaid#132 for stragglers) using the Python API in merge_rename_sigs.py (because sourmash sig merge doesn't allow for --name and I like reinventing the wheel)
Command executed:
subtract \
--track-abundance \
--scaled 10 \
--ksize 30 \
--encoding dna \
--output subtracted/ \
vertebrate_mammalian--205--2021-03-15.rna.fa__only_constitutive_genes.fa__molecule-dna__ksize-21,30,51__scaled-10__track_abundance-true.sig \
dna__30.txt
Command exit status:
1
Command output:
(empty)
Command error:
+ nxf_launch
+ /bin/bash .command.run nxf_trace
[2021-03-16T22:08:27Z INFO subtract] Loading queries
[2021-03-16T22:08:28Z INFO subtract] Loaded query signature, k=30
[2021-03-16T22:08:28Z INFO subtract] Loading siglist
[2021-03-16T22:08:28Z INFO subtract] Loaded 2 sig paths in siglist
[2021-03-16T22:08:28Z INFO subtract] Processed 0 sigs
Error: Unable to load a sketch from "mouse_lung__AAATGCCCAAACTGCT---molecule-dna__ksize-21,30,51__scaled-10__track_abundance-true.sig"
Caused by:
No sketch matching the provided template: MinHash(KmerMinHash { num: 0, ksize: 30, hash_function: murmur64_DNA, seed: 42, max_hash: 1844674407370955264, mins: [], abunds: Some([]), md5sum: Mutex { data: None } })
But the signature definitely has the right ksize (30):
In this kmermaid PR (nf-core/kmermaid#131) I'm using some of @luizirber's kindly contributed Sourmash Rust code (luizirber/2021-01-27-olga-remove-protein#3) to remove k-mer hashes from constitutive genes in single cells. For each single cell, two signatures are created: one for aligned reads, the other for unaligned reads. Then, the aligned/unaligned signatures are merged (nf-core/kmermaid#117, nf-core/kmermaid#132 for stragglers) using the Python API in merge_rename_sigs.py (because
sourmash sig merge
doesn't allow for--name
and I like reinventing the wheel)But the signature definitely has the right ksize (30):
But the
max_hash
values don't match! This signature was made with Sourmash 3.5.01844674407370955264
1844674407370955300
(last three digits are different)@luizirber mentioned this may be related to the
scaled
/max_hash
changes in: #1139Example Files
Here are some signatures for example.
Single cell signatures, with aligned+unaligned merged
mouse_brown_fat_ptprc_plus_unaligned__CTGAAGTCAATGGTCT---molecule-dna__ksize-21,30,51__scaled-10__track_abundance-true.sig.txt
mouse_lung__AAATGCCCAAACTGCT---molecule-dna__ksize-21,30,51__scaled-10__track_abundance-true.sig.txt
Non-single cell signatures
SRR4050379__molecule-dna__ksize-21,30,51__scaled-10__track_abundance-true.sig.txt
SRR4050380__molecule-dna__ksize-21,30,51__scaled-10__track_abundance-true.sig.txt
Signature to subtract
vertebrate_mammalian--205--2021-03-15.rna.fa__only_constitutive_genes.fa__molecule-dna__ksize-21,30,51__scaled-10__track_abundance-true.sig.txt
The text was updated successfully, but these errors were encountered: