Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

-f option #5

Open
ksahlin opened this issue Dec 3, 2016 · 1 comment
Open

-f option #5

ksahlin opened this issue Dec 3, 2016 · 1 comment

Comments

@ksahlin
Copy link

ksahlin commented Dec 3, 2016

Firstly, thanks for creating minialign!

I noticed -f option have a large impact on the runtime and memory. Does -f have the same meaning as option -f in minimap? Also, what is the default value of -f, currently it says:
-f FLOAT,... occurrence thresholds [0.05,0.01,0.001]

@ocxtal
Copy link
Owner

ocxtal commented Dec 3, 2016

Hi @ksahlin,

Thank you for trying minialign 😄.
Right. And I'm sorry the document on the options and algorithms is unkind for users.

Does -f have the same meaning as option -f in minimap?

Each value of the -f option list (0.05, 0.01, and 0.001) has the same definition as the minimap, the fraction of top-N occurring minimizers.

In the minialign, seed (minimizer) collection is performed multiple times. With the default (0.05, 0.01, 0.001) setting, in the first seed collection trial, seeds occurring less than top 5% are gathered, chained, and extended. Only if any meaningful alignment is not found in the first trial, the next one with 1% threshold (and finally 0.1%) are executed. This strategy effectively avoid chain confusion around repetitive regions and efficiently reduces calculation time (since the number of collected seeds are dramatically reduced in the first stage). In other words however, this also means changing -f threshold greatly affects on calculation time and memory usage as you pointed out.

At least I have to write detailed explanation on the multiple seed collection and its thresholds. I'll write as soon as possible.

Thanks,

Hajime Suzuki

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants