You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Dear authors,
Thank you for developing and maintaining this software!
I have been running ABSREL for many genes (around 10k) in a 6-species phylogeny and struggling with after applying multiple testing correction (BH) "removing" any "signal" of positive selection. I have been trying with some subsets of smaller more targeted gene sets, but mostly run into this issue again.
When checking the p-values output by ABSREL I noticed a weird distribution that I am not sure how to interpret (whether this is the expected behavior or not). So I wanted to ask, is it abnormal to have many p-values of "1" ? I also observe that there never are any p-values inferred between 0.5 and 1 (except for the value of 1). I atttached a plot of the distribution.
I would be very grateful for some advice on how to interpret this (if this is an indication for problems when running ABSREL on my data or not).
Best regards,
F
The text was updated successfully, but these errors were encountered:
I am afraid that with a 6-sequence dataset nothing will survive a 10K BH FDR correction. You may have to be a little bit more permissive with what you consider significant. Keep in mind that within a gene, aBSREL does a conservative multiple testing correction over branches already.
The p-value distribution that has a lot of values at 1 is expected. If the inference procedure for a given branch fits an unconstrained model where all ω values are < 1, then constraining them to < 1 will yield exactly the same model, with the same log-likelihood an a p-value of exactly 1. In fact aBSREL doesn't even fit these models, it reports them as "test not run".
The boundary at 0.5 is also expected, because if the test IS run, then the constraint on ω results in a test statistic which does not generate p-values >0.5 (it's a mixture of a point mass and a χ squared).
So the graph you pasted looks pretty much exactly as what I would expect.
Dear Sergei,
Thanks for that explanation! That all makes sense now. So setting the FDR cutoff to be more permissive ( for example 0.1 instead of 0.05 might help), if I understand correctly?
I had not realized that the size of the dataset has such a huge impact on the results of ABSREL. I think I will try to extend my dataset with more distant outgroups. I know this is hard to predict/judge but from your experience would you have a recommendation on a minimum dataset size, when running ABSREL on thousands of orthologs?
Dear authors,
Thank you for developing and maintaining this software!
I have been running ABSREL for many genes (around 10k) in a 6-species phylogeny and struggling with after applying multiple testing correction (BH) "removing" any "signal" of positive selection. I have been trying with some subsets of smaller more targeted gene sets, but mostly run into this issue again.
When checking the p-values output by ABSREL I noticed a weird distribution that I am not sure how to interpret (whether this is the expected behavior or not). So I wanted to ask, is it abnormal to have many p-values of "1" ? I also observe that there never are any p-values inferred between 0.5 and 1 (except for the value of 1). I atttached a plot of the distribution.
I would be very grateful for some advice on how to interpret this (if this is an indication for problems when running ABSREL on my data or not).
Best regards,
F
The text was updated successfully, but these errors were encountered: