Will it be beneficial to use all genomics bins, instead of most variable feature selection? #121
Unanswered
wangmeijiao
asked this question in
Q&A
Replies: 1 comment 5 replies
-
SnapATAC version 1 also filters bins. Some bins, especially those with low counts, may be very noisy. If you don't filter them, you will see a lot of noisy structures in your clustering result. |
Beta Was this translation helpful? Give feedback.
5 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi Kai and all,
It seems that it is quite important to select features for downstream dimension reduction and other. If I understood correct the hints you raised in the discussion thread "#116", both top ranked variable features (needs tunning) and iteratively selection method can do good to the next dimension reduction step. Here I ask that why not just select all genomic bins (with some filtering of black list regions) as SnapATAC version 1 does? if CPU and memory are not the problems.
Best,
Meijiao
Beta Was this translation helpful? Give feedback.
All reactions