Unexpected number of reads at the end .. #20

Open
ptranvan opened this issue Aug 10, 2016 · 4 comments

Comments

@ptranvan

Hi,

I have an MP which contains 27 912 951 reads.
After NextClip, categories A+B+C contain 8 458 324 reads
and categories D+E contain 3 569 827 reads.

It looks like 27 912 951 - (8 458 324 + 3 569 827) = 15 884 800 reads are missing.

Where are those reads?

@richardmleggett
Owner

Hi Patrick,

Not sure... can you send the complete output text from NextClip to [email protected] and I'll look into it.

Thanks,
Richard

@richardmleggett
Owner

Hi,

I've just looked at your log file.

The issue is that you have a LOT of reads that are too short after clipping. You can see this in the line:

All categories too short: 15884800 56.91 %

which is towards the bottom of the log. The minimum length is, by default, 25 bp.
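As a quick sanity check, the read accounting can be sketched in a few lines of Python. This is only an illustration using the numbers quoted in this thread. The regular expression is an assumption based on the single log line shown above (the real log layout may vary), and `parse_too_short` is a hypothetical helper, not part of NextClip.

```python
import re

# Read counts quoted in this thread.
total_reads = 27_912_951
kept_abc = 8_458_324   # categories A+B+C
kept_de = 3_569_827    # categories D+E

# The summary line quoted from the log.
log_line = "All categories too short: 15884800 56.91 %"


def parse_too_short(line):
    """Return (count, percent) from a 'too short' summary line.

    Hypothetical helper: the pattern is guessed from the one line
    quoted in this thread, not from the NextClip source.
    """
    match = re.search(r"too short:\s*(\d+)\s+([\d.]+)\s*%", line)
    if match is None:
        raise ValueError("line does not look like a 'too short' summary")
    return int(match.group(1)), float(match.group(2))


too_short, reported_pct = parse_too_short(log_line)

# Reads not explained by categories A-E plus the 'too short' reads.
unaccounted = total_reads - (kept_abc + kept_de + too_short)

# Recompute the percentage to compare against the value in the log.
computed_pct = round(100 * too_short / total_reads, 2)

print("unaccounted reads:", unaccounted)
print("too short: %.2f %% (log says %.2f %%)" % (computed_pct, reported_pct))
```

With these numbers the "missing" reads are exactly the ones reported as too short, so nothing is unaccounted for.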

I note that you also have a very high level of PCR duplicates.

All the best,
Richard

@ptranvan
Author

I see ... Thanks

Another question: what are categories D and E, and can I use them as paired-end reads?

@richardmleggett
Owner

Hi Patrick,

Category D reads don't contain the adaptor in either read. Category E reads are reads which don't appear to contain the adaptor, but by relaxing the match criteria, we can find what may be the adaptor. Category E reads are usually few in number and I would ignore them. I also tend not to use category D reads, as without the presence of a junction adaptor, it is unclear what has happened.

So the safe approach is just to use category A, B and C.

Thanks,
Richard
