-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
node33.candidates file not generated #14
Comments
Hi Riccha, Li |
Thanks for your reply @fangli08. I got following intermediate files: -rw-r--r-- 1 10529 Apr 23 11:17 phased_possorted_bam.bam.arguments |
Hi, Getting the same error on a small test dataset. -rw-rw-r-- 1 morispi morispi 11193 sept. 23 11:40 HCC1143_chr1_209344657_chr13_108809430.bam.arguments Did you manage to find what was causing the problem? Best, |
Sorry for the late reply. Do you have the error information (e.g. stdout and stderr )? |
Hi, Thanks for your reply! LInkedSV was run with the following command: stdout was empty. You can however find stderr attached here stderr.log Best, |
@morispi By the way, from the stderr information, I guess you were probably running on a bam file only including a very small region of interest. LinkedSV needs to do some parameter estimation based on the global distribution of reads and barcodes, so please extract a larger region (> 10 Mb). Best, |
HI, Just cloned the latest version, and the error message related to node33.candidates indeed disappeared. Please find attached the new stderr.log. I don't believe it's a memory issue, LinkedSV consumes at most 900 MB and my machine has 32 GB. I am indeed running on a bam file including a tiny region of interest. I'm actually installing multiple tools on a cluster, and running them on a small example dataset to make sure they run properly. I will attempt to run LinkedSV on a larger example, if you believe this might be the issue. Thank you very much for the quick answers and for the quick fix! Best, |
Hello @morispi, Best, |
Was there a solution found for this? I'm receiving the same error message seen in the last stderr.log except with a return value of 139. |
Hello,
I am using LinkedSV on 10X Genomics linked-reads sequencing generated bam file with following command:
python linkedsv.py -i phased_possorted_bam.bam -d . -r hg38.fa -v hg38 -t 20 --germline_mode --save_temp_files --black_region_bed /black_lists/hg38_black_list.bed --gap_region_bed /black_lists/hg38_gap.bed
And there is no phased_possorted_bam.bam.node33.candidates file generated in the folder. The exact error is:
[04/23/2020 11:05:10 (119.964 MB)] searching for extremely high coverage region
[04/23/2020 11:15:31 (133.612 MB)] calculating distribution parameters
[04/23/2020 11:15:31 (133.612 MB)] total number of reads in the genome is: 322232329
[04/23/2020 11:15:31 (133.616 MB)] calculating fragment parameters from file: phased_possorted_bam.bam.bcd22
[04/23/2020 11:16:25 (306.758 MB)] N95_fragment_length is: 32524
[04/23/2020 11:17:37 (453.153 MB)] finished getting fragment parameters
[04/23/2020 11:17:38 (452.964 MB)] searching for paired breakpoints
[04/23/2020 11:17:38 (452.964 MB)] searching paired breakpoints
[04/23/2020 11:17:38 (452.964 MB)] building nodes from fragments
[04/23/2020 11:17:38 (452.964 MB)] reading bcd22 file: phased_possorted_bam.bam.bcd22
[04/23/2020 11:18:24 (6.147 GB)] total number of fragments: 4547447
[04/23/2020 11:18:26 (6.166 GB)] writing to node file
[04/23/2020 11:21:15 (4.208 GB)] removing sparse nodes, min_support_fragments is 10
[04/23/2020 11:21:16 (4.208 GB)] clustering nodes, max distance for connecting two nodes is: 17684
[04/23/2020 11:21:16 (4.208 GB)] min support fragment pairs is: 10
[04/23/2020 11:21:16 (4.208 GB)] reading black region bed file
[04/23/2020 11:21:16 (4.208 GB)] reading node candidate file: phased_possorted_bam.bam.node33.candidates
Traceback (most recent call last):
File "linkedsv.py", line 313, in
main()
File "linkedsv.py", line 47, in main
detect_increased_fragment_ends(args, dbo_args, endpoint_args)
File "linkedsv.py", line 221, in detect_increased_fragment_ends
find_paired_bk.find_paired_bk(args, dbo_args, endpoint_args)
File "/scripts/find_paired_bk.py", line 723, in find_paired_bk
build_graph_from_fragments(args, dbo_args, endpoint_args)
File "/scripts/find_paired_bk.py", line 196, in build_graph_from_fragments
clustering_nodes(args, dbo_args, endpoint_args, args.node33_candidate_file, args.node_cluster33_file, max_gap_distance, 'R_end', 'R_end')
File "/scripts/find_paired_bk.py", line 333, in clustering_nodes
node_list = read_node_list_file(node_list_file, black_region_key_set, args.alt_tid_set)
File "/scripts/find_paired_bk.py", line 231, in read_node_list_file
node_list_fp = open(node_list_file, 'r')
IOError: [Errno 2] No such file or directory: 'phased_possorted_bam.bam.node33.candidates'
Could you help me find the reason for the problem.
Thanks,
Riccha
The text was updated successfully, but these errors were encountered: