-
Notifications
You must be signed in to change notification settings - Fork 700
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
NFCORE_RNASEQ:RNASEQ:ALIGN_STAR:STAR_ALIGN_IGENOMES Segmentation fault #1294
Comments
Can you share more information, like the .nextflow.log file? |
Sure. Thank you. |
Do you also have the STAR logs from the process directory? Also, just to be clear, something like |
Do you mean this? path/to/test/test_output/nextflow_work/67/8bff9d83c65cb091f7207dd3befa38/.command.sh: line 10: 44 Segmentation fault STAR --genomeDir SAindex --readFilesIn sample_test_1_val_1.fq.gz sample_test_2_val_2.fq.gz --runThreadN 12 --outFileNamePrefix e_pal_1001_11_c_1_r. --sjdbGTFfile genome.filtered.gtf --outSAMattrRGline ID:e_pal_1001_11_c_1_r 'SM:e_pal_1001_11_c_1_r' --quantMode TranscriptomeSAM --twopassMode Basic --outSAMtype BAM Unsorted --readFilesCommand zcat --runRNGseed 0 --outFilterMultimapNmax 20 --alignSJDBoverhangMin 1 --outSAMattributes NH HI AS NM MD --quantTranscriptomeBan Singleend --outSAMstrandField intronMotif I also tried running using --aligner 'star_rsem' and it works. Is it star_rsem using another version of STAR? How could be run with rsem but not with salmon? |
This is the code I use for running the nfcore rnaseq pipeline throught slurm: #!/bin/bash source path/to/conda/miniconda3/bin/activate base export NXF_SINGULARITY_CACHEDIR=path/to/nf-core_rnaseq/nf-core_rnaseq_singularity/" nextflow run nf-core/rnaseq |
No, if you have a look in the process work directory for STAR there should be more logs that might tell us where it's failing. |
These are the log and the run files from the directory where the error was generated (path/to/test/test_output/nextflow_work/f3/6ddef0ebba1b326eaa2d7ee3302030/). |
Normally we'd expect a STAR log file ending in But, here are some suggestions for you to follow as you debug. 1. Are you able to run the test_full profile of the workflow?
If that fails to run, it would point to something specific to your HPC systems, which it would be hard for us to help with. If that does run, then there will be something specific to your input reads. 2. Can you run the STAR process manually? I would suggest that you copy the task directory and try running the alignment manually, either with the singularity image or a Conda environment. Note that this process uses STAR version 2.6.1d for compatibility with indices in iGenomes. That might provide you with more information about why things are failing to run 3. Try with up-to-date references, not using iGenomes Using iGenomes (i.e. I would suggest that you try running by specifying inputs directly as per that documentation link. You will need sufficient resource to generate a STAR index, but you can do that just once by using the save_reference option to output the index files so you can store them elsewhere and supply them next time you run. This will use an up-to-date STAR which may not have the same issue for you. Use the up-to-date Ensembl reference files if possible, but even if you require GRCh37, supplying the GTF and FASTA inputs directly rather than using |
The log. out of STAR, it's empty. I tried the -profile test_full and I got the same error in the same step, so I suppose it's an error because of my HPC system. I also used the genome and index generated by me but the error persisted. I have now two questions:
Thank you for all the help. |
Could you try without |
Yes, I download the genomes from the AWS: aws s3 --no-sign-request --region eu-west-1 sync s3://ngi-igenomes/igenomes/Homo_sapiens/NCBI/GRCh38/Sequence/STARIndex/ aws s3 --no-sign-request --region eu-west-1 sync s3://ngi-igenomes/igenomes/Homo_sapiens/NCBI/GRCh38/Annotation/Genes/ ./references/Homo_sapiens/NCBI/GRCh38/Annotation/Genes/ --exclude "*" --include "genes.bed" and provided: but the segfault still persisted. |
Don't provide the STAR index (that won't be compatible). Assuming you have sufficient resource to do the STAR indexing, just provide the FASTA and GTF. |
I tried using the "-profile test_full,singularity", and the segfault is still there. When I don't provide the index still being another sigfault because of STAR, it seems I need to look at what STAR does in the back and it is interfering with clusters access. Do you have any ideas? And also as I asked before: Thanks. |
By using RSEM does not use the same process, so will be using a more updated STAR process behind the scenes. Please, if you can, try again with your data, supplying You can use |
Hi, I tried using the last version of GRCh38 (v46), triggering a re-indexing and now star+salmon is working without segfaults, thank you so much. I don't know if I should open a new thread or maybe you can answer me here, there is a way to do rsem and salmon on the same pipeline? I mean run the pipeline and have the results of rsem and salmon. Or, is it possible to feed the pipeline with bam files? Thank you. |
Glad it worked! No, not currently, you can only follow one 'path' at a time through the workflow. Please check the issue queue to see if others have requested the same things, and feel free to create feature requests if not. Closing this issue as complete. |
Description of the bug
Hi, I am trying to run the rnaseq pipeline (3.14.0) but I always get stack in the STAR_ALIGN_IGENOMES. THerror is:
.command.sh: line 10: 42 Segmentation fault
I am using singularity in a HPC, sending the job throught slurm. I tried to add more CPUs and more RAM space but is always the same error. Is it a way to fix it? Is it a problem with the STAR version (2.7.10a)?
Command used and terminal output
The error is similar to this one:
#684 (comment)
Relevant files
No response
System information
No response
The text was updated successfully, but these errors were encountered: