Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

get_sequence and chromosome name #99

Open
dputhier opened this issue Feb 25, 2022 · 1 comment
Open

get_sequence and chromosome name #99

dputhier opened this issue Feb 25, 2022 · 1 comment

Comments

@dputhier
Copy link
Owner

dputhier commented Feb 25, 2022

Hi @fafa13 ,
In the get_sequence function, the chromosome names that are returned may contain additional information. Indeed, in the following case the string following " " (dna_sm:chromo...) should be discarded. Indeed, this information won't be present in the names of the chromosomes found in GTF files. Would it be possible to update this in the library ?
Best

	>1 dna_sm:chromosome chromosome:TAIR10:1:1:30427671:1 REF
	ccctaaaccctaaaccctaaaccctaaacctctgaatccttaatccctaaatccctaaat
	ctttaaatcctacatccatgaatccctaaatacctaattccctaaacccgaaaccGGTTT
	CTCTGGTTGAAAATCATTGTGTATATAATGATAATTTTATCGTTTTTATGTAATTGCTTA
	TTGTTGTGTGTAGATTTTTTAAAAATATCATTTGAGGTCAATACAAATCCTATTTCTTGT
	GGTTTTCTTTCCTTCACTTAGCTATGGATGGTTTATCTTCATTTGTTATATTGGATACAA
	GCTTTGCTACGATCTACATTTGGGAATGTGAGTCTCTTATTGTAACCTTAGGGTTGGTTT

Should be:

Hi @fafa13 ,
In the get_sequence function, the chromosome names that are returned may contain additional information. Indeed, in the following case the string following " " (dna_sm:chromo...) should be discarded. Indeed, this information won't be present in the names of the chromosomes found in GTF files. Would it be possible to update this in the library ?
Best

	>1
	ccctaaaccctaaaccctaaaccctaaacctctgaatccttaatccctaaatccctaaat
	ctttaaatcctacatccatgaatccctaaatacctaattccctaaacccgaaaccGGTTT
	CTCTGGTTGAAAATCATTGTGTATATAATGATAATTTTATCGTTTTTATGTAATTGCTTA
	TTGTTGTGTGTAGATTTTTTAAAAATATCATTTGAGGTCAATACAAATCCTATTTCTTGT
	GGTTTTCTTTCCTTCACTTAGCTATGGATGGTTTATCTTCATTTGTTATATTGGATACAA
	GCTTTGCTACGATCTACATTTGGGAATGTGAGTCTCTTATTGTAACCTTAGGGTTGGTTT
@dputhier
Copy link
Owner Author

See dputhier/pygtftk#171

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant