Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The bacterial genome sequence file (.fa) sequence name is too long, resulting in the wrong format of the gbk file #712

Open
fersity opened this issue Oct 14, 2024 · 0 comments

Comments

@fersity
Copy link

fersity commented Oct 14, 2024

When the sequence name of the bacterial genome sequence file (.fa) is too long, the name in the starting LOCUS field in the generated.gbk file is too long to occupy the space of the subsequent description. For example, there is no space between the name and the length, and the timestamp is moved to the next line.
This prevented some software from recognizing the gbk file.
ps: Return to normal when the sequence name is truncated appropriately with Spaces.
Before:
(name: >NODE_3_length_398478_cov_81.435531)

LOCUS       NODE_3_length_398478_cov_81.435531398478 bp   DNA linear
14-OCT-2024
DEFINITION  Genus species strain strain.

After truncating the name:

LOCUS       NODE_3                398478 bp    DNA     linear       14-OCT-2024
DEFINITION  Genus species strain strain.

Is there room for adjustment and optimization?
Thank you in advance.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant