Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create and integrate GOA species-centric files #37

Open
pgaudet opened this issue Jun 22, 2023 · 2 comments
Open

Create and integrate GOA species-centric files #37

pgaudet opened this issue Jun 22, 2023 · 2 comments

Comments

@pgaudet
Copy link
Contributor

pgaudet commented Jun 22, 2023

Alex has create a test file containing all human annotations that GO Central want to load:

The set of protein accessions included in this file is based on UniProt reference proteomes, which provide one protein per gene.
!They include the protein sequences annotated in Swiss-Prot or the longest TrEMBL transcript if there is no Swiss-Prot record.
!In addition this file included Swiss-Prot Isoforms, RNA and ComplexPortal annotations data

We will generate these for pig, cow, dog and chicken, and load these rather than the 3 files we currently load (protein, RNA, complexes). This will also include isoform annotations, now in a separate file, not loaded by the GOC pipeline.

@pgaudet
Copy link
Contributor Author

pgaudet commented Jun 22, 2023

Number of entities:

<style> </style>

New GOA file

protein | 19698
lncRNA | 8476
rRNA | 5269
pre_miRNA | 3583
snoRNA | 1935
snRNA | 1851
tRNA | 1770
SRP_RNA | 1704
protein_complex | 1224
sRNA | 582
miRNA | 536
precursor_RNA | 270
ncRNA | 98
other | 71
misc_RNA | 23
scaRNA | 21
antisense_RNA | 16
RNase_MRP_RNA | 9
telomerase_RNA | 5
RNase_P_RNA | 4
ribozyme | 4
hammerhead_ribozyme | 2
guide_RNA | 1
piRNA | 1

<style> </style>
AmiGO  
protein 19647
gene_product 13026
rRNA 5269
snoRNA 1935
snRNA 1851
tRNA 1770
SRP_RNA 1704
miRNA 536
protein_complex 506
ncRNA 98
antisense_RNA 16
RNase_MRP_RNA 9
telomerase_RNA 5
RNase_P_RNA 4
ribozyme 4
hammerhead_ribozyme 2
guide_RNA 1
piRNA 1

@pgaudet
Copy link
Contributor Author

pgaudet commented Jun 22, 2023

No description provided.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: Todo
Development

No branches or pull requests

1 participant