Obsolete UniProt identifiers in existing PHI-base curation #52

Open
jseager7 opened this issue Nov 4, 2019 · 5 comments

Comments

jseager7 commented Nov 4, 2019

We currently have many obsolete UniProt accessions in PHI-base, and we have no effective way to locate the corresponding extant entry for these accessions.

Currently, I think the best we can do is one of two things: BLAST the sequence of the obsolete accession (which is slow and probably unreliable), or try to find an active accession with the same gene name as the obsoleted one (which seems even less reliable, although a sequence alignment tool could support the comparison).
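
As a rough illustration of the second option, the sketch below ranks candidate active entries (already shortlisted by gene name) by how similar their sequence is to the obsolete entry's sequence. It uses Python's `difflib.SequenceMatcher` as a crude stand-in for a real alignment tool, so the scores are only a triage aid and not an alignment result. The function name `rank_candidates` and the input layout (a dict of accession to sequence) are assumptions made for this example.

```python
from difflib import SequenceMatcher


def rank_candidates(obsolete_seq, candidates):
    """Rank candidate active entries by crude similarity to an obsolete sequence.

    candidates: dict mapping accession -> protein sequence (hypothetical input,
    e.g. entries that share a gene name with the obsolete accession).
    Returns a list of (accession, ratio) pairs, most similar first.
    """
    scored = []
    for accession, seq in candidates.items():
        # autojunk=False disables the heuristic that ignores very frequent
        # characters, which matters for long sequences over a small alphabet.
        ratio = SequenceMatcher(None, obsolete_seq, seq, autojunk=False).ratio()
        scored.append((accession, ratio))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)
```

A low top score would suggest that the gene-name match is coincidental and that the candidate should be checked with a proper alignment before it replaces the obsolete accession.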

@ValWood contacted UniProt about their policies for obsoleting accessions, and we were directed to a mapping that "maps old to new accession numbers via their protein_ids". I think from preliminary analysis there was no evidence that these new accessions mapped to any of our obsolete accessions, but I might have done the analysis wrong, so I'm planning to re-do this to make sure.
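
The re-analysis described above could be sketched as follows. The thread does not describe the layout of the mapping UniProt pointed to, so this example assumes a two-column tab-separated text (old accession, new accession); the function names `load_mapping` and `split_by_mapping` are hypothetical.

```python
import csv
import io


def load_mapping(tsv_text):
    """Parse two-column TSV text (old accession, new accession) into a dict.

    The two-column layout is an assumption; adjust the column indices to
    match the real mapping file. One old accession may map to several new ones.
    """
    mapping = {}
    for row in csv.reader(io.StringIO(tsv_text), delimiter="\t"):
        # Skip blank lines, short rows, and comment lines.
        if len(row) < 2 or row[0].startswith("#"):
            continue
        old, new = row[0].strip(), row[1].strip()
        mapping.setdefault(old, set()).add(new)
    return mapping


def split_by_mapping(obsolete_accessions, mapping):
    """Separate obsolete accessions into those with and without a replacement."""
    mapped = {acc: sorted(mapping[acc]) for acc in obsolete_accessions if acc in mapping}
    unmapped = [acc for acc in obsolete_accessions if acc not in mapping]
    return mapped, unmapped
```

If `unmapped` contains every obsolete accession from PHI-base, that would agree with the preliminary finding; any entries in `mapped` would be candidates for direct replacement.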

CuzickA commented Aug 25, 2022

Hi @jseager7, can this ticket be closed now?

@jseager7
Author

@CuzickA It should stay open if we still have obsolete UniProtKB accession numbers in PHI-base 4. That is, unless you want this tracker to only be for the new PHI-Canto curation.

If all of the obsolete identifiers in PHI-base 4 have been replaced or removed, then it's fine to close this issue.

CuzickA commented Aug 25, 2022

Ahh I see, this query is for the PHI-base 4 data migration. I thought this tracker was just for new PHI-Canto curation.

We can keep it open and I'll add a new label 'PHI4 to PHI5 data migration'.

@martin2urban do you know whether we still have obsolete UniProtKB accession numbers in PHI-base 4? I remember that you did some work on this with a colleague.

CuzickA commented Sep 6, 2024

Hi @jseager7, has this been resolved in the data migration? Can we close the ticket now?

jseager7 commented Sep 9, 2024

I'll check whether there are still obsolete UniProt IDs in PHI-base 4.18 before closing this issue. From memory I think that there are.
