The sequences of the wildtype for the DMS datasets #5

Open
lzhangUT opened this issue Dec 29, 2021 · 0 comments

Hi,
I really appreciate your work; it is very helpful for what I am working on right now.
I am fairly new to deep learning on protein sequences, so forgive me if these are silly questions.
Thanks for providing the DMS data in the supplement to your paper 'Deep generative models of genetic variation capture the effects of mutations'. I wonder if you have the raw wildtype protein sequence for each of the DMS datasets.

  1. If you already have the sequences, can you point me to where they are?
  2. If not, did you use Supplemental Table 1 to extract the sequences by UniProt ID? If so, can you point me to the code that extracts them, or did you do it manually through the UniProt website?

Thanks a lot! Happy holidays
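
For the second question, one possible way to pull sequences by UniProt ID is sketched below. This is not the authors' code, only a minimal illustration that assumes the public UniProt REST endpoint `https://rest.uniprot.org/uniprotkb/<accession>.fasta`. The helper names (`fasta_url`, `parse_fasta`, `fetch_sequence`) are hypothetical, and the accessions would need to come from the supplemental table. The canonical UniProt entry may also differ from the exact construct assayed in a given DMS experiment, so the result is worth checking against the mutation positions in each dataset.

```python
from typing import Dict
from urllib.request import urlopen

# Assumed URL pattern for the public UniProt REST API (FASTA output).
UNIPROT_FASTA_URL = "https://rest.uniprot.org/uniprotkb/{accession}.fasta"


def fasta_url(accession: str) -> str:
    """Build the FASTA download URL for one UniProt accession."""
    return UNIPROT_FASTA_URL.format(accession=accession.strip())


def parse_fasta(text: str) -> Dict[str, str]:
    """Parse FASTA text into {header: sequence}, joining wrapped lines."""
    records: Dict[str, list] = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = []
        elif header is not None:
            records[header].append(line)
    return {h: "".join(parts) for h, parts in records.items()}


def fetch_sequence(accession: str, timeout: float = 30.0) -> str:
    """Download one entry and return its sequence (requires network access)."""
    with urlopen(fasta_url(accession), timeout=timeout) as response:
        text = response.read().decode("utf-8")
    records = parse_fasta(text)
    if not records:
        raise ValueError(f"No FASTA record returned for {accession!r}")
    # A single-accession request is expected to return exactly one record.
    return next(iter(records.values()))
```

Calling `fetch_sequence("<accession>")` for each ID in the table would return the canonical sequence as a plain string. Only the standard library is used, so nothing needs to be installed.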
