Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Local Clone and ML tools #11

Open
wants to merge 5 commits into
base: main
Choose a base branch
from

Conversation

AlexanderGress
Copy link

Hi,

I implemented some features:

  • Search MaveDB with some filters to get a list of urns (or without filters to get all of them)
  • Download scoretables from MaveDB
  • Create a local clone of MaveDB
  • Different features that aim to create datasets used in Machine Learning applications, like data aggregation and effect value scaling (currently limited to SAVs)

If you have questions or anything: [email protected]

best wishes,
Alexander Gress

ML tools include
 - aggregation of scoresets from same expirements
 - scaling effect values
 - fetching whole protein sequences and updating the hgvs_pro ids
   accordingly
 - outputting datasets in a specialized fasta format
 - currently works just for savs
Implementating a search feature to collect urns with coresponding meta
data from the database.
Added the feature to actually download score tables
Also added features for cloning the whole database locally
Added three example scripts to show how to use the ML features
- also updated procession of MaveDB data to newest standards
- capable to process the now downloadable full MaveDB
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant