[WIP] Load or download the model. #55

Draft
georgeamccarthy wants to merge 2 commits into base: main

Conversation

georgeamccarthy (Owner) commented Aug 10, 2021

PR type

Purpose

  • Allows the model and tokenizer to be stored locally; downloads them if not found.

Why?

  • Unable to download the indexer within the flow on GCP (deployment).

Extra info

Adds a new protein_search/models directory to store models in:

models/
└── prot_bert
    ├── model
    │   ├── config.json
    │   └── pytorch_model.bin
    └── tokenizer
        ├── special_tokens_map.json
        ├── tokenizer_config.json
        └── vocab.txt

I downloaded the models from Hugging Face and then moved them into these directories.
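
For reference, a minimal sketch of the load-or-download logic this PR describes (MODELS_DIR, HF_MODEL_NAME, and the load_or_download helper are illustrative names, not the exact code in this diff):

from pathlib import Path

from transformers import BertModel, BertTokenizer

# Hypothetical constants matching the directory layout above.
MODELS_DIR = Path("protein_search/models/prot_bert")
HF_MODEL_NAME = "Rostlab/prot_bert"

def load_or_download():
    model_dir = MODELS_DIR / "model"
    tokenizer_dir = MODELS_DIR / "tokenizer"
    if model_dir.exists() and tokenizer_dir.exists():
        # Load the local copy if it is already on disk.
        tokenizer = BertTokenizer.from_pretrained(str(tokenizer_dir), do_lower_case=False)
        model = BertModel.from_pretrained(str(model_dir))
    else:
        # Otherwise download from the Hugging Face hub and save a
        # local copy so the next start-up skips the download.
        tokenizer = BertTokenizer.from_pretrained(HF_MODEL_NAME, do_lower_case=False)
        model = BertModel.from_pretrained(HF_MODEL_NAME)
        tokenizer.save_pretrained(str(tokenizer_dir))
        model.save_pretrained(str(model_dir))
    return model, tokenizer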

Feedback required on

  • A quick pair of 👀 on the code
  • Discussion on the technical approach

Mentions

References

Legal

georgeamccarthy (Owner, Author) commented

Not sure if I'm going to merge this, but I need it on GCP without the Dockerization merged. Could probably use a simpler model file structure, like the upstream repo's flat layout: https://huggingface.co/Rostlab/prot_bert/tree/main
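
A flatter layout could mirror the hub repo by saving the model and tokenizer into a single directory; save_pretrained writes config.json, pytorch_model.bin, and the tokenizer files side by side (the path below is illustrative):

from transformers import BertModel, BertTokenizer

flat_dir = "protein_search/models/prot_bert"  # single flat directory (hypothetical path)

tokenizer = BertTokenizer.from_pretrained("Rostlab/prot_bert", do_lower_case=False)
model = BertModel.from_pretrained("Rostlab/prot_bert")

# Both save into the same directory, matching the hub repo layout,
# and from_pretrained(flat_dir) can later load each of them back.
tokenizer.save_pretrained(flat_dir)
model.save_pretrained(flat_dir)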

georgeamccarthy (Owner, Author) commented

There may be a simpler way to get around the issue. If I try to download the model with a simple script

from transformers import BertModel, BertTokenizer

model_path = "Rostlab/prot_bert"

print("Loading tokenizer.")
tokenizer = BertTokenizer.from_pretrained(model_path, do_lower_case=False)
print("Loading model.")
model = BertModel.from_pretrained(model_path)

print("Done.")

then the system runs out of RAM (only ~1 GB is available) and the process dies with a Killed error.

To monitor RAM usage: ps -m -o %cpu,%mem,command

Instead of downloading the repo, I might just be able to configure the download to use a disk cache.
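
If so, the change could be as small as pointing from_pretrained at a persistent cache directory via its cache_dir parameter (the path here is an assumption about where a GCP disk would be mounted):

from transformers import BertModel, BertTokenizer

cache_dir = "protein_search/models/cache"  # assumed persistent disk location

tokenizer = BertTokenizer.from_pretrained(
    "Rostlab/prot_bert", do_lower_case=False, cache_dir=cache_dir
)
model = BertModel.from_pretrained("Rostlab/prot_bert", cache_dir=cache_dir)

Note that cache_dir only controls where the downloaded files land on disk; loading the weights into memory still costs the same RAM.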
