Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

RuntimeError with Shared Tensors in DNABERT-2-117M Configuration Using run_mlm.py #130

Open
RobinLi7 opened this issue Dec 22, 2024 · 0 comments

Comments

@RobinLi7
Copy link

RuntimeError: The weights trying to be saved contained shared tensors [{'cls.predictions.decoder.weight', 'bert.embeddings.word_embeddings.weight'}] that are mismatching the transformers base configuration. Try saving using safe_serialization=False or remove this tensor sharing.

When I run run_mlm.py, the following error occurs during train_result = trainer.train(resume_from_checkpoint=checkpoint):
I’m using the configuration zhihan1996/DNABERT-2-117M and the BertForMaskedLM model from bert_layers.py.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant