qlora-llama2

Writeup here: https://medium.com/@venkat.ramrao/fine-tuning-llama-2-using-qlora-and-a-cot-dataset-515f38b3972d

Llama 2 7B fine-tuned on several datasets using QLoRA. All datasets are available on Hugging Face. Conclusion: the 7B model seems to work well on extractive QA but less well on reasoning tasks. I plan to test a larger version of the model in the future.
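For a rough sense of why QLoRA makes a 7B model practical to fine-tune, here is a back-of-the-envelope sketch. It is not taken from the notebooks: the LoRA rank of 8 and the choice of two adapted projections per layer are assumptions made for illustration, and the estimate covers weights only (no activations, optimizer state, or quantization constants).

```python
# Rough sizing sketch. The rank and the number of adapted modules are
# assumptions for illustration, not values taken from the notebooks.

HIDDEN_SIZE = 4096      # Llama 2 7B hidden dimension
NUM_LAYERS = 32         # Llama 2 7B transformer blocks
NOMINAL_PARAMS = 7e9    # "7B" taken at face value


def weight_memory_gb(num_params: float, bits_per_param: float) -> float:
    """Memory needed for the weights alone, in GB (10**9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


def lora_trainable_params(rank: int, modules_per_layer: int = 2) -> int:
    """Trainable parameters when square (hidden x hidden) projections get LoRA adapters.

    Each adapted projection adds two small matrices:
    (hidden x rank) and (rank x hidden).
    """
    per_module = rank * (HIDDEN_SIZE + HIDDEN_SIZE)
    return per_module * modules_per_layer * NUM_LAYERS


if __name__ == "__main__":
    print(f"16-bit weights: {weight_memory_gb(NOMINAL_PARAMS, 16):.1f} GB")
    print(f" 4-bit weights: {weight_memory_gb(NOMINAL_PARAMS, 4):.1f} GB")
    print(f"LoRA params (rank 8, 2 modules/layer): {lora_trainable_params(8):,}")
```

Under these assumptions the frozen base weights shrink from about 14 GB to about 3.5 GB, while only around 4.2 million adapter parameters are trained.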

Datasets:

kaist-ai/CoT-Collection. GitHub repo: https://github.com/kaistAI/CoT-Collection Paper: https://arxiv.org/abs/2305.14045

PubMedQA: https://pubmedqa.github.io/ Paper: https://arxiv.org/abs/1909.06146

Databricks Dolly: databricks/databricks-dolly-15k
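Instruction datasets like these are usually flattened into a single prompt string per record before fine-tuning. The sketch below shows one way to do that for a Dolly-style record, using its `instruction`, `context`, and `response` fields. The `build_prompt` helper and the `### Instruction` template are hypothetical, the sample record is made up, and the notebooks may format records differently.

```python
# Hypothetical prompt template; the notebooks may format records differently.

def build_prompt(record: dict) -> str:
    """Turn one databricks-dolly-15k style record into a single training string."""
    instruction = record["instruction"].strip()
    context = record.get("context", "").strip()
    response = record["response"].strip()

    parts = [f"### Instruction:\n{instruction}"]
    if context:  # skip the section when the context field is empty
        parts.append(f"### Context:\n{context}")
    parts.append(f"### Response:\n{response}")
    return "\n\n".join(parts)


# Made-up record for illustration only; not an actual row from the dataset.
example = {
    "instruction": "What does QLoRA quantize?",
    "context": "QLoRA trains adapters on top of a 4-bit quantized base model.",
    "response": "The frozen base model weights.",
}

if __name__ == "__main__":
    print(build_prompt(example))
```

The same idea would carry over to the other datasets by swapping in their field names, for example adding a rationale section for chain-of-thought records.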

Files in this repository:
README.md
qlora_CoT_Calling_ONLY.ipynb
qlora_llama2_clean_CoT-Dataset.ipynb
qlora_llama2_clean_Pubmed.ipynb

vvr-rao/qlora-llama2
