Add OpenELM #93

yumemio · 2024-05-31T06:36:57Z

Paper: https://arxiv.org/abs/2404.14619
Code: https://github.com/apple/corenet
Models
- Pretrained: https://huggingface.co/collections/apple/openelm-pretrained-models-6619ac6ca12a10bd0d0df89e
- Instruct-following: https://huggingface.co/collections/apple/openelm-instruct-models-6619ad295d7ae9f868b759ca

This is a set of tiny open models released by Apple (no cynicism intended!). From the paper:

The reproducibility and transparency of large language models are crucial for advancing open research, ensuring the trustworthiness of results, and enabling investigations into data and model biases, as well as potential risks. To this end, we release OpenELM, a state-of-the-art open language model [...]
[...]
Diverging from prior practices that only provide model weights and inference code, and pre-train on private datasets, our release includes the complete framework for training and evaluation of the language model on publicly available datasets, including training logs, multiple checkpoints, and pre-training configurations.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add OpenELM #93

Add OpenELM #93

yumemio commented May 31, 2024

Add OpenELM #93

Add OpenELM #93

Comments

yumemio commented May 31, 2024