model name and link	container name	open-source?	size	GPU usage	max tokens (prompt + response)	licence	description
BLOOMZ 7B	transformers-lm-bloomz7b	yes	7.1B	33GB	2,048 tokens	bigscience-bloom-rail-1.0, commercial use allowed	An open-source multilingual instruction-based large language model (46 languages). NB: free of charge. This model is up and running on our servers and can be used for free.
GPT-J 6B	transformers-lm-gptj	yes	6B	25GB	2,048 tokens	Apache 2.0 , commercial use allowed	An open-source English-only large language model which is NOT fine-tuned for instruction following and NOT capable of code generation. NB: free of charge. This model is up and running on our servers and can be used for free.
GPT-3.5	openai-api-davinci3	no	supposedly, 175B	- (cannot be run locally)	4,097 tokens	available under subscription plan, commercial use allowed	A multilingual instruction-based large language model which is capable of code generation. Unlike ChatGPT, not optimized for chat. NB: paid. You must provide your OpenAI API key to use the model. Your OpenAI account will be charged according to your usage.
ChatGPT	openai-api-chatgpt	no	supposedly, 175B	- (cannot be run locally)	4,096 tokens	available under subscription plan, commercial use allowed	Based on gpt-3.5-turbo -- the most capable of the entire GPT-3/GPT-3.5 models family. Optimized for chat. Able to understand and generate code. NB: paid. You must provide your OpenAI API key to use the model. Your OpenAI account will be charged according to your usage.
Open-Assistant Pythia 12B	transformers-lm-oasst12b	yes	12B	29GB (half-precision)	5,120 tokens	Apache 2.0 , commercial use allowed	An open-source English-only instruction-based large language model which is NOT good at answering math and coding questions. NB: free of charge. This model is up and running on our servers and can be used for free.
Vicuna 13B	transformers-lm-vicuna13b	yes, but only for non-commercial use	13B	29GB (half-precision)	2,048 tokens	Non-commercial license	An instruction-based large language model fine-tuned on LLaMa that achieves more than 90%* quality of OpenAI ChatGPT and Google Bard. The model performs best in English and is NOT good at answering math, reasoning, and coding questions. NB-1: Free of charge. This model is up and running on our servers and can be used for free. NB-2: cannot be used for commercial purposes due to license restriction.
GPT-4	openai-api-gpt4	no	supposedly, 175B	- (cannot be run locally)	8,192 tokens	available under subscription plan, commercial use allowed	A multilingual instruction-based large language model which is capable of code generation and other complex tasks. More capable than any GPT-3.5 model, able to do more complex tasks, and optimized for chat. NB: paid. You must provide your OpenAI API key to use the model. Your OpenAI account will be charged according to your usage.
GPT-4 32K	openai-api-gpt4-32k	no	supposedly, 175B	- (cannot be run locally)	32,768 tokens	available under subscription plan, commercial use allowed	A multilingual instruction-based large language model which is capable of code generation and other complex tasks. Same capabilities as the base gpt-4 mode but with 4x the context length. NB: paid. You must provide your OpenAI API key to use the model. Your OpenAI account will be charged according to your usage.
GPT-JT 6B	transformers-lm-gptjt	yes	6B	14GB (half-precision)	2,048 tokens	Apache 2.0 , commercial use is allowed	An open-source English-only large language model which was fine-tuned for instruction following but is NOT capable of code generation. NB: free of charge. This model is up and running on our servers and can be used for free.
ChatGPT 16k	openai-api-chatgpt-16k	no	supposedly, 175B	- (cannot be run locally)	16,384 tokens	available under subscription plan, commercial use allowed	Same capabilities as the standard gpt-3.5-turbo model but with 4 times the context. NB: paid. You must provide your OpenAI API key to use the model. Your OpenAI account will be charged according to your usage.
Anthropic Claude-v1	anthropic-api-claude-v1	no	supposedly, 52B	- (cannot be run locally)	9,000 tokens	available under subscription plan, commercial use allowed	The largest model, ideal for a wide range of more complex tasks. NB: paid. You must provide your Anthropic API key to use the model. Your Anthropic API account will be charged according to your usage.
Anthropic Claude Instant v1	anthropic-api-claude-instant-v1	no (paid access via API)	supposedly, 52B	- (cannot be run locally)	9,000 tokens	available under subscription plan, commercial use allowed	A smaller model with far lower latency, sampling at roughly 40 words/sec! Its output quality is somewhat lower than the latest claude-1 model, particularly for complex tasks. However, it is much less expensive and blazing fast. NB: paid. You must provide your Anthropic API key to use the model. Your Anthropic API account will be charged according to your usage.
Russian XGLM 4.5B (private weights)	transformers-lm-ruxglm	no	4.5B	15GB	2,048 tokens	Not available yet	A private large language model for the Russian language which was fine-tuned for instruction following by Dmitry Kosenko in Summer 2023. This model is up and running on our servers and can be used for free.
ruGPT-3.5-13B	transformers-lm-rugpt35	yes	13B	35GB (half-precision)	2,048 tokens	MIT	A large language model for the Russian language which was used for trainig GigaChat. This model is up and running on our servers and can be used for free.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MODELS.md

MODELS.md

Files

MODELS.md

Latest commit

History

MODELS.md

File metadata and controls