Skip to content

Latest commit

 

History

History
16 lines (16 loc) · 11.9 KB

MODELS.md

File metadata and controls

16 lines (16 loc) · 11.9 KB
model name and link container name open-source? size GPU usage max tokens (prompt + response) licence description
BLOOMZ 7B transformers-lm-bloomz7b yes 7.1B 33GB 2,048 tokens bigscience-bloom-rail-1.0, commercial use allowed An open-source multilingual instruction-based large language model (46 languages). NB: free of charge. This model is up and running on our servers and can be used for free.
GPT-J 6B transformers-lm-gptj yes 6B 25GB 2,048 tokens Apache 2.0 , commercial use allowed An open-source English-only large language model which is NOT fine-tuned for instruction following and NOT capable of code generation. NB: free of charge. This model is up and running on our servers and can be used for free.
GPT-3.5 openai-api-davinci3 no supposedly, 175B - (cannot be run locally) 4,097 tokens available under subscription plan, commercial use allowed A multilingual instruction-based large language model which is capable of code generation. Unlike ChatGPT, not optimized for chat. NB: paid. You must provide your OpenAI API key to use the model. Your OpenAI account will be charged according to your usage.
ChatGPT openai-api-chatgpt no supposedly, 175B - (cannot be run locally) 4,096 tokens available under subscription plan, commercial use allowed Based on gpt-3.5-turbo -- the most capable of the entire GPT-3/GPT-3.5 models family. Optimized for chat. Able to understand and generate code. NB: paid. You must provide your OpenAI API key to use the model. Your OpenAI account will be charged according to your usage.
Open-Assistant Pythia 12B transformers-lm-oasst12b yes 12B 29GB (half-precision) 5,120 tokens Apache 2.0 , commercial use allowed An open-source English-only instruction-based large language model which is NOT good at answering math and coding questions. NB: free of charge. This model is up and running on our servers and can be used for free.
Vicuna 13B transformers-lm-vicuna13b yes, but only for non-commercial use 13B 29GB (half-precision) 2,048 tokens Non-commercial license An instruction-based large language model fine-tuned on LLaMa that achieves more than 90%* quality of OpenAI ChatGPT and Google Bard. The model performs best in English and is NOT good at answering math, reasoning, and coding questions. NB-1: Free of charge. This model is up and running on our servers and can be used for free. NB-2: cannot be used for commercial purposes due to license restriction.
GPT-4 openai-api-gpt4 no supposedly, 175B - (cannot be run locally) 8,192 tokens available under subscription plan, commercial use allowed A multilingual instruction-based large language model which is capable of code generation and other complex tasks. More capable than any GPT-3.5 model, able to do more complex tasks, and optimized for chat. NB: paid. You must provide your OpenAI API key to use the model. Your OpenAI account will be charged according to your usage.
GPT-4 32K openai-api-gpt4-32k no supposedly, 175B - (cannot be run locally) 32,768 tokens available under subscription plan, commercial use allowed A multilingual instruction-based large language model which is capable of code generation and other complex tasks. Same capabilities as the base gpt-4 mode but with 4x the context length. NB: paid. You must provide your OpenAI API key to use the model. Your OpenAI account will be charged according to your usage.
GPT-JT 6B transformers-lm-gptjt yes 6B 14GB (half-precision) 2,048 tokens Apache 2.0 , commercial use is allowed An open-source English-only large language model which was fine-tuned for instruction following but is NOT capable of code generation. NB: free of charge. This model is up and running on our servers and can be used for free.
ChatGPT 16k openai-api-chatgpt-16k no supposedly, 175B - (cannot be run locally) 16,384 tokens available under subscription plan, commercial use allowed Same capabilities as the standard gpt-3.5-turbo model but with 4 times the context. NB: paid. You must provide your OpenAI API key to use the model. Your OpenAI account will be charged according to your usage.
Anthropic Claude-v1 anthropic-api-claude-v1 no supposedly, 52B - (cannot be run locally) 9,000 tokens available under subscription plan, commercial use allowed The largest model, ideal for a wide range of more complex tasks. NB: paid. You must provide your Anthropic API key to use the model. Your Anthropic API account will be charged according to your usage.
Anthropic Claude Instant v1 anthropic-api-claude-instant-v1 no (paid access via API) supposedly, 52B - (cannot be run locally) 9,000 tokens available under subscription plan, commercial use allowed A smaller model with far lower latency, sampling at roughly 40 words/sec! Its output quality is somewhat lower than the latest claude-1 model, particularly for complex tasks. However, it is much less expensive and blazing fast. NB: paid. You must provide your Anthropic API key to use the model. Your Anthropic API account will be charged according to your usage.
Russian XGLM 4.5B (private weights) transformers-lm-ruxglm no 4.5B 15GB 2,048 tokens Not available yet A private large language model for the Russian language which was fine-tuned for instruction following by Dmitry Kosenko in Summer 2023. This model is up and running on our servers and can be used for free.
ruGPT-3.5-13B transformers-lm-rugpt35 yes 13B 35GB (half-precision) 2,048 tokens MIT A large language model for the Russian language which was used for trainig GigaChat. This model is up and running on our servers and can be used for free.