Stars
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
An Open Large Reasoning Model for Real-World Solutions
DAMO-ConvAI: The official repository which contains the codebase for Alibaba DAMO Conversational AI.
Qwen2.5 is the large language model series developed by the Qwen team, Alibaba Cloud.
Train transformer language models with reinforcement learning.
Example models using DeepSpeed
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs, and parameter-efficient methods (e.g., LoRA, P-tuning) for easy use. We welcome open-source enthusiasts…
Code for the paper Fine-Tuning Language Models from Human Preferences
✨✨Latest Advances on Multimodal Large Language Models
Chinese and English multimodal conversational language model
Set of tools to assess and improve LLM security.
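A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and trained at low cost, including base models, domain-specific fine-tuned models and applications, datasets, and tutorials.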
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
ChatGLM-6B: An Open Bilingual Dialogue Language Model
LAVIS - A One-stop Library for Language-Vision Intelligence
Implementation of plug-and-play attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"
Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Robust Speech Recognition via Large-Scale Weak Supervision
An open-source tool-augmented conversational language model from Fudan University
Chinese-LLaMA 1&2 and Chinese-Falcon base models; ChatFlow Chinese dialogue model; Chinese OpenLLaMA model; NLP pre-training and instruction fine-tuning datasets
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821