Skip to content
View 4vicii's full-sized avatar

Block or report 4vicii

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 4,221 329 Updated Jan 22, 2025

An open-source implementaion for fine-tuning Llama3.2-Vision series by Meta.

Python 128 18 Updated Jan 25, 2025

A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v etc.

Python 244 27 Updated Feb 2, 2025

Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.

397 13 Updated Apr 18, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 40,483 4,970 Updated Feb 14, 2025

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 16,202 2,330 Updated Feb 12, 2025

Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, I…

Python 5,443 472 Updated Feb 14, 2025

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥

Python 29,156 1,928 Updated Feb 15, 2025

LLM Finetuning with peft

Jupyter Notebook 2,310 633 Updated Jul 8, 2024

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 5,248 609 Updated Oct 22, 2024

此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现,也是作为一个算法工程师必会的理论基础知识。

Jupyter Notebook 16,334 4,582 Updated Jun 21, 2022

该仓库主要记录 NLP 算法工程师相关的面试题

2,657 508 Updated Apr 12, 2022

we want to create a repo to illustrate usage of transformers in chinese

Shell 2,590 432 Updated Aug 18, 2024

Consistent Prompting for Rehearsal-Free Continual Learning [CVPR2024]

Python 31 1 Updated Jun 20, 2024

Summarize Massive Datasets using Submodular Optimization

Jupyter Notebook 1 3 Updated Mar 29, 2024

NeurIPS2023

Python 7 Updated Oct 30, 2023

Source code and dataset for ACL 2019 paper "ERNIE: Enhanced Language Representation with Informative Entities"

Python 1,413 268 Updated Jan 10, 2024

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

1,926 94 Updated Jan 26, 2025
Python 162 19 Updated Jul 13, 2024

Explore VLM-Eval, a framework for evaluating Video Large Language Models, enhancing your video analysis with cutting-edge AI technology.

Python 32 2 Updated Jan 20, 2024

算法岗笔试面试大全,励志做算法届的《五年高考,三年模拟》!

301 21 Updated Dec 5, 2024

The collections of MOE (Mixture Of Expert) papers, code and tools, etc.

11 Updated Mar 15, 2024

Code for paper "Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters" CVPR2024

Python 182 13 Updated Nov 17, 2024

Preventing Zero-Shot Transfer Degradation in Continual Learning of Vision-Language Models

Python 86 5 Updated Mar 5, 2024

An Efficient Dataset Condensation Plugin and Its Application to Continual Learning. NeurIPS, 2023.

Python 9 Updated Nov 29, 2023

The MATH Dataset (NeurIPS 2021)

Python 1,016 91 Updated Aug 5, 2024

Editing Models with Task Arithmetic

Python 451 41 Updated Jan 11, 2024

🎉 PILOT: A Pre-trained Model-Based Continual Learning Toolbox

Python 357 38 Updated Feb 8, 2025

Our code for ICCV'23 paper "CAME: Contrastive Automated Model Evaluation".

Python 26 3 Updated Jan 25, 2024
Next
Showing results