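Data updated: 2024-10-10 / Note: "Chinese projects" broadly means projects whose documentation is natively written in Chinese OR that include a Chinese translation; this can usually be found in the project's README, wiki, or official website.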
# | Repository | Description | Stars | Updated | Created |
---|---|---|---|---|---|
1 | RVC-Boss/GPT-SoVITS | 1 min voice data can also be used to train a good TTS model! (few shot voice cloning) | 33797 | 2024-10-02 | 2024-01-14 |
2 | All-Hands-AI/OpenHands | 🙌 OpenHands: Code Less, Make More | 32822 | 2024-10-09 | 2024-03-13 |
3 | 2noise/ChatTTS | A generative speech model for daily dialogue. | 31333 | 2024-10-09 | 2024-05-27 |
4 | myshell-ai/OpenVoice | Instant voice cloning by MIT and MyShell. | 29071 | 2024-08-21 | 2023-11-29 |
5 | hpcaitech/Open-Sora | Open-Sora: Democratizing Efficient Video Production for All | 21814 | 2024-08-09 | 2024-02-20 |
6 | infiniflow/ragflow | RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding. | 18857 | 2024-10-09 | 2023-12-12 |
7 | VikParuchuri/marker | Convert PDF to markdown quickly with high accuracy | 16882 | 2024-09-07 | 2023-10-30 |
8 | harry0703/MoneyPrinterTurbo | Generate high-definition short videos with one click using large AI models. | 16408 | 2024-07-26 | 2024-03-11 |
9 | ScrapeGraphAI/Scrapegraph-ai | Python scraper based on AI | 14788 | 2024-10-09 | 2024-01-27 |
10 | THUDM/ChatGLM3 | ChatGLM3 series: Open Bilingual Chat LLMs | 13379 | 2024-07-10 | 2023-10-26 |
11 | KwaiVGI/LivePortrait | Bring portraits to life! | 12216 | 2024-10-07 | 2024-07-03 |
12 | OpenBMB/MiniCPM-V | MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone | 12189 | 2024-09-13 | 2024-01-29 |
13 | netease-youdao/QAnything | Question and Answer based on Anything. | 11580 | 2024-09-27 | 2024-01-03 |
14 | PKU-YuanGroup/Open-Sora-Plan | This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project. | 11308 | 2024-10-08 | 2024-02-20 |
15 | VikParuchuri/surya | OCR, layout analysis, reading order, table recognition in 90+ languages | 10036 | 2024-10-08 | 2024-01-10 |
16 | fudan-generative-vision/hallo | Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation | 9281 | 2024-09-14 | 2024-06-12 |
17 | OpenBMB/XAgent | An Autonomous LLM Agent for Complex Task Solving | 8078 | 2024-08-12 | 2023-10-16 |
18 | microsoft/UFO | A UI-Focused Agent for Windows OS Interaction. | 7633 | 2024-09-25 | 2024-01-08 |
19 | jianchang512/clone-voice | A sound cloning tool with a web interface, using your voice or any sound to record audio | 7355 | 2024-08-22 | 2023-11-19 |
20 | netease-youdao/EmotiVoice | EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine | 7314 | 2024-08-13 | 2023-11-08 |
21 | deepseek-ai/DeepSeek-Coder | DeepSeek Coder: Let the Code Write Itself | 6640 | 2024-05-21 | 2023-10-20 |
22 | modelscope/DiffSynth-Studio | Enjoy the magic of Diffusion models! | 6409 | 2024-10-09 | 2023-12-07 |
23 | jianchang512/ChatTTS-ui | A simple local web interface that uses ChatTTS to synthesize text into speech and also exposes an API for external use. | 6020 | 2024-08-29 | 2024-05-30 |
24 | OpenGVLab/InternVL | [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model approaching GPT-4o performance. | 5692 | 2024-09-19 | 2023-11-22 |
25 | FunAudioLLM/CosyVoice | Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability. | 5386 | 2024-09-29 | 2024-07-03 |
26 | Upsonic/gpt-computer-assistant | Intelligence development framework in python for your product like Apple Intelligence | 5212 | 2024-09-10 | 2024-05-26 |
27 | adithya-s-k/omniparse | Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks | 5138 | 2024-09-23 | 2024-06-04 |
28 | Ucas-HaoranWei/GOT-OCR2.0 | Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model | 5121 | 2024-10-02 | 2024-09-02 |
29 | opendatalab/PDF-Extract-Kit | A Comprehensive Toolkit for High-Quality PDF Content Extraction | 5017 | 2024-10-09 | 2024-06-27 |
30 | modelscope/agentscope | Start building LLM-empowered multi-agent applications in an easier way. | 4983 | 2024-10-08 | 2024-01-12 |
31 | OpenInterpreter/01 | The #1 open-source voice interface for desktop, mobile, and ESP32 chips. | 4924 | 2024-10-02 | 2024-01-11 |
32 | InternLM/MindSearch | 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT) | 4813 | 2024-09-25 | 2024-07-28 |
33 | THUDM/GLM-4 | GLM-4 series: Open Multilingual Multimodal Chat LMs | 4800 | 2024-10-06 | 2024-05-15 |
34 | xxlong0/Wonder3D | Single Image to 3D using Cross-Domain Diffusion for 3D Generation | 4723 | 2024-08-29 | 2023-10-14 |
35 | dyang886/Game-Cheats-Manager | Easily download and manage game cheats for your convenience | 4693 | 2024-10-05 | 2023-12-31 |
36 | myshell-ai/MeloTTS | High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean. | 4581 | 2024-08-09 | 2024-02-19 |
37 | YaoFANGUK/video-subtitle-remover | AI-based tool for removing hard-coded subtitles and text-like watermarks from images and videos, producing subtitle-free and watermark-free files at the original resolution. Runs locally; no third-party API required. | 4100 | 2024-10-09 | 2023-10-25 |
38 | TeamWiseFlow/wiseflow | Wiseflow is an agile information mining tool that extracts concise messages from various sources such as websites, WeChat official accounts, social platforms, etc. It automatically categorizes and upl ... | 4071 | 2024-09-04 | 2024-04-24 |
39 | Kwai-Kolors/Kolors | Kolors Team | 3683 | 2024-09-04 | 2024-07-05 |
40 | lipku/livetalking | Real time interactive streaming digital human | 3587 | 2024-10-05 | 2023-12-19 |
41 | Tencent/HunyuanDiT | Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding | 3337 | 2024-08-15 | 2024-05-10 |
42 | huggingface/speech-to-speech | Speech To Speech: an effort for an open-sourced and modular GPT4-o | 3200 | 2024-09-27 | 2024-08-07 |
43 | AiuniAI/Unique3D | Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image | 2954 | 2024-09-18 | 2024-05-30 |
44 | FunAudioLLM/SenseVoice | Multilingual Voice Understanding Model | 2869 | 2024-09-25 | 2024-07-03 |
45 | X-PLUG/MobileAgent | Mobile-Agent: The Powerful Mobile Device Operation Assistant Family | 2792 | 2024-09-26 | 2024-01-26 |
46 | gpt-omni/mini-omni | open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities. | 2775 | 2024-09-25 | 2024-08-27 |
47 | BadToBest/EchoMimic | Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning | 2595 | 2024-08-15 | 2024-07-03 |
48 | TMElyralab/MuseTalk | MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting | 2546 | 2024-09-23 | 2024-03-26 |
49 | QwenLM/Qwen2-VL | Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud. | 2508 | 2024-10-04 | 2024-08-29 |
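50 | PeterH0323/Streamer-Sales | Streamer-Sales (Sales Champion): a livestream-sales host LLM 🛒🎁 that presents products based on their given features, with the aim of stimulating the viewer's desire to buy. 🚀⭐ Includes a detailed data generation pipeline ❗ 📦 Also integrates LMDeploy accelerated inference 🚀, RAG retrieval-augmented generation 📚, TTS text-to-speech 🔊, digital human generation 🦸, Agent web search for real-time information 🌐, ASR speech-to-text 🎙️, a Vue-based frontend 🍍, a FastAPI backend ... | 2427 | 2024-09-29 | 2024-04-05 |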
51 | TMElyralab/MuseV | MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising | 2377 | 2024-06-28 | 2024-03-25 |
52 | FujiwaraChoki/MoneyPrinterV2 | Automate the process of making money online. | 2369 | 2024-04-17 | 2024-02-12 |
53 | DeepInsight-AI/DeepBI | LLM based data scientist, AI native data application. AI-driven infinite thinking redefines BI. | 2329 | 2024-10-08 | 2023-11-20 |
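54 | jianchang512/stt | Voice Recognition to Text Tool / an offline, local speech-to-text service that outputs JSON, timestamped SRT subtitles, and plain text | 2243 | 2024-10-07 | 2023-12-28 |
55 | jingyaogong/minimind | [LLM] Train a small 26M-parameter GPT completely from scratch in 3 hours; a personal GPU is enough for inference and training! | 2203 | 2024-10-09 | 2024-07-27 |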
56 | aixcoder-plugin/aiXcoder-7B | official repository of aiXcoder-7B Code Large Language Model | 2194 | 2024-08-29 | 2024-03-30 |
57 | MzeroMiko/VMamba | VMamba: Visual State Space Models, code is based on mamba | 2078 | 2024-09-25 | 2024-01-11 |
58 | malinkang/weread2notion-pro | - | 2048 | 2024-10-09 | 2024-01-03 |
59 | THUDM/CogVLM2 | GPT4V-level open-source multi-modal model based on Llama3-8B | 2040 | 2024-09-03 | 2024-05-10 |
60 | Alpha-VLLM/Lumina-T2X | Lumina-T2X is a unified framework for Text to Any Modality Generation | 2040 | 2024-08-06 | 2024-03-28 |
61 | Langboat/Mengzi3 | - | 2032 | 2024-10-09 | 2024-03-29 |
62 | buaacyw/MeshAnything | From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers" | 1999 | 2024-08-05 | 2024-06-16 |
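63 | 6drf21e/ChatTTS_colab | 🚀 One-click deployment (offline bundle included)! Built on ChatTTS, with streaming output, random voice sampling, long-audio generation, and multi-role reading. Simple to use, no complicated installation. | 1979 | 2024-07-02 | 2024-05-30 |
64 | JaveleyQAQ/WeChatOpenDevTools-Python | WeChatOpenDevTool: force-enable the developer tools for WeChat mini programs | 1943 | 2024-09-15 | 2024-01-17 |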
65 | PKU-YuanGroup/MoE-LLaVA | Mixture-of-Experts for Large Vision-Language Models | 1937 | 2024-05-15 | 2023-12-14 |
66 | lanqian528/chat2api | A service that can convert ChatGPT on the web to OpenAI API format. | 1894 | 2024-10-07 | 2024-04-04 |
67 | NexaAI/nexa-sdk | Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), auto-speech-recognition (ASR), and text-to-speech ... | 1891 | 2024-10-09 | 2024-08-16 |
68 | Kedreamix/Linly-Talker | Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction me ... | 1863 | 2024-09-27 | 2023-10-17 |
69 | liuzhao1225/YouDub-webui | - | 1835 | 2024-05-14 | 2023-12-26 |
70 | Ucas-HaoranWei/Vary | [ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models. | 1776 | 2024-09-28 | 2023-12-07 |
71 | Clouditera/SecGPT | SecGPT: a large language model for cybersecurity | 1774 | 2024-05-08 | 2023-11-20 |
72 | Tele-AI/Telechat | - | 1765 | 2024-08-27 | 2024-01-07 |
73 | hanxi/xiaomusic | Play music on a XiaoAI speaker; the music is downloaded with yt-dlp. | 1726 | 2024-10-09 | 2023-10-14 |
74 | xinsir6/ControlNetPlus | ControlNet++: All-in-one ControlNet for image generations and editing! | 1685 | 2024-09-30 | 2024-07-02 |
75 | google-deepmind/penzai | A JAX research toolkit for building, editing, and visualizing neural networks. | 1654 | 2024-09-11 | 2024-04-04 |
76 | ymcui/Chinese-LLaMA-Alpaca-3 | Chinese LLaMA-Alpaca project, phase 3 (Chinese Llama-3 LLMs) developed from Meta Llama 3 | 1582 | 2024-09-23 | 2023-11-09 |
77 | xingpingcn/enhanced-FaaS-in-China | Improve the access speed and stability in China of web pages hosted on cloudflare, vercel or netlify by merely changing your CNAME record. Cloudflare preferred domains, Cloudflare preferred IPs ... | 1543 | 2024-10-09 | 2024-03-07 |
78 | sqzw-x/mdcx | Movie metadata scraper | 1543 | 2024-10-03 | 2023-11-06 |
79 | yerfor/GeneFacePlusPlus | GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code | 1500 | 2024-06-05 | 2024-02-01 |
80 | linyqh/NarratoAI | Using large AI models to automatically provide commentary and edit videos with a single click. | 1484 | 2024-10-08 | 2024-08-12 |
81 | qnguyen3/chat-with-mlx | An all-in-one LLMs Chat UI for Apple Silicon Mac using MLX Framework. | 1470 | 2024-09-06 | 2024-02-16 |
82 | InternLM/HuixiangDou | HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance | 1459 | 2024-10-03 | 2023-12-28 |
83 | yixiu001/serv00-login | Automated batch account keep-alive for both serv00 and ct8: logs in to the panel automatically every 3 days and sends a message to Telegram | 1455 | 2024-07-19 | 2024-07-03 |
84 | Frrrrrrrrank/auto_job__find__chatgpt__rpa | This is a tool used to automatically generate a cover letter using chatgpt based on your resume and job description and send messages to bosses in China. | 1443 | 2024-10-09 | 2024-01-02 |
85 | SunoAI-API/Suno-API | Create Music in Seconds with SunoAPI. 👇 | 1436 | 2024-08-12 | 2024-03-26 |
86 | QwenLM/Qwen-Audio | The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud. | 1425 | 2024-07-05 | 2023-11-07 |
87 | EvolvingLMMs-Lab/lmms-eval | Accelerating the development of large multimodal models (LMMs) with lmms-eval | 1423 | 2024-10-09 | 2024-03-07 |
88 | netease-youdao/BCEmbedding | Netease Youdao's open-source embedding and reranker models for RAG products. | 1410 | 2024-09-06 | 2024-01-02 |
89 | THUDM/LongWriter | LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs | 1386 | 2024-09-27 | 2024-08-12 |
90 | TencentARC/BrushNet | [ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion" | 1374 | 2024-07-17 | 2024-03-10 |
91 | ZHO-ZHO-ZHO/ComfyUI-InstantID | Unofficial implementation of InstantID for ComfyUI | 1330 | 2024-05-22 | 2024-01-22 |
92 | THUDM/CodeGeeX4 | CodeGeeX4-ALL-9B, a versatile model for all AI software development scenarios, including code completion, code interpreter, web search, function calling, repository-level Q&A and much more. | 1306 | 2024-08-25 | 2024-07-03 |
93 | KimMeen/Time-LLM | [ICLR 2024] Official implementation of " 🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models" | 1303 | 2024-08-29 | 2024-01-20 |
94 | d60/twikit | Twitter API Scraper Without an API key Twitter Internal API Free Twitter scraper Twitter Bot | 1280 | 2024-10-09 | 2024-01-20 |
95 | jianchang512/vocal-separate | An extremely simple tool for separating vocals and background music, operated through a local web page with no internet connection required, using 2stems/4stems/5stems models | 1280 | 2024-07-29 | 2023-12-26 |
96 | chflame163/ComfyUI_LayerStyle | A set of nodes for ComfyUI that can composite layer and mask to achieve Photoshop like functionality. | 1258 | 2024-10-09 | 2024-01-17 |
97 | linyiLYi/voice-assistant | A simple toy demo of a local voice assistant with whisper and large language model. | 1255 | 2024-04-30 | 2023-12-09 |
98 | sustcsonglin/flash-linear-attention | Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton | 1252 | 2024-10-01 | 2023-12-20 |
99 | lxtGH/OMG-Seg | OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24] | 1249 | 2024-10-02 | 2024-01-05 |
100 | Mzdyl/LiteLoaderQQNT_Install | Installation script for LiteLoaderQQNT | 1247 | 2024-10-08 | 2024-01-21 |
101 | DLYuanGod/TinyGPT-V | TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones | 1238 | 2024-04-18 | 2023-12-28 |
102 | cubiq/ComfyUI_InstantID | - | 1232 | 2024-09-30 | 2024-01-27 |
103 | aigc-apps/EasyAnimate | 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion | 1221 | 2024-08-22 | 2024-04-11 |
104 | kwuking/TimeMixer | [ICLR 2024] Official implementation of "TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting" | 1206 | 2024-09-21 | 2024-03-05 |
105 | thuml/iTransformer | Official implementation for "iTransformer: Inverted Transformers Are Effective for Time Series Forecasting" (ICLR 2024 Spotlight), https://openreview.net/forum?id=JePfAI8fah | 1191 | 2024-09-09 | 2023-10-19 |
106 | mini-sora/minisora | MiniSora: A community aims to explore the implementation path and future development direction of Sora. | 1182 | 2024-10-08 | 2024-02-21 |
107 | RUC-NLPIR/FlashRAG | ⚡FlashRAG: A Python Toolkit for Efficient RAG Research | 1174 | 2024-10-08 | 2024-03-14 |
108 | sMythicalBird/ZenlessZoneZero-Auto | Zenless Zone Zero (ZenlessZoneZero): Hollow Zero, auto-battle, automation, image classification, OCR recognition | 1164 | 2024-10-06 | 2024-07-13 |
109 | julep-ai/julep | API for building multi-step agent workflows. (An orchestrator for AI agents and workflows.) | 1152 | 2024-10-09 | 2024-04-04 |
110 | emcf/thepipe | Extract clean data from anywhere, powered by vision-language models ⚡ | 1137 | 2024-09-30 | 2024-03-22 |
111 | QwenLM/Qwen2-Audio | The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud. | 1132 | 2024-08-13 | 2024-06-24 |
112 | open-compass/VLMEvalKit | Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks | 1125 | 2024-10-09 | 2023-12-01 |
113 | hect0x7/JMComic-APK | Anti-loss access: JMComic (禁漫天堂) APK, Android installation package for the JMComic app, GitHub Actions | 1113 | 2024-09-24 | 2023-12-03 |
114 | KwaiKEG/KwaiAgents | A generalized information-seeking agent system with Large Language Models (LLMs). | 1085 | 2024-06-19 | 2023-11-13 |
115 | zhulu111/ComfyUI_Bxb | SD Monetization Tool (SD变现宝): convert a ComfyUI workflow into a mini program with one click. | 1078 | 2024-10-03 | 2024-05-18 |
116 | TencentQQGYLab/ELLA | ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment | 1064 | 2024-07-17 | 2024-03-07 |
117 | LinYuanovo/pikpak_auto_invite | PikPak automatic invitation program with image-recognition CAPTCHA solving; runs locally or in the cloud via GitHub Actions | 1058 | 2024-07-04 | 2024-06-08 |
118 | fufankeji/MateGen | Next-Generation Interactive Intelligent Programming Assistant | 1033 | 2024-09-20 | 2024-07-14 |
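119 | ok-oldking/ok-wuthering-waves | Automation for Wuthering Waves: background auto-battle, automatic Echo farming, locking, and merging | 1032 | 2024-10-08 | 2024-06-08 |
120 | Aabyss-Team/ARL | Backup of the official ARL repository: ARL (Asset Reconnaissance Lighthouse) aims to quickly discover internet assets associated with a target and build a baseline asset inventory. It helps in-house security teams and penetration testers reconnoiter and search assets effectively and find weak points and attack surfaces. | 1012 | 2024-08-09 | 2024-05-13 |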
121 | stanford-oval/WikiChat | WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus. | 1009 | 2024-10-07 | 2023-10-19 |
122 | qhjqhj00/MemoRAG | Empowering RAG with a memory-based data interface for all-purpose applications! | 1002 | 2024-09-29 | 2024-09-04 |
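123 | ssili126/tv | Automatically collected IPv4 hotel TV live-stream sources, with automatic playback-speed testing and daily automatic updates. Includes CCTV and satellite channels plus some local channels, with smooth playback. Can also run in Docker on OpenWrt or Synology. Now includes a method that does not require chromedriver. | 997 | 2024-09-30 | 2024-01-12 |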
124 | DachunKai/EvTexture | [ICML 2024] EvTexture: Event-driven Texture Enhancement for Video Super-Resolution | 994 | 2024-09-17 | 2024-05-18 |
125 | ogkalu2/comic-translate | Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a variety of formats (Image, Pdf, Epub, cbr, cbz, etc) and in multiple languages. | 981 | 2024-09-23 | 2024-01-25 |
126 | LibraHp/GetQzonehistory | Retrieve historical posts published on Qzone | 977 | 2024-10-09 | 2024-02-12 |
127 | QiuChenly/InjectLib | You know what I'm going to say | 945 | 2024-10-08 | 2024-06-15 |
128 | yolain/ComfyUI-Easy-Use | In order to make it easier to use the ComfyUI, I have made some optimizations and integrations to some commonly used nodes. | 941 | 2024-10-09 | 2023-12-10 |
129 | DoctorReid/ZenlessZoneZero-OneDragon | Zenless Zone Zero all-in-one: fully automatic, auto-dodge, automatic dailies, automatic Hollow runs, gamepad support | 925 | 2024-10-09 | 2024-06-07 |
130 | yerfor/Real3DPortrait | Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code | 915 | 2024-07-04 | 2024-02-02 |
131 | tech-shrimp/WechatMoments | WeChat Moments export tool (技术爬爬虾) | 907 | 2024-06-18 | 2024-03-28 |
132 | gusye1234/nano-graphrag | A simple, easy-to-hack GraphRAG implementation | 906 | 2024-10-01 | 2024-07-25 |
133 | BAAI-DCAI/Bunny | A family of lightweight multimodal models. | 897 | 2024-09-18 | 2024-01-31 |
134 | open-mmlab/PIA | [CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combining with Dreambooth, achieving stunning videos. | 889 | 2024-08-05 | 2023-12-21 |
135 | bilibili/Index-1.9B | A SOTA lightweight multilingual LLM | 881 | 2024-09-20 | 2024-06-11 |
136 | LazyAGI/LazyLLM | Easiest and laziest way for building multi-agent LLMs applications. | 868 | 2024-10-09 | 2024-06-04 |
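137 | honmashironeko/ProxyCat | A proxy-pool middleware deployable in the cloud or locally that flexibly turns static proxy IPs into tunnel IPs and provides a fixed request address; deploy once, use indefinitely | 866 | 2024-10-06 | 2024-08-21 |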
138 | multimodal-art-projection/MAP-NEO | - | 847 | 2024-06-21 | 2024-05-08 |
139 | wxywb/history_rag | - | 842 | 2024-08-07 | 2023-12-29 |
140 | Kiteretsu77/APISR | APISR: Anime Production Inspired Real-World Anime Super-Resolution (CVPR 2024) | 838 | 2024-06-28 | 2024-03-02 |
141 | eeeeeeeeee-code/e0e1-wx | Automated penetration-testing helper for WeChat mini programs | 837 | 2024-08-30 | 2024-05-10 |
142 | vivo-ai-lab/BlueLM | BlueLM(蓝心大模型): Open large language models developed by vivo AI Lab | 830 | 2024-04-22 | 2023-10-16 |
143 | heshengtao/comfyui_LLM_party | Dify in ComfyUI includes Omost, GPT-sovits, ChatTTS, GOT-OCR2.0, and FLUX prompt nodes, access to Feishu, discord, and adapts to all llms with similar openai/gemini interfaces, such as o1, ollama, qwen, GLM ... | 827 | 2024-10-09 | 2024-04-13 |
144 | showlab/MotionDirector | [ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models. | 810 | 2024-08-21 | 2023-10-12 |
145 | WECENG/ticket-purchase | Automatic ticket grabbing for Damai, with selection of attendees, city, date/session, and price | 807 | 2024-04-28 | 2023-10-12 |
146 | alipay/agentUniverse | agentUniverse is a LLM multi-agent framework that allows developers to easily build multi-agent applications. | 799 | 2024-10-08 | 2024-04-23 |
147 | codefuse-ai/ModelCache | A LLM semantic caching system aiming to enhance user experience by reducing response time via cached query-result pairs. | 790 | 2024-09-14 | 2023-11-01 |
148 | SmartFlowAI/EmoLLM | Mental health large model, LLM, The Big Model of Mental Health, Finetune, InternLM2, InternLM2.5, Qwen, ChatGLM, Baichuan, DeepSeek, Mixtral, LLama3, GLM4, Qwen2, LLama3.1 | 789 | 2024-09-19 | 2024-01-11 |
149 | ZHO-ZHO-ZHO/ComfyUI-PhotoMaker-ZHO | Unofficial implementation of PhotoMaker for ComfyUI | 785 | 2024-05-22 | 2024-01-15 |
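150 | kingmo888/rustdesk-api-server | A Django-based RustDesk API & Web Server that, beyond supporting all API features, also supports web registration, management, display, and more. Now supports up to the latest version, 1.2.7. | 782 | 2024-09-25 | 2023-12-05 |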
151 | om-ai-lab/OmAgent | A multimodal agent framework for solving complex tasks [EMNLP'2024] | 779 | 2024-10-09 | 2024-07-04 |
152 | jiayev/GPT4V-Image-Captioner | - | 774 | 2024-10-07 | 2023-12-27 |
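153 | LetheSec/HuggingFace-Download-Accelerator | High-speed downloads from a mirror site using HuggingFace's official download tool. | 768 | 2024-09-05 | 2023-11-27 |
154 | WeChatAPIs/WeChatMsgHistory_real | Real-time Chat: a project for reproducing and querying WeChat group and individual chat history. It gives developers and researchers a way to inspect WeChat chat content in depth, letting users retrieve the chat history of specific groups or private chats under specific conditions and control it through the provided API | 764 | 2024-08-02 | 2024-02-23 |
155 | yuanzl77/IPTV | Automatically updates IPTV live-stream sources every day, with IPv4/IPv6 dual-stack access! Custom channels, high-quality live-stream sources, ❌ no ads ... | 761 | 2024-10-08 | 2024-04-04 |
156 | Guovin/TV | 📺 Live TV source update tool 🚀: results include Guangdong channels, CCTV (including paid) channels, satellite channels, Hong Kong/Macau/Taiwan channels, movie channels, and Migu live streams; supports multiple update methods including multicast sources, hotel sources, subscription sources, and online search; supports custom channels (with icons); updates automatically twice a day, and results can be used in players such as TVBox; supports deployment and running via workflow, Docker, command line, or GUI | 724 | 2024-10-09 | 2024-02-04 |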
157 | lenML/Speech-AI-Forge | 🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI. | 705 | 2024-10-09 | 2024-06-01 |
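158 | Autumn-27/ScopeSentry | ScopeSentry: cyberspace mapping, subdomain enumeration, port scanning, sensitive information discovery, vulnerability scanning, distributed nodes | 701 | 2024-09-09 | 2024-02-27 |
159 | Bao-qing/123pan | 123pan download tool that uses the Android client protocol to bypass traffic limits, addressing insufficient personal traffic quota on 123pan | 697 | 2024-10-01 | 2023-12-15 |
160 | ZhiXuanWang/cf-speed-dns | CloudflareSpeedTest push: self-selected preferred IPs every 5 minutes https://ip.164746.xyz | 684 | 2024-10-09 | 2023-12-18 |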
161 | IEIT-Yuan/Yuan-2.0 | Yuan 2.0 Large Language Model | 682 | 2024-07-11 | 2023-11-26 |
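162 | lazydog28/mc_auto_boss | Wuthering Waves: automatic background farming of boss Echoes | 675 | 2024-08-18 | 2024-06-01 |
163 | WeChatAPIs/WechatBotCMD | WechatBotCMD is an innovative WeChat bot project built on Python 3.11 that combines ChatGPT models with WeChat's native API to offer integrated API services such as intelligent chat, automatic image generation, automatic Moments posting, and automatic Channels posting, aiming to make everyday communication more efficient and fun. | 669 | 2024-05-09 | 2023-12-18 |
164 | R4gd0ll/I-Wanna-Get-All | OA vulnerability exploitation tool | 665 | 2024-05-28 | 2024-02-23 |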
165 | ZHO-ZHO-ZHO/ComfyUI-Gemini | Using Gemini in ComfyUI | 663 | 2024-05-22 | 2023-12-19 |
166 | DoMusic/Hybrid-Net | Real-time audio to chords, lyrics, beat, and melody. | 661 | 2024-08-15 | 2024-03-26 |
167 | JadyXuan/NTTS | NO TIME TO SLEEP | 640 | 2024-05-26 | 2024-05-14 |
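168 | HIT-SCIR/Chinese-Mixtral-8x7B | Chinese Mixtral-8x7B (Chinese-Mixtral-8x7B) | 640 | 2024-08-17 | 2024-01-16 |
169 | X-T-E-R/Uni-TTS | This project aims to unify how various speech synthesis engines are used; it supports adapters for multiple speech synthesis engines and can be used directly as a module or started as a backend service | 632 | 2024-04-15 | 2024-02-16 |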
170 | NoCLin/LightMirrors | LightMirrors is a lightweight mirror server with caching capabilities that currently supports DockerHub, K8S, PyPI, PyTorch, and NPM. | 630 | 2024-10-07 | 2024-02-24 |
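171 | DennisThink/awesome_twitter_CN | Chinese-language Twitter users worth following | 616 | 2024-09-26 | 2024-07-12 |
172 | ViggoZ/producthunt-daily-hot | Automatically generates a daily Chinese-language list of hot Product Hunt products, committing Markdown files automatically via GitHub Actions | 604 | 2024-10-09 | 2024-08-08 |
173 | 52beijixing/smartedu-download | National Smart Education Platform for Primary and Secondary Schools | 596 | 2024-08-19 | 2024-07-02 |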
174 | ZHO-ZHO-ZHO/ComfyUI-YoloWorld-EfficientSAM | Unofficial implementation of YOLO-World + EfficientSAM for ComfyUI | 595 | 2024-05-22 | 2024-02-19 |
175 | open-mmlab/PowerPaint | [ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model. ... | 595 | 2024-09-08 | 2023-12-05 |
176 | co01cat/SqlmapXPlus | sqlmap Xplus: based on sqlmap, a secondary development of the classic database injection exploitation tool! | 587 | 2024-05-08 | 2024-02-03 |
177 | THUDM/AutoWebGLM | An LLM-based Web Navigating Agent (KDD'24) | 586 | 2024-09-27 | 2024-04-03 |
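178 | adysec/nuclei_poc | Nuclei POCs, updated daily. Automatically aggregates Nuclei vulnerability POCs from across the web, syncs the latest POCs in real time, and preserves POCs that have been deleted. Obtains Nuclei POCs by bulk-cloning GitHub projects and stores them by category, implemented with GitHub Actions (110,000 POCs so far, validated and deduplicated) | 580 | 2024-10-09 | 2024-05-07 |
179 | ymcui/Chinese-Mixtral | Chinese Mixtral mixture-of-experts LLMs (Chinese Mixtral MoE LLMs) | 580 | 2024-04-30 | 2024-01-11 |
180 | saermart/DouyinLiveWebFetcher | Scrapes live-comment (danmaku) data from the web version of Douyin live rooms (latest 2024 version) | 570 | 2024-10-08 | 2024-01-02 |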
181 | allwefantasy/auto-coder | - | 569 | 2024-10-09 | 2024-03-13 |
182 | lanbinshijie/bili2text | Convert Bilibili videos to text in one step; just enter a link | 555 | 2024-07-18 | 2023-11-25 |
183 | worm128/AI-YinMei | AI-YinMei (AI吟美): an AI streamer / VTuber | 550 | 2024-09-12 | 2024-02-18 |
184 | ihmily/outfit-anyone | Outfit Anyone (latest fixed version): Ultra-high quality virtual try-on for Any Clothing and Any Person | 541 | 2024-06-14 | 2024-04-18 |
185 | HZJQF/help_tool | Algorithm inference assistant (a "dimensionality-reduction strike") | 539 | 2024-10-05 | 2024-08-31 |
186 | t41372/Open-LLM-VTuber | Talk to any LLM with hands-free voice interaction, voice interruption, Live2D talking face, and long-term memory running locally across platforms | 539 | 2024-10-09 | 2023-11-24 |
187 | HKUDS/GraphGPT | [SIGIR'2024] "GraphGPT: Graph Instruction Tuning for Large Language Models" | 539 | 2024-06-25 | 2023-10-15 |
188 | infrost/DeeplxFile | Easy-to-use, fast, free, unlimited file size and cross platform file translation tool based on Deeplx & Playwright that supports long tex ... | 535 | 2024-09-09 | 2024-08-15 |
189 | swoole/phpy | Connecting the Python and PHP ecosystems together | 533 | 2024-09-09 | 2023-12-04 |
190 | magicbear/palworld-server-toolkit | PalWorld Server Toolkits - For Save file modify, list the players, repair sav file, etc... | 530 | 2024-07-07 | 2024-01-27 |
191 | MeoProject/lx-music-api-server | Python implementation of a resolver API server for LX Music | 530 | 2024-10-09 | 2023-11-10 |
192 | VisionRush/DeepFakeDefenders | Image forgery recognition algorithm | 529 | 2024-09-09 | 2024-08-31 |
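193 | liwenju0/cutword | A simple, fast tool for word segmentation and named entity recognition | 523 | 2024-07-01 | 2023-12-10 |
194 | FutureUniant/Tailor | Tailor is a video editing tool for intelligent video cropping, video generation, and video optimization. Its current goal is to use AI to reduce the tedious work of video editing so that ordinary users can easily reach a professional editor's level! The long-term goal is to make video editing truly AIGC! | 520 | 2024-10-03 | 2024-05-28 |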
195 | opendilab/LLMRiddles | Open-Source Reproduction/Demo of the LLM Riddles Game | 518 | 2024-07-30 | 2023-11-07 |
196 | see2023/Bert-VITS2-ext | Expression and animation testing based on Bert-VITS2. | 516 | 2024-08-27 | 2023-12-27 |
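↓ -- Thanks to our readers -- ↓
This ranking is updated continuously. If you find it helpful, please star the repository so it is easy to find later. Thank you for your support!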