Stars
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
AI模型接口管理与分发系统,支持将多种大模型转为OpenAI格式调用、支持Midjourney Proxy、Suno、Rerank,兼容易支付协议,可供个人或者企业内部管理与分发渠道使用,本项目基于One API二次开发。🍥 The next-generation LLM gateway and AI asset management system supports multiple lan…
利用AI大模型,一键解说并剪辑视频; Using AI models to automatically provide commentary and edit videos with a single click.
📱 Graphical Scrcpy to display and control Android, devices powered by Electron.
A course on aligning smol models.
Simple, unified interface to multiple Generative AI providers
TEN Agent is a conversational AI powered by the TEN, integrating Gemini 2.0 Live, OpenAI Realtime, RTC, and more. It delivers real-time capabilities to see, hear, and speak, while being fully compa…
Elegant reading of real-time and hottest news
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
程序员应该访问的最佳网站中文版
This is a study aim to transfer the single concept by using DIT model self-attention capablity
Convert PDF to HTML without losing text or format.
PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/Docker
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Your Next Store: Modern Commerce with Next.js and Stripe as the backend.
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
AGI 社交网络 Bot. BiliBili | 直播聊天数字人 | 视频@自动回复 | 私信bot | 终端聊天 | 语音交互
《一人企业方法论》第二版,也适合做其他副业(比如自媒体、电商、数字商品)的非技术人群。
Real time interactive streaming digital human