local-inference

Star

Here are 10 public repositories matching this topic...

SJTU-IPADS / PowerInfer

Star

High-speed Large Language Model Serving for Local Deployment

llama large-language-models llm local-inference llm-inference

Updated Feb 19, 2025
C++

efeslab / fiddler

Star

[ICLR'25] Fast Inference of MoE Models with CPU-GPU Orchestration

mixture-of-experts llm local-inference llm-inference mixtral-8x7b

Updated Nov 18, 2024
Python

BorjaOteroFerreira / IALab-Suite

Star

Tool for test diferents large language models without code.

api-rest chat-application flask-api inference-api large-language-models llm local-inference llamacpp llm-inference llama2 llama-cpp-python llama2-7b mixtral-8x7b

Updated Jun 14, 2025
JavaScript

tinyBigGAMES / JetInfero

Sponsor

Star

Local LLM Inference Library

pascal library win64 c-cpp ai-inference local-inference llama-cpp procedural-api

Updated Jan 30, 2025
Pascal

yas-sim / openvino-llm-chatbot-rag

Star

LLM chatbot example using OpenVINO with RAG (Retrieval Augmented Generation).

natural-language-processing offline chatbot intel edge-computing rag openvino huggingface edge-inference cloud-free llm local-inference langchain dolly2 retrieval-augmented-generation llama2 neural-chat

Updated Jan 25, 2024
Python

Raxephion / AuraGen-AuraFlow-WebUI

Star

Lightweight 6GB VRAM Gradio web app with auto-installer for running AuraFlow locally — no cloud, no clutter.

python open-source image-generation webui gradio text-to-image stable-diffusion diffusers local-inference generative-ai ai-image-generator auraflow low-vram

Updated Jun 7, 2025
Python

aTh1ef / ai-debate-agents

Star

Verify claims using AI agents that debate using scraped evidence and local language models.

python scraping beautifulsoup autonomous-agents local-inference qwen lm-studio langgraph private-llm agentic-ai claim-verification phi-4-mini evidence-based-ai

Updated Jun 1, 2025
Python

Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), auto-speech-recognition (ASR), and text-to-speech (TTS) capabilities.

ai local-inference

Updated Nov 19, 2024
Python

nazago / rag-llm-local

Star

script which performs RAG and use a local LLM for Q&A

rag llm local-inference langchain-python ollama

Updated Sep 16, 2024
Python

nazago / meeting-minutes-generator

Star

Script which takes a .wav audio file, performs speech-to-text using OpenAI/Whisper, and then, using Llama3, summarization and action point from the transcript generated

summarization speech-to-text whisper meeting-minutes local-inference langchain-python llm-inference ollama

Updated Sep 16, 2024
Python

Improve this page

Add a description, image, and links to the local-inference topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the local-inference topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

local-inference

Here are 10 public repositories matching this topic...

SJTU-IPADS / PowerInfer

efeslab / fiddler

BorjaOteroFerreira / IALab-Suite

tinyBigGAMES / JetInfero

yas-sim / openvino-llm-chatbot-rag

Raxephion / AuraGen-AuraFlow-WebUI

aTh1ef / ai-debate-agents

cuiyuheng / nexa-sdk

nazago / rag-llm-local

nazago / meeting-minutes-generator

Improve this page

Add this topic to your repo