Stars
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Question and Answer based on Anything.
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
[TPAMI 2024] This is the official repository for our paper: ''Pruning Self-attentions into Convolutional Layers in Single Path''.
Efficient Retrieval Augmentation and Generation Framework
Open-source tool to visualise your RAG 🔮
Sharing the learning along the way we been gathering to enable Azure OpenAI at enterprise scale in a secure manner. GPT-RAG core is a Retrieval-Augmented Generation pattern running in Azure, using …
Unified framework for building enterprise RAG pipelines with small, specialized models
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
[CVPR 2024] DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks
A High-efficiency Open-source Toolkit for Table-to-Latex Task
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
A Comprehensive Toolkit for High-Quality PDF Content Extraction
[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment
[ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
图像配准算法。包括 SIFT、ORB、SURF、AKAZE、BRIEF、matchTemplate
convert paddleOCR to torchOCR, ppocr-v3,ppocr-v4, onnx, openvino
Text-To-Image Generation with Chinese Characters
The most easy-to-understand tutorial for using LoRA (Low-Rank Adaptation) within diffusers framework for AI Generation Researchers🔥
Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch
The official implementation of "Train Sparsely, Generate Densely: Memory-efficient Unsupervised Training of High-resolution Temporal GAN"
Using modified BiSeNet for face parsing in PyTorch
Detecting face masks using Python, Keras, OpenCV on real video streams