Skip to content
View x1001000's full-sized avatar
😎
Dogfighting
😎
Dogfighting

Block or report x1001000

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

CV, OCR, YOLO, ID

20 repositories
Python 1 Updated Apr 29, 2023

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Python 6,051 486 Updated Jul 11, 2024

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 9,253 598 Updated Apr 16, 2024

Turn any computer or edge device into a command center for your computer vision projects.

Python 1,515 147 Updated Feb 21, 2025

We write your reusable computer vision tools. 💜

Python 24,924 1,872 Updated Feb 19, 2025

This repository offers a comprehensive collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-e…

Jupyter Notebook 7,069 1,120 Updated Feb 20, 2025

Ultralytics YOLO11 🚀

Python 36,850 7,128 Updated Feb 21, 2025

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 5,066 485 Updated Nov 5, 2024

ComfyUI YOLO-World Integration

Python 39 3 Updated Jul 5, 2024

Unofficial implementation of YOLO-World + EfficientSAM for ComfyUI

Python 682 63 Updated May 22, 2024

YOLO Face 🚀 in PyTorch

Python 367 32 Updated Jan 19, 2025

State-of-the-art 2D and 3D Face Analysis Project

Python 24,322 5,496 Updated Feb 18, 2025

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Python 11,418 833 Updated Jul 18, 2024

Torchreid: Deep learning person re-identification in PyTorch.

Python 4,405 1,149 Updated Jul 22, 2024

Document to Markdown OCR library with Llama 3.2 vision

TypeScript 2,174 202 Updated Jan 20, 2025

A Python wrapper for Google Tesseract

Python 6,013 721 Updated Feb 17, 2025

Tesseract Open Source OCR Engine (main repository)

C++ 64,740 9,703 Updated Feb 12, 2025

OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation

C++ 31,907 7,914 Updated Aug 3, 2024

tiny vision language model

Python 7,426 576 Updated Feb 20, 2025

streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

Python 2,378 182 Updated Feb 21, 2025