SiliconFlow Inc
- Shenzhen
- https://strint.github.io/
Stars
Qwen2.5-Omni is an end-to-end multimodal model from the Qwen team at Alibaba Cloud that understands text, audio, vision, and video and performs real-time speech generation.
🤖 The next generation of Multi-Modal Multi-Agent platform. 👾 🦄 🔮
Count the MACs / FLOPs of your PyTorch model.
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
DeepEP: an efficient expert-parallel communication library
Cost-efficient and pluggable infrastructure components for GenAI inference
SCUDA is a GPU over IP bridge allowing GPUs on remote machines to be attached to CPU-only machines.
Ring attention implementation with flash attention
Infinity ∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
A comprehensive toolkit for reliably locking, packing and deploying environments for ComfyUI workflows.
Tile primitives for speedy kernels
nanobind: tiny and efficient C++/Python bindings
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
An auxiliary project analyzing the characteristics of KV in DiT attention.
A debugging and profiling tool that can trace and visualize python code execution
Finetune Llama 3.3, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! 🦥
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For an HD commercial model, please try out Sync Labs.
FlashInfer: Kernel Library for LLM Serving
⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation (AAAI 2025 Oral)
OneDiff: An out-of-the-box acceleration library for diffusion models.
Efficient Triton Kernels for LLM Training
A fast communication-overlapping library for tensor/expert parallelism on GPUs.
Official inference repo for FLUX.1 models
SGLang is a fast serving framework for large language models and vision language models.