Skip to content

Latest commit

 

History

History
5 lines (4 loc) · 518 Bytes

README.md

File metadata and controls

5 lines (4 loc) · 518 Bytes

StableTriton (WIP) (Diffusers implementation working; Boosts it/s from 5.94it/s to 8.58it/s with triton and cuda graphs)

A Triton inference engine built in the same grain as Kernl https://github.com/ELS-RD/kernl/ but for diffusion models, specifically for SDXL

Usage

When done you will simply need to run model = compile(model). As long as it is a nn.Module torch.fx should overwrite native torch ops with triton ops. Will initally set it up with ComfyUI and Diffusers. Automatic1111 will be worked on soon after