Triton

Triton是一种用于编写高效自定义深度学习原语的语言和编译器。它的目标是提供一个开源环境，让用户能够以比使用 CUDA 更高的生产效率编写快速代码，同时还能比其他现有的领域特定语言（DSL）更具灵活性。

Grid

在 Triton 中，grid 用于定义 GPU 内核的执行网格，即决定内核在 GPU 上如何并行执行。

构建 grid 的方式：

固定 grid（直接赋值为元组）

grid = (16, 16)  # 2D grid，16x16 blocks
kernel[grid](...)

动态 grid （lambda 函数）

grid = lambda meta: (triton.cdiv(n_elements, meta['BLOCK_SIZE']), )
kernel[grid](...)

更复杂的动态 grid（自定义函数）

def grid(META):
    return (triton.cdiv(q.shape[2], META["BLOCK_M"]), q.shape[0] * q.shape[1], 1)
kernel[grid](...)

Installation from source

git clone https://github.com/triton-lang/triton.git
cd triton
pip install ninja cmake wheel; # build-time dependencies
pip install -e .

Triton前端与接口部分使用 Python 实现，而核心部分使用 C++ 实现，这是由于其核心任务涉及矩阵运算等密集型计算，以及对底层硬件指令的精准控制。因此，安装Triton涉及对其核心部分的 C++ 代码进行编译。（note: 与后续提到的Triton kernel的编译是不同的概念。）

Triton的 C++ 核心实现目录为lib/，包含：

Triton IR的数据结构和操作
编译 Pass管理（优化、调度、IR-Lowering）
将Triton IR转换为LLVM IR的代码
调用LLVM生成PTX的逻辑

最终产物为一个共享库python/triton/_C/libtriton.so

Triton Kernel Compilation (Lowering)

libtriton.so是Triton的 C++ 编译器核心，其通过Pybind11暴露为 Python 可调用模块triton._C.libtriton。它通常不会由用户手动调用，而是由Triton的 Python 包的内部模块自动使用。

Python Triton kernel -> Triton IR (TTIR)

在 Python 中完成，不直接调用libtriton.so

Triton IR -> LLVM IR (LLIR)

# python/triton/compiler.py
libtriton.compile_ttir_to_llir(...)

LLVM IR -> PTX

# python/triton/compiler.py
libtriton.compile_llir_to_ptx(...)

PTX -> CUBIN

# python/triton/compiler.py
libtriton.link_ptx(...)

Kernel Launch

# python/triton/runtime/launcher.py
libtriton.get_function(...)
libtriton.launch(...)

Name		Name	Last commit message	Last commit date
Latest commit History 59 Commits
06-fused-attention		06-fused-attention
01-vector-add.py		01-vector-add.py
02-fused-softmax.png		02-fused-softmax.png
02-fused-softmax.py		02-fused-softmax.py
03-matrix-multiplication.png		03-matrix-multiplication.png
03-matrix-multiplication.py		03-matrix-multiplication.py
04-low-memory-dropout.png		04-low-memory-dropout.png
04-low-memory-dropout.py		04-low-memory-dropout.py
05-layer-norm-backward.csv		05-layer-norm-backward.csv
05-layer-norm-backward.png		05-layer-norm-backward.png
05-layer-norm.py		05-layer-norm.py
07-extern-functions.png		07-extern-functions.png
07-extern-functions.py		07-extern-functions.py
README.md		README.md
Triton_lowering.png		Triton_lowering.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Triton

Grid

Installation from source

Triton Kernel Compilation (Lowering)

Python Triton kernel -> Triton IR (TTIR)

Triton IR -> LLVM IR (LLIR)

LLVM IR -> PTX

PTX -> CUBIN

Kernel Launch

流程图：

About

Uh oh!

Releases

Packages

Languages

cschenjunlin/Triton

Folders and files

Latest commit

History

Repository files navigation

Triton

Grid

Installation from source

Triton Kernel Compilation (Lowering)

Python Triton kernel -> Triton IR (TTIR)

Triton IR -> LLVM IR (LLIR)

LLVM IR -> PTX

PTX -> CUBIN

Kernel Launch

流程图：

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages