InfiniTensor/ninetoothed-examples

NineToothed Examples

This repository contains examples for NineToothed, including implementations of several common compute kernels.

Usage

After cloning this repository, you can run any of the examples using Python. For instance, to run the matrix multiplication example, execute the following command:

python matmul.py

Autotuning Behavior

By default, the examples apply autotuning, which may take several minutes or longer to complete for complex kernels. If you wish to disable autotuning, you can replace symbol definitions with concrete values. Consider the following example:

BLOCK_SIZE = Symbol("BLOCK_SIZE", meta=True)

Here, meta=True specifies that BLOCK_SIZE is a meta symbol for autotuning. To disable autotuning, you can:

  1. Set constexpr=True and pass a value when invoking the kernel.
  2. Replace the symbol definition with a fixed integer value, as shown below:

BLOCK_SIZE = 1024

Either approach lets you obtain results in seconds. However, performance then depends on the values you choose, so experiment with different values to determine the best configuration.
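One way to experiment with fixed values is a small manual sweep: time the same workload once per candidate value and keep the fastest. The sketch below is not part of this repository and does not use NineToothed. The names chunked_sum and sweep_block_sizes are hypothetical, and chunked_sum is a pure-Python stand-in for a real kernel launch, so only the structure of the sweep carries over.

```python
import time


def chunked_sum(data, block_size):
    """Stand-in workload (hypothetical): sum `data` one block at a time."""
    total = 0
    for start in range(0, len(data), block_size):
        total += sum(data[start:start + block_size])
    return total


def sweep_block_sizes(run, candidates, repeats=5):
    """Time `run(block_size)` for each candidate and return (fastest, timings)."""
    timings = {}
    for block_size in candidates:
        run(block_size)  # warm-up call, excluded from the measurement
        best = float("inf")
        for _ in range(repeats):
            start = time.perf_counter()
            run(block_size)
            best = min(best, time.perf_counter() - start)
        timings[block_size] = best  # keep the best of `repeats` runs
    fastest = min(timings, key=timings.get)
    return fastest, timings


data = list(range(100_000))
candidates = [64, 256, 1024, 4096]
best_block_size, timings = sweep_block_sizes(
    lambda block_size: chunked_sum(data, block_size), candidates
)

for block_size, seconds in timings.items():
    print(f"BLOCK_SIZE={block_size}: {seconds * 1e3:.3f} ms")
print(f"Fastest candidate: {best_block_size}")
```

To adapt this to one of the examples, replace the lambda with a function that launches the kernel using the given value. GPU launches are typically asynchronous, so the device would need to be synchronized before each clock reading (for example with torch.cuda.synchronize() when using PyTorch on CUDA); otherwise the timings would mostly measure launch overhead.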

Third-Party Code and Licenses

This project includes code modified from or inspired by other open-source repositories.

Licenses for third-party code are stored in the third_party directory. Each subdirectory contains its associated LICENSE file.

License

This repository is distributed under the Apache-2.0 license. See the included LICENSE file for details.
