Triton Resources

This page serves as a reference for useful Triton resources. Triton is a language and compiler for parallel programming that provides a Python-based programming environment for writing custom DNN compute kernels capable of running at maximal throughput on modern GPU hardware.

What is Triton?

Triton is a language and compiler for parallel programming that aims to provide a Python-based programming environment for productively writing custom DNN compute kernels capable of running at maximal throughput on modern GPU hardware.

Python-Based Programming

Write custom DNN compute kernels in Python with a familiar programming environment.

Maximal Throughput

Achieve optimal performance on modern GPU hardware with compiler optimizations.

Parallel Programming

Language and compiler designed specifically for parallel computing workloads.

Open Source

MIT licensed with an active community of contributors and users.

Kernelize & Triton

Kernelize builds on the open-source Triton compiler to provide AI inference systems for AI inference accelerators. Our platform leverages Triton's powerful capabilities to auto-generate optimized backends for a wide variety of hardware targets, helping developers write high-performance GPU kernels in Python once and deploy them across GPUs, NPUs, TPUs, and more.