Kernelize is Powered by Triton
Triton is the key enabling technology that makes day-0 support for new models possible. Kernelize uses Triton to generate optimized kernels for new hardware targets, enabling vLLM and Ollama to run on new hardware alternatives.
Day-0 Support Enables Lower Costs
By using Triton to generate optimized kernels for new hardware devices, Kernelize provides day-0 support for new models at a fraction of the cost.
Inference Platforms We Enable
What is Triton?
Triton is a language and compiler for parallel programming that aims to provide a Python-based programming environment for productively writing custom DNN compute kernels capable of running at maximal throughput on modern GPU hardware. It's the enabling technology that makes day-0 support for new models possible across diverse hardware.
Python-Based Programming
Write custom DNN compute kernels in Python with a familiar programming environment.
Maximal Throughput
Achieve optimal performance on modern GPU hardware with compiler optimizations.
Parallel Programming
Language and compiler designed specifically for parallel computing workloads.
Open Source
MIT licensed with an active community of contributors and users.
Official Resources
Official Documentation
Complete documentation for the Triton language and compiler, including tutorials, API reference, and programming guides.
Key Features:
- Getting Started Guide
- Python API Reference
- Triton MLIR Dialects
- Programming Tutorials
GitHub Repository
Official development repository for the Triton language and compiler. Access source code, issues, and contribute to the project.
Key Features:
- 15.8k+ Stars
- Active Development
- MIT License
- Community Contributions
Official YouTube Channel
Official YouTube channel featuring tutorials, talks, and updates about Triton development and usage.
Key Features:
- Video Tutorials
- Technical Talks
- Latest Updates
- Community Content
Learn Triton
LeetGPU
Interactive platform for learning GPU programming with Triton. Practice writing and optimizing GPU kernels through hands-on exercises and challenges.
Triton Tutorial Series
Comprehensive video tutorial series covering Triton fundamentals, advanced concepts, and practical examples for GPU programming.