
Kernelize Forge
Kernelize Forge is a modular backend for Triton. It extends Triton so the compiler can target more than GPUs. Forge generates an LLVM output based on target-specific primitives. It supports the autotune process to find both what is supported and what is optimal. Forge is configured at compile time based on the Nexus device discovery and autotune.