Treni
GPU agents. Fast like a train.
A single C/CUDA binary serving 4 ML models at 29x the speed of Python — sub-100ms p99, identical outputs, 1s cold boot.
GPU agents. Fast like a train.
A single C/CUDA binary serving 4 ML models at 29x the speed of Python — sub-100ms p99, identical outputs, 1s cold boot.