
OpenAI Built Its Own Chip. Inference Just Got 50% Cheaper.
OpenAI and Broadcom unveiled Jalapeño, a custom inference ASIC designed from scratch for LLM workloads. Built in nine months with AI-assisted design, it claims 50% lower cost per token than NVIDIA GPUs. With Google, Amazon, and Microsoft all building competing custom silicon, the era of GPU-only inference is ending — and the enterprise AI cost structure is about to be rewritten.
June 24, 2026