Cutting AI voice latency from 1.5s to 200ms: measure time-to-first-byte, not total time

Three levers (streaming, the flash model, and sentence chunking), with the real TTFB numbers behind each.

Jun 9, 2026 · 5 min read