.webp&w=3840&q=95)
How we engineered RAG to be 50% faster
Tips from latency-sensitive RAG systems in production
In November we announced our new, fastest model that generates speech at ≈400ms latency (+ network latency) and is over twice as fast as our V1 models.
Unfortunately users found that it struggled to pronounce long numbers. Give a listen to this generation of "The current stock price for NVIDIA is $867.49.":
Today we just released improved numbers pronunciation for our Turbo v2 model. Here's pronunciation after the change:
Thank you to all of the users who submitted feedback that inspired this fix - and please continue to share areas where our models can be improved.
Tips from latency-sensitive RAG systems in production
Eagr.ai transformed sales coaching by integrating ElevenLabs' conversational AI, replacing outdated role-playing with lifelike simulations. This led to a significant 18% average increase in win-rates and a 30% performance boost for top users, proving the power of realistic AI in corporate training.
Powered by ElevenLabs Agents