Présentation de Eleven v3 (alpha)

3 juin 2025 • 6 minutes de lecture

Mati Staniszewski, Co-founder,

Piotr Dabkowski, Co-Founder, Research

Le modèle de Text to Speech le plus expressif

Contactez les ventes Eleven v3 Prompting v3

Nous sommes ravis de dévoiler Eleven v3 (alpha) — le modèle Text to Speech le plus expressif.

Cette prévisualisation de recherche apporte un contrôle et un réalisme sans précédent à la génération de la parole avec :

70+ langues
Dialogue multi-locuteurs
Audio tags like [excited], [whispers], and [sighs]

Eleven v3 (alpha) nécessite plus d'ingénierie de prompt que les modèles précédents — mais les générations sont époustouflantes.

Si vous travaillez sur des vidéos, des livres audio ou des outils médias — cela débloque un nouveau niveau d'expressivité. Pour les cas d'utilisation en temps réel et conversationnels, nous recommandons de rester avec v2.5 Turbo ou Flash pour le moment. Une version en temps réel de v3 est en développement.

Eleven v3 est disponible dès aujourd'hui sur notre site web. L'accès public à l'API arrive bientôt. Pour un accès anticipé, veuillez contacter les ventes.

L'utilisation du nouveau modèle dans l'application ElevenLabs est à 80% de réduction jusqu'à la fin de juin. Inscrivez-vous ici.

Why we built v3

Pourquoi nous avons créé v3expressiveness. More exaggerated emotions, conversational interruptions, and believable back-and-forth were difficult to achieve.

Depuis le lancement de Multilingual v2, nous avons vu la voix IA adoptée dans le cinéma professionnel, le développement de jeux, l'éducation et l'accessibilité. Mais la limitation constante n'était pas la qualité sonore — c'était

Eleven v3 comble cette lacune. Il a été conçu de A à Z pour offrir des voix qui soupirent, chuchotent, rient et réagissent — produisant une parole qui semble vraiment réactive et vivante.

Feature	What it unlocks
Audio tags	Inline control of tone, emotion, and non-verbal reactions
Dialogue mode	Multi-speaker conversations with natural pacing and interruptions
70+ languages	Full coverage of high-demand global languages
Deeper text understanding	Better stress, cadence, and expressivity from text input

Hear v3 for yourself

Using audio tags

Utilisation des balises audioprompting guide for v3 in the docs.

Les balises audio se trouvent en ligne avec votre script et sont formatées avec des crochets carrés en minuscules. Vous pouvez en savoir plus sur les balises audio dans notre

1“[happily][shouts] We did it! [laughs].”

Par exemple, vous pourriez suggérer : « [chuchote] Quelque chose arrive… [soupire] Je le sens. » Ou pour un contrôle plus expressif, vous pouvez combiner plusieurs balises :

Créer un dialogue multi-locuteursText to Dialogue API endpoint. Provide a structured array of JSON objects — each representing a speaker turn — and the model generates a cohesive, overlapping audio file:

1[
2  {"speaker_id": "scarlett", "text": "(cheerfully) Perfect! And if that pop-up is bothering you, there’s a setting to turn it off under Notifications → Preferences."},
3  {"speaker_id": "lex", "text": "You are a hero. An actual digital wizard. I was two seconds from sending a very passive-aggressive support email."},
4  {"speaker_id": "scarlett", "text": "(laughs) Glad we could stop that in time. Anything else I can help with today?"}
5]
6

Eleven v3 est pris en charge dans notre point de terminaison Text to Speech existant. De plus, nous introduisons un nouveau

Le point de terminaison gère automatiquement les transitions de locuteur, les changements émotionnels et les interruptions.here.

v3 is our most expressive model

En savoir plus

Plan	Launch promo	After 30 days
UI (self-serve)	80% off (~5× cheaper)	Same as Multilingual V2
API (self-serve & enterprise)	Same as Multilingual V2	Same
Enterprise UI	Same as Multilingual V2	Same

Tarification et disponibilité

Use the Model Picker and select Eleven v3 (alpha)

Pour activer v3 :contact sales.

L'accès à l'API et le support dans Studio arrivent bientôt. Pour un accès anticipé, veuillez

Quand ne pas utiliser v3v3 documentation and FAQ.

Try it today

Log in to ElevenLabs UI
documentation complète de v3 3 (alpha) in the model dropdown
Paste your script — use tags or dialogue
Generate audio

We’re excited to see how you bring v3 to life across new use cases — from immersive storytelling to cinematic production pipelines.

Eleven v3 is 80% off until the end of June 2025 for self-serve users using it through the UI.

They were generated with only the Eleven v3 model.

Text to Dialogue weaves multiple voices together to create a seamless interaction between them. Matching prosody, emotional range and taking cues from audio tags, Text to Dialogue is a leap forward in generating engaging conversations.

Public API for Eleven v3 (alpha) is coming soon. For early access, please contact sales.

Eleven v3 supports a wide variety of audio tags and are somewhat voice and context dependent. Read the prompting guide for further information.

Afrikaans (afr), Arabic (ara), Armenian (hye), Assamese (asm), Azerbaijani (aze), Belarusian (bel), Bengali (ben), Bosnian (bos), Bulgarian (bul), Catalan (cat), Cebuano (ceb), Chichewa (nya), Croatian (hrv), Czech (ces), Danish (dan), Dutch (nld), English (eng), Estonian (est), Filipino (fil), Finnish (fin), French (fra), Galician (glg), Georgian (kat), German (deu), Greek (ell), Gujarati (guj), Hausa (hau), Hebrew (heb), Hindi (hin), Hungarian (hun), Icelandic (isl), Indonesian (ind), Irish (gle), Italian (ita), Japanese (jpn), Javanese (jav), Kannada (kan), Kazakh (kaz), Kirghiz (kir), Korean (kor), Latvian (lav), Lingala (lin), Lithuanian (lit), Luxembourgish (ltz), Macedonian (mkd), Malay (msa), Malayalam (mal), Mandarin Chinese (cmn), Marathi (mar), Nepali (nep), Norwegian (nor), Pashto (pus), Persian (fas), Polish (pol), Portuguese (por), Punjabi (pan), Romanian (ron), Russian (rus), Serbian (srp), Sindhi (snd), Slovak (slk), Slovenian (slv), Somali (som), Spanish (spa), Swahili (swa), Swedish (swe), Tamil (tam), Telugu (tel), Thai (tha), Turkish (tur), Ukrainian (ukr), Urdu (urd), Vietnamese (vie), Welsh (cym)

Découvrez les articles de l'équipe ElevenLabs

Product

Product

How we engineered RAG to be 50% faster

Tips from latency-sensitive RAG systems in production

Customer stories

Customer stories

Eagr.ai Supercharges Sales Training with ElevenLabs' Conversational AI Agents

Eagr.ai transformed sales coaching by integrating ElevenLabs' conversational AI, replacing outdated role-playing with lifelike simulations. This led to a significant 18% average increase in win-rates and a 30% performance boost for top users, proving the power of realistic AI in corporate training.

Créez avec l'audio AI de la plus haute qualité.

Se lancer gratuitement

Vous avez déjà un compte ? Se connecter

1	[
2	{"speaker_id": "scarlett", "text": "(cheerfully) Perfect! And if that pop-up is bothering you, there’s a setting to turn it off under Notifications → Preferences."},
3	{"speaker_id": "lex", "text": "You are a hero. An actual digital wizard. I was two seconds from sending a very passive-aggressive support email."},
4	{"speaker_id": "scarlett", "text": "(laughs) Glad we could stop that in time. Anything else I can help with today?"}
5	]
6

Présentation de Eleven v3 (alpha)

Why we built v3

Eleven v3 comble cette lacune. Il a été conçu de A à Z pour offrir des voix qui soupirent, chuchotent, rient et réagissent — produisant une parole qui semble vraiment réactive et vivante.

Hear v3 for yourself

Using audio tags

Par exemple, vous pourriez suggérer : « [chuchote] Quelque chose arrive… [soupire] Je le sens. » Ou pour un contrôle plus expressif, vous pouvez combiner plusieurs balises :

v3 is our most expressive model

En savoir plus

L'accès à l'API et le support dans Studio arrivent bientôt. Pour un accès anticipé, veuillez

Try it today

How does the Eleven v3 80% discount work?

How were the samples in the video and website generated?

How does dialogue generation work?

Is this available over API?

What audio tags are supported?

What languages does it support?

Découvrez les articles de l'équipe ElevenLabs

How we engineered RAG to be 50% faster

Eagr.ai Supercharges Sales Training with ElevenLabs' Conversational AI Agents