Question 1

What is text-to-speech?

Accepted Answer

Text-to-speech (TTS) is an AI technology that converts written text into spoken audio. Modern neural TTS systems produce natural-sounding voices with appropriate intonation, pacing, and emotion, far surpassing the robotic voices of earlier systems.

Question 2

How is TTS used in meeting tools?

Accepted Answer

TTS in meeting tools can read back summaries and action items, provide audio versions of meeting notes for review during commutes, and power AI assistant responses. It complements speech-to-text by closing the loop between spoken and written content.
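As a rough illustration of the readback use case, the sketch below turns structured meeting notes into a short script of plain sentences and hands it to a TTS engine. Everything here is an assumption made for illustration: the MeetingNotes type, the build_readback_script and read_back functions, and the synthesize callable (a stand-in for whatever TTS engine a tool actually uses) are hypothetical names, not part of any particular product or library.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class MeetingNotes:
    """Hypothetical container for the written output of a meeting."""
    title: str
    summary: str
    action_items: List[str] = field(default_factory=list)


def build_readback_script(notes: MeetingNotes) -> str:
    """Turn structured notes into plain sentences suited to being spoken aloud."""
    parts = [f"Summary of {notes.title}.", notes.summary.strip()]
    count = len(notes.action_items)
    if count == 0:
        parts.append("There are no action items.")
    elif count == 1:
        parts.append("There is 1 action item.")
    else:
        parts.append(f"There are {count} action items.")
    for number, item in enumerate(notes.action_items, start=1):
        # Normalize trailing punctuation so each item ends with one period.
        parts.append(f"Item {number}: {item.strip().rstrip('.')}.")
    return " ".join(parts)


def read_back(notes: MeetingNotes, synthesize: Callable[[str], bytes]) -> bytes:
    """Build the script and pass it to a TTS engine.

    `synthesize` is an assumed interface: any function that takes text
    and returns encoded audio bytes.
    """
    return synthesize(build_readback_script(notes))
```

Keeping the script-building step separate from synthesis means the spoken wording can be checked as ordinary text before any audio is generated.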

Question 3

What is the difference between TTS and STT?

Accepted Answer

TTS (text-to-speech) converts text into audio: it speaks. STT (speech-to-text) converts audio into text: it listens. They are complementary technologies. Meeting AI primarily uses STT for transcription, while TTS is used for readback and accessibility features.
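The two directions can be made concrete with a minimal sketch of a single voice-assistant turn: STT handles the listening side, TTS handles the speaking side, and ordinary text processing sits between them. The SpeechToText and TextToSpeech interfaces, the assistant_turn function, and the Fake* classes are hypothetical names invented for this example; the fake engines simply treat the audio bytes as UTF-8 text so the flow can be followed without a real speech model.

```python
from typing import Callable, Protocol


class SpeechToText(Protocol):
    """Listening side: audio in, text out (assumed interface)."""
    def transcribe(self, audio: bytes) -> str: ...


class TextToSpeech(Protocol):
    """Speaking side: text in, audio out (assumed interface)."""
    def synthesize(self, text: str) -> bytes: ...


def assistant_turn(
    audio_in: bytes,
    stt: SpeechToText,
    tts: TextToSpeech,
    reply: Callable[[str], str],
) -> bytes:
    """One round trip: listen with STT, answer in text, speak with TTS."""
    heard = stt.transcribe(audio_in)
    answer = reply(heard)
    return tts.synthesize(answer)


class FakeSTT:
    """Stand-in engine: pretends the audio bytes are UTF-8 text."""
    def transcribe(self, audio: bytes) -> str:
        return audio.decode("utf-8")


class FakeTTS:
    """Stand-in engine: pretends UTF-8 text bytes are audio."""
    def synthesize(self, text: str) -> bytes:
        return text.encode("utf-8")


audio_out = assistant_turn(
    b"what time is the review",
    FakeSTT(),
    FakeTTS(),
    lambda question: f"You asked: {question}",
)
```

In a real tool the fake classes would be replaced by actual speech engines, while the shape of the turn (audio to text, text to text, text to audio) would stay the same.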
