Gemini 3.5 Live Translate is Google’s audio model for near real-time speech-to-speech translation in 70+ languages for calls, meetings, lessons, and broadcasts.
PodWalk: Guided Tours creates location-based audio walking tours for cities, towns, neighborhoods, and streets, with hands-free narration, offline playback, and multilingual support.
MAI-Voice-2 by Microsoft AI generates natural, expressive text-to-speech in 15 languages with emotion control and voice prompting.
Voiser.ai is an AI text-to-speech and voiceover generator for fast narration, promotional content, and multilingual audio projects.
Our Stories is a multilingual family story-sharing product for reading, listening to, and sharing one story in the languages your family speaks.
Wallie is an open-source AI streamer framework with real-time vision, persona profiles, chat, TTS, and avatar output for VTuber-style streams.
Podio: News Podcast Maker uses AI to turn your chosen topics and news interests into a personalized daily podcast stream for hands-free listening on iPhone and iPad.
Reader Alive is an AI ebook reader for iPhone and iPad with EPUB, PDF, MOBI, and AZW3 support, plus translation, TTS, summaries, and chat.
Capture and extract text anywhere on your Mac screen—including images and videos—then translate on macOS 26+, listen via TTS, and mask sensitive data.
FlowSpeech converts scripts to human-like AI text-to-speech with context-aware emotion, precise pause control, and 30+ voices in 70+ languages.
Gemini 3.1 Flash TTS by Google is a text-to-speech model for natural, expressive AI speech with granular audio tags and SynthID watermarking.
Lightning TTS v3 is Smallest.ai’s low-latency, multilingual text-to-speech API with voice cloning—made for voice agents and production audio.
Speak with Claude and listen to voice responses in Claude Voice Mode. Switch between voice and text within the same conversation.
Read the Quran online for free with audio recitation and translations, including word-by-word analysis in 18 languages on easyquran.ai.
Voxtral TTS is Mistral AI’s multilingual TTS model for natural, low-latency speech and adaptable speaker voices for voice agents.
Clipchamp AI Voice Over Generator is an online text-to-speech tool for creating realistic voiceovers for videos. Choose the language, speed, and emotion.
LOVO is an AI voice generator and text-to-speech tool that creates realistic voiceovers in 100+ languages with an online video editor.
TADA (Text-Acoustic Dual Alignment) is Hume AI’s open-source text-to-speech model that synchronizes text and audio one-to-one for reliable, fast generation.
Ondoku is a text-to-speech (TTS) tool: paste your text, choose a voice and language, listen online, and download as .mp3.
Xeder is a Chrome extension that reads your X (Twitter) feed aloud, so you can listen to updates while your hands and eyes do other things.