Lightning TTS v3 is Smallest.ai’s low-latency, multilingual text-to-speech API with voice cloning—made for voice agents and production audio.
Voxtral TTS is Mistral AI’s multilingual TTS model for natural, low-latency speech and adaptable speaker voices for voice agents.
Gemini 3.1 Flash Live is Google’s real-time audio and voice model for natural, reliable voice interactions across Google products and developer APIs.
Turn any article into a podcast episode. Paste a link to listen in your podcast app or subscribe to a daily topic-curated feed.
Voizematic is AI voice agent software for inbound/outbound phone automation, including unlimited calls, Google Calendar booking, and follow-ups in 25+ languages.
Clipchamp AI Voice Over Generator is an online text-to-speech tool to create realistic voiceovers for videos. Choose languages, speed & emotion.
Maestra is an AI media translation platform that generates transcripts, subtitles, and multilingual voiceovers for video and audio localization.
Inworld AI offers realtime text-to-speech, speech-to-text, and speech-to-speech APIs—plus a Router for failover across multiple LLM providers.
Fliki creates AI videos and voiceovers from text, ideas, PPTs, blogs, or product URLs—multilingual with AI avatars. Start free, no credit card.
WikiTrip is a location-based travel audio guide for iPhone that reads nearby Wikipedia articles aloud in an AI voice for hands-free listening.
Synthesys.io is an AI content suite for realistic avatar videos with voice-overs, video dubbing into multiple languages, and matching images.
Turn a single live stream into a multilingual broadcast with real-time AI audio dubbing for YouTube, Twitch, X and more.
LOVO is an AI voice generator and text-to-speech tool that creates realistic voiceovers in 100+ languages with an online video editor.
Herodot AI delivers AI-powered audio guides and self-guided tours worldwide—snap a photo for stories and use map-based navigation on your phone.
TADA (Text-Acoustic Dual Alignment) is Hume AI’s open-source text-to-speech model that synchronizes text and audio one-to-one for reliable, fast generation.
Ondoku is a text-to-speech (TTS) tool: paste your text, choose a voice and language, listen online, and download as .mp3.
Coursebox is an AI training video generator that turns scripts and course content into avatar-presented videos with multiple language and voice options.
LaterAI - Save, Read & Listen saves Safari articles into a clean reader or AI listening mode, with on-device reading, highlights, summaries & TTS.
Talkpal is an AI language learning web & mobile app for 80+ languages, offering interactive practice for speaking, listening, writing & pronunciation.
Xeder is a Chrome extension that reads your X (Twitter) feed aloud, so you can listen to updates while your hands and eyes do other things.