Text-to-Speech

64 products

Oakamo

Oakamo is a calm reading space for saving articles, reading without distractions, listening to articles as audio, and returning to a synced personal library across devices.

VersaVoice AI

VersaVoice AI is a cross-lingual voice communication app for voice messages, text translation, voice cloning, and in-person translation. It is designed to help people and teams communicate across languages while keeping the speaker's voice and identity.

MonstaReel

MonstaReel is an AI short-form video creator that turns a topic into a scripted vertical MP4 with voice, captions, and export for TikTok, Instagram Reels, and YouTube Shorts. It is designed for creators who want to produce short videos faster without handling each editing step manually.

Narration Room

Narration Room turns scripts into playable narrations on Apple devices using on-device voices. It supports writing, pasting, importing, or dictating text, with a local library and optional iCloud sync for script copies.

Labs AI : Text to Speech

Labs AI : Text to Speech is an iPhone app that turns written text into natural-sounding speech with customizable voices and multi-language support. It is aimed at creators and professionals who need quick voiceovers or dubbed audio from text.

Gemini 3.5 Live Translate

Gemini 3.5 Live Translate is Google’s near real-time speech translation model for developers, Google Meet, and the Google Translate app. It supports 70+ languages and is designed to produce natural-sounding translated audio during live conversations.

MAI-Voice-2

MAI-Voice-2 is Microsoft AI’s text-to-speech model for natural, expressive speech in assistants, support experiences, long-form narration, and accessibility use cases. It is available in Microsoft Foundry and supports 15 languages/locales, emotion control, and short-reference custom voice creation.

Voiser AI

Voiser AI Voiceover turns text into spoken audio for voiceovers, with multilingual voice options and style controls for different narration needs. It supports a web studio workflow, and the site lists free, paid, and enterprise plans.

Our Stories

Our Stories is a family storytelling web app for creating, reading, and listening to custom stories in multiple languages. It is designed for multilingual households and for families who want to share bedtime stories across distance.

Wallie

Wallie is an open-source AI streamer that watches your screen, hears chat, and generates live commentary in a configurable persona. It runs locally on your machine with your own keys and is aimed at faceless content, autonomous streams, and real-time reactions.

Reader Alive

Reader Alive is an AI ebook reader for iPhone and iPad that supports EPUB, PDF, MOBI, and AZW3 files. It adds translation, text-to-speech, summaries, and book-aware chat for personal ebook libraries.

Selectable

Selectable is a macOS OCR and text-capture utility for extracting text from anywhere on your screen, including images and videos. It supports copying, translation on newer macOS versions, text-to-speech, and output cleanup.

FlowSpeech

FlowSpeech is a context-aware text-to-speech studio that turns scripts and uploaded files into human-like audio. It offers multiple generation modes, pause and emotion control, and a free plan alongside paid tiers.

Gemini 3.1 Flash TTS

Gemini 3.1 Flash TTS is Google’s preview text-to-speech model for generating expressive AI speech with fine-grained control over style and delivery. It is available across the Gemini API, Google AI Studio, Vertex AI, and Google Vids.

Smallest.ai Lightning TTS

Smallest.ai Lightning TTS is a text-to-speech API for generating spoken audio from text with low latency, multilingual support, and fast voice cloning. It is aimed at developers and product teams building voice agents, narrated content, and other production speech workflows.

Claude voice mode

Claude voice mode is a beta feature that enables spoken conversations with Claude on the web and in Claude Mobile for iOS and Android. It supports hands-free and push-to-talk interaction, voice selection, and switching between speech and text within the same chat.

easyquran.ai

Read the Quran online for free with audio recitation and translations, including word-by-word analysis in 18 languages, for all 114 surahs.

Voxtral TTS

Voxtral TTS is Mistral’s text-to-speech model for generating lifelike, multilingual speech for voice agents and enterprise voice workflows. It supports short-reference voice adaptation, low-latency output, and access through Mistral Studio, Le Chat, the API, and open weights on Hugging Face.

Clipchamp

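Clipchamp's AI voiceover generator is an online text-to-speech feature that generates narration and voiceovers for videos. It supports multilingual voice selection and adjustable speaking pace and voice tone, and it can be used directly in the browser.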

TADA

TADA is Hume AI's open-source speech-language model for generating speech with one-to-one text-acoustic synchronization. It is aimed at developers and researchers who need fast, reliable text-to-speech that can also fit on-device or long-form use cases.