Online Text-to-Speech
Enter or paste text in the browser and have it read aloud with the selected voice, without installing desktop software. The page also notes that you can listen directly or download the result as an .mp3 audio file.
Ondoku is a browser-based text-to-speech tool that turns text into downloadable .mp3 audio, with free and paid plans, multilingual reading, image reading, and commercial use options.
Ondoku is an online text-to-speech tool mainly used to convert entered text into listenable, downloadable speech. The homepage says it can be used directly in the browser, and the generated speech can be downloaded as an .mp3 file or played back immediately.
The product offers both free and paid usage options. Unregistered users can try 1,000 characters per month, and free registration increases that to 5,000 characters; paid plans are billed by character quota, with the page listing several tiers from 2 million characters per month to 10 million characters per month.
Based on the site information, Ondoku is aimed at individuals and organizations that need to generate speech content quickly. It supports multilingual reading and allows commercial use as long as attribution and prohibited-use rules are followed. The homepage and help articles also mention image reading, history, conversation features, and sharing features as part of the workflow.
Enter or paste text in the browser and have it read aloud with the selected voice, without installing desktop software. The page also notes that you can listen directly or download the result as an .mp3 audio file.
The homepage includes an "Image" entry point, and related descriptions mention reading content from images, which is useful for turning text in images into speech.
There is a broad range of language and voice options. Related pages say Ondoku supports more than 80 languages and dialects and offers more than 650 voice actors or voice samples for preview.
Controls include basic speech settings such as speaking speed and pitch, making it easier to adjust delivery for different content scenarios.
The page emphasizes that generated speech can be downloaded as an audio file and used commercially; free use requires attribution, while paid members do not need attribution.
The Advanced AI Speech Synthesis Beta page says the platform provides a newer generation of AI speech synthesis for a more natural reading experience.
Enter articles, scripts, or short passages in the browser and instantly generate speech you can listen to, which is useful for anyone who wants to hear content before proofreading it.
Download the spoken result as an .mp3 file for use in video narration, explanation tracks, or other content production workflows that need audio files.
Choose the most suitable voice based on language, dialect, or voice sample, which is helpful for multilingual content, pronunciation study, or materials aimed at different regional audiences.
Use the generated speech for blogs, websites, videos, or other commercial scenarios while following attribution rules; paid members can use it without attribution.
Convert text in images into speech and combine that with history, sharing, or conversation features to handle repetitive tasks, which is useful for users who need to manage content in batches or continuously.
Yes. Ondoku offers a free usage quota: unregistered users can try 1,000 characters per month, and free registration increases that to 5,000 characters per month.
Yes. The page says generated speech can be saved as .mp3 audio files and used for commercial purposes, but free use requires attribution according to the rules.
Paid plans are billed monthly or yearly, and the page lists Basic, Value, and Premium tiers; paid members do not need attribution, and unused characters do not roll over to the next month.
Yes. The help page says you can cancel your subscription in Settings after logging in, and you can also delete your account; after cancellation, the remaining characters can still be used until the next renewal date.
Yes. The page provides both text input and an image selection entry point, and it mentions reading content from images; related pages also describe support for multiple languages and a range of voice samples.
Gemini 3.1 Flash TTS is Google’s preview text-to-speech model for generating expressive AI speech with fine-grained control over style and delivery. It is available across the Gemini API, Google AI Studio, Vertex AI, and Google Vids.
蓝藻AI是一款在线AI配音与语音合成产品,可将文字转成语音,并支持自助声音克隆。页面信息显示它面向短视频、有声书等需要配音的内容场景。
Typecast is an online AI voice generator that turns text into life-like speech with emotional delivery and a selection of hyper-realistic voices. It is a browser-based tool for creating spoken audio from written content.
Noiz AI is an AI text-to-speech, voice cloning, and voice design tool for creating lifelike speech from text. It also lets users shape voice delivery, including emotion, within the same workflow.
魔音工坊 (Moying Gongfang) is an intelligent online text-to-speech (TTS) platform that converts written text into high-quality voiceovers using realistic human voices with various accents.
TADA is Hume AI’s open-source speech-language model for generating speech with one-to-one text-acoustic alignment. It is aimed at developers and researchers building faster, more reliable voice systems, including on-device and long-form speech applications.