Text to speech with a large voice library
Generate voiceovers from text with a library of 500+ voices across 100 languages, aimed at content such as podcasts, YouTube videos, audiobooks, e-learning, advertisements, and corporate training.
LOVO is a web-based AI voice generator and text-to-speech platform with an integrated video editor, voice cloning, subtitles, and script tools. It is aimed at creators and teams producing narrated videos, training content, and other voice-driven media.
LOVO is a web-based AI voice generator and text-to-speech platform centered on its Genny editor. It combines voice generation with online video editing so users can produce narrated content, edit video, and manage related assets in one place.
The site says LOVO offers 500+ voices in 100 languages and supports realistic voices, subtitle generation, script writing, voice cloning, royalty-free image generation, team collaboration, and an API. The product is positioned for creators and teams making marketing, training, social media, podcast, audiobook, and corporate video content.
Generate voiceovers from text with a library of 500+ voices across 100 languages, aimed at content such as podcasts, YouTube videos, audiobooks, e-learning, advertisements, and corporate training.
Use Genny’s online video editor to create and edit videos in one workflow, with voiceover production and video editing kept together.
Add subtitles automatically in 20+ languages, then customize, animate, and transform the video from within the editor.
Create custom voices from about one minute of audio, giving teams a way to build a consistent brand voice for repeat use.
Use the AI writer and AI image generator to draft scripts and create royalty-free images without leaving the product.
Access team collaboration and an API for building with LOVO’s voices in your own app or service.
Produce narration for social clips, YouTube videos, ads, and other short-form or branded video content using synthetic voices and the online editor.
Build training modules, explainers, and e-learning videos that need clear narration and subtitles in multiple languages.
Create audiobooks, podcast narration, or character-style voice work where a consistent synthetic voice is useful.
Generate a custom voice from a short sample when a project needs a repeated brand voice or a specific speaker identity.
Let a team collaborate on projects in Genny and keep assets in cloud storage while iterating on voiceover and video edits.
LOVO is a web-based AI voice generator and text-to-speech product. You can enter script text and generate voice output using its voice library and Genny editor, or use voice cloning for custom voices.
The source mentions 500+ voices in 100 languages, and also notes text-to-speech support in 100+ languages on the homepage. It does not provide a full language list on the pages reviewed.
LOVO says Genny includes an online video editor, subtitle generation, AI writing, voice cloning, AI image generation, team collaboration, and an API. The source does not describe export formats or all workflow steps in detail.
The pricing page and homepage mention a free trial and no credit card required, but the pages reviewed do not list full plan pricing or limits in the provided text.
The source does not specify all supported output formats, detailed setup steps, or exact team-seat rules. For those details, the reviewed pages only indicate that team collaboration and cloud storage are available.
蓝藻AI是一款在线AI配音与语音合成产品,可将文字转成语音,并支持自助声音克隆。页面信息显示它面向短视频、有声书等需要配音的内容场景。
Noiz AI is an AI text-to-speech, voice cloning, and voice design tool for creating lifelike speech from text. It also lets users shape voice delivery, including emotion, within the same workflow.
CAMB.AI Streams dubs live audio in multiple languages in real time for broadcasts on platforms like YouTube, Twitch, and X. It plugs into existing live workflows using common streaming protocols and avoids a post-production step.
Kits AI is an AI music production platform for voice cloning, vocal generation, and vocal processing. It offers a Free plan, paid tiers, and a Windows desktop app for producers and creators working with studio-style audio workflows.
Gemini 3.1 Flash TTS is Google’s preview text-to-speech model for generating expressive AI speech with fine-grained control over style and delivery. It is available across the Gemini API, Google AI Studio, Vertex AI, and Google Vids.
Ondoku is a browser-based text-to-speech tool that turns text into downloadable .mp3 audio, with free and paid plans, multilingual reading, image reading, and commercial use options.