Smallest.ai Lightning TTS is a text-to-speech API for generating spoken audio from text with low latency, multilingual support, and fast voice cloning. It is aimed at developers and product teams building voice agents, narrated content, and other production speech workflows.
飞影数字人是一个AI数字人创作平台,支持形象克隆、声音克隆和口型同步视频生成。它面向个人创作者与企业用户,提供网页端、移动端小程序以及企业级 API 入口。
CAMB.AI Streams is a real-time multilingual live streaming product that dubs live audio into one or more languages as a broadcast happens. It is aimed at creators, sports, news, and entertainment teams that want to reach global audiences without post-production.
Inworld AI is a developer-focused voice AI platform for realtime text-to-speech, speech-to-text, and LLM routing. It supports streaming speech generation, voice cloning, voice design, and tiered pricing from On-Demand through enterprise custom plans.
NaturalReader offers AI-powered text-to-speech solutions for various applications, including personal use, commercial licensing, and educational purposes.
Listnr AI is an AI voice generator and text-to-speech platform for creating voiceovers, narration, podcasts, audiobooks, dubbing, and other spoken-audio content. It offers multilingual voices, voice cloning, and an API for integrating speech generation into apps and websites.
蓝藻AI是一款在线AI配音与语音合成产品,可将文字转成语音,并支持自助声音克隆。页面信息显示它面向短视频、有声书等需要配音的内容场景。
Noiz AI is an AI text-to-speech, voice cloning, and voice design tool for creating lifelike speech from text. It also lets users shape voice delivery, including emotion, within the same workflow.
Elai is a browser-based AI video generator for creating training, presentation, and marketing videos from text, URLs, slides, or prompts. It supports avatars, voice cloning, translation, templates, and interactive video features for teams that need to produce video at scale.
鬼手剪辑 is an AI video translation and dubbing platform for subtitles, translation, text removal, voice cloning, and background audio processing.
undefined数智人创作平台 is a web-based AI video generation platform for creating short videos with a user’s own image and voice. The collected pages also point to digital-human live streaming and separate creation and account areas, but they do not expose detailed pricing or technical documentation.
Magicam is a real-time face swap and voice change tool for live streams, video recording, and conference use. It supports a single-photo workflow, a built-in virtual camera, and a paid Pro upgrade shown on the homepage.
Rask AI is an AI video and audio localization platform for translation, dubbing, subtitles, and lip-sync. It is designed for creators, educators, and enterprises that need to adapt content for multilingual audiences.
The 讯飞虚拟数字人 platform utilizes the latest AI virtual avatar technology to provide multi-scenario virtual human solutions, including image customization, voice cloning, and multimodal intelligent interaction.
飞影数字人是一款AI数字人创作平台,用于克隆形象和声音并生成口型同步视频。它面向创作者、企业和开发者,提供网页端、多端访问以及API/OEM接入能力。
Wondercraft is an AI-native video studio for creating and editing videos, podcasts, and audio content with guided workflows and a built-in editor. It helps individuals and teams generate business-ready outputs from prompts or source files, then refine and export them in the same app.
Resemble AI is a generative AI security platform for creating, watermarking, verifying, and detecting synthetic media across audio, image, and video, with Flex and Enterprise options.
LOVO is a web-based AI voice generator and text-to-speech platform with an integrated video editor, voice cloning, subtitles, and script tools. It is aimed at creators and teams producing narrated videos, training content, and other voice-driven media.
Papers with Code is a research discovery site for browsing machine learning benchmarks, trending papers, and linked implementations. It helps readers track state-of-the-art results and move from a paper summary to arXiv or GitHub when available.
YunmuSync is a web-based AI video translation tool that uses voice cloning, lip-sync alignment, and subtitle handling to localize videos into multiple languages. It is positioned for short drama, film, e-commerce, online courses, and creator distribution workflows.