飞影数字人

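飞影数字人 is an AI avatar creation platform that supports avatar cloning, voice cloning, and lip-sync video generation. It serves individual creators and business users through a web app, a mobile mini program, and an enterprise API.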

About 飞影数字人

飞影数字人 is an AI avatar creation platform launched by Shanghai Lingzhiyu Technology Co., Ltd., with core capabilities including avatar cloning, voice cloning, and lip-sync video generation. The official site positions it as a tool for quickly creating a dedicated digital avatar appearance and voice, supporting creation workflows started from a single description, a photo, or a short video.

The product is aimed at individual creators and business users, and is suitable for scenarios that need steady output of spoken videos, brand content, livestream support materials, or avatar demonstrations. The official site also provides API documentation for enterprises and developers, as well as an OEM introduction, showing that it is not only a front-end creation tool but also supports deeper business integration.

Core Features

Avatar Cloning and AI Generation

Supports uploading about 5 seconds of personal video to quickly recreate a real person as a digital avatar, and also lets users generate an avatar from scratch with AI.

Voice Cloning

Supports uploading 5 to 30 seconds of audio to reproduce the speaker's vocal characteristics, speaking style, accent, and acoustic environment as closely as possible, producing a reusable voice asset.

Video Creation

Generate lip-sync videos from text or audio for spoken content, promotions, or demos.

Fast Modeling and Generation

The official site emphasizes fast training and generation: avatar modeling can be completed in seconds, and video generation can take only a few seconds, making it suitable for fast iteration.

Multi-Pose Avatar Driving

Supports avatar driving in different scene states such as front-facing, side-facing, walking, and running, covering a wider range of appearance and motion scenarios.

API and OEM Capabilities

Provides enterprise and developer APIs, along with OEM support, making it easier to integrate avatar capabilities into business systems or offer services under a proprietary brand.

Use Cases

Short Spoken-Video Creation

Suitable for making knowledge, emotional, parenting, and reading-related spoken-content videos, reducing the cost of on-camera appearances and repeated recordings.

E-commerce Livestream Assistance

Suitable for continuing content output during gaps in a live host's schedule, extending livestream duration, and serving as auxiliary on-camera support for e-commerce livestreams.

Personal Branding and IP Building

Suitable for individual bloggers, business owners, or brand accounts that need continuous output but cannot appear on camera long term, helping maintain a consistent image.

Advertising and Sales Videos

Suitable for advertising and product promotion videos, combining digital avatars with product footage to create more convenient marketing materials.

Corporate Promotion and Training

Suitable for enterprises creating branded promotions, product introductions, internal training, and avatar content for press conference scenarios.

Pros and Cons

Pros

Supports avatar cloning, voice cloning, and video generation, covering the full workflow from source material collection to final output.
The official site emphasizes a low barrier to entry, allowing creation to start with only short video or audio materials.
Supports desktop browser, mobile browser, and WeChat mini program access, offering flexible entry points.
Provides enterprise-grade API and OEM documentation, making it suitable for teams that need system integration or branded delivery.

Cons

The official site has no working public pricing page (the pricing URL currently returns 404), so fees and plans require direct inquiry.
The API integration page says you must first contact sales to obtain a test token, which is a more enterprise-oriented integration model.
Some capability descriptions come from the official marketing pages, so actual results may vary with material quality, usage scenario, and the specific model capabilities.

FAQ

What is 飞影数字人 mainly used for?

飞影数字人 is an AI avatar creation platform for cloning avatars, cloning voices, and generating lip-sync videos. According to the official site, it supports creating videos from text or audio, as well as avatar recreation through photo avatars, video avatars, and AI-generated avatars.

What materials are needed to clone an avatar and voice?

The official site says avatar recreation can be done by uploading about 5 seconds of personal video; voice cloning supports uploading 5 to 30 seconds of audio. After cloning, you can go directly into the creation workflow and generate videos from text or audio.
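
The audio length requirement above can be checked locally before uploading. The following is a minimal Python sketch, not part of any official 飞影数字人 tooling: the function names are hypothetical, it handles WAV files only, and it assumes the quoted 5 to 30 second range is a hard limit, which the official site does not confirm.

```python
import io
import wave

# Bounds taken from the 5-to-30-second range quoted above. Whether the
# platform enforces them exactly is an assumption.
MIN_AUDIO_SECONDS = 5.0
MAX_AUDIO_SECONDS = 30.0


def wav_duration_seconds(source):
    """Return the duration of a WAV file given a path or a binary file object."""
    with wave.open(source, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def audio_length_ok(seconds):
    """Return True if the clip length falls inside the quoted range."""
    return MIN_AUDIO_SECONDS <= seconds <= MAX_AUDIO_SECONDS


def make_silent_wav(seconds, rate=8000):
    """Build an in-memory mono 16-bit WAV of silence, for demonstration only."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)
        wav.setframerate(rate)
        wav.writeframes(b"\x00\x00" * int(seconds * rate))
    buffer.seek(0)
    return buffer
```

For a real recording, pass its file path to wav_duration_seconds and feed the result to audio_length_ok; make_silent_wav exists only so the sketch can be tried without a sample file.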

Can 飞影数字人 be used on a phone?

Yes. The official FAQ states that 飞影数字人 supports access through desktop and mobile browsers, and the mobile side can also use the "飞影数字人" WeChat mini program directly.

Does 飞影数字人 support API integration?

Yes. The official FAQ clearly states that 飞影数字人 supports API access; the API page further explains that enterprises and developers can integrate through the enterprise API, and API access requires contacting sales first to obtain a test token and view the interface documentation.

How do enterprises usually integrate or deploy it?

The main workflow described on the official site is to first complete avatar and voice cloning, then enter the creation page to input text or audio and generate lip-sync videos. The API page also mentions OEM services, which suit enterprises that need to offer avatar products under their own brand.

Quick Facts

Product Type: AI avatar creation platform
Developer: Shanghai Lingzhiyu Technology Co., Ltd.
Core Capabilities: Avatar cloning, voice cloning, lip-sync video generation
Access Methods: Desktop and mobile browsers, plus a WeChat mini program
Enterprise Capabilities: Provides API and OEM documentation
Official Domain: flyworks.live

Alternatives to 飞影数字人

HeyGen Developers

Official HeyGen API documentation for building AI avatar videos, translations, lipsync, and interactive video-agent sessions. It supports direct API use plus MCP and CLI-style workflows for developers and AI agents.

HeyGen Avatar V

HeyGen Avatar V creates a digital twin from a 15-second webcam video and generates talking-avatar videos with consistent identity, motion, and voice.

Wallie

Wallie is an open-source AI streamer that watches your screen, hears chat, and generates live commentary in a configurable persona. It runs locally on your machine with your own keys and is aimed at faceless content, autonomous streams, and real-time reactions.

VIDEOAI.ME

VIDEOAI.ME is an AI video generator for making spokesperson-style videos, ads, explainers, and social content from a script. It is aimed at founders, marketers, agencies, and creators who want to produce videos without filming.

艺映AI

艺映AI is a free AI video creation tool for generating video from text, images, or existing footage. It is positioned for short-form social content, promotional clips, and stylized AI video projects.

TapNow

TapNow is a web-based AI visual creation platform for businesses, creators, and teams. It supports image and video generation along with editing, planning, and collaboration tools.