Getting Started

What is Video Avatar?

Input a video featuring an avatar and a script (text or audio), and generate a new avatar video with lip-sync to the script.

🌅

Our avatar uses zero-shot technology—no lengthy cloning training required. Just provide a video for instant lip-syncing.


How It Works

Use an existing avatar

"Existing avatar" includes avatar templates from the public library as well as avatars you have previously created and saved. To use these existing avatars, follow the steps below:

  1. Use List All Avatars to obtain the corresponding avatar ID.
  2. Use Submit Video Avatar Task to create a digital human video synthesis task.
  3. Use Query Video Avatar Task to check the task status and obtain the synthesis result.

Create and use your own avatar

  • If you want to use your own video as an avatar template, you can directly pass your video’s videoFileId parameter to Submit Video Avatar Task to use your video as the avatar template.

Related APIs


Billing Rules

The price is the same whether you use a public avatar template or upload your own video

Audio-Driven Mode Fees

  1. Charged at 0.02 credits per second based on the length of the generated video.
  2. Billing is calculated in whole seconds; any partial second is rounded up to the next full second. Text-Driven Mode Fees
  3. In addition to the audio-driven mode fees, an extra TTS (Text-to-Speech) fee is applied: An additional 0.1 credits are charged for every 100 characters, rounded up to the nearest 100 characters.