Upload a portrait along with a script or audio file, and Vozo will animate it with natural movements and lip sync to create a lifelike talking video. This guide will walk you through the steps.
You can watch the video or read the guide below—whichever works best for you! 👇
Upload Photo
To get started, navigate to your Dashboard and click Generate Talking Video - Start with Photo. This will open the upload dialog, where you can drag and drop your image files or click to upload.
Input Audio
You can input audio in multiple ways:
Text to Speech
Upload Audio
Text to Speech
If you have a script and want to generate speech from text, select this option.
Choose a Language and Voice:
Select your desired language and voice from the dropdown.
If you’re not satisfied with the voices listed in the dropdown, click "Choose More from Library" to explore additional options.
To use a cloned voice, click "Choose More from Library - Cloned Voice - Clone New Voice" and follow the instructions to upload or record audio to create your custom voice.
Input Script:
Upload Audio
If you already have an audio file, select this option to upload it directly.
Once you’ve configured your settings, click Generate to proceed.
Preview and Download
After the video is generated, preview the results directly on the project page.
To download the video, click the Download button in the top-right corner.