December 5, 2025

Table of contents

  1. Quality & Duration Matrix
  2. v5.5 Native Audio
  3. Feature Comparison
  4. Endpoint Compatibility
  5. Quality Tier Requirements

Quality & Duration Matrix

Quality v5.5 v5 v4.5
360p 5s, 8s, 10s 5s, 8s 5s, 8s
540p (default) 5s, 8s, 10s 5s, 8s 5s, 8s
720p 5s, 8s, 10s 5s, 8s 5s, 8s
1080p 5s, 8s 5s, 8s 5s only

v5.5 Native Audio

Model v5.5 introduces integrated audio generation - voice, lip sync, and background music are generated together with the video in a single step.

{
  "model": "v5.5",
  "audio": true,
  "prompt": "A woman says hello and waves at the camera"
}
v5.5 Audio v5/v4.5 Audio
Use audio: true Use lip_sync_tts_prompt + sound_effect_prompt
Voice integrated with video Separate lipsync step
Background music auto-generated Manual via sound_effect_prompt
Lipsync endpoint not supported Lipsync endpoint supported

Feature Comparison

Feature v5.5 v5 v4.5
Native Audio - -
Lip Sync TTS -
Sound Effects -
10s Duration - -
Preview Mode
Camera Movement - -
Motion Modes - -
Image Fusion -

Endpoint Compatibility

Endpoint v5.5 v5 v4.5
POST videos/create
POST videos/create-frames
POST videos/create-transition
POST videos/extend
POST videos/upscale
POST videos/restyle - -
POST videos/lipsync -
POST videos/create-fusion -

Quality Tier Requirements

  • 360p: All subscription tiers
  • 540p: All subscription tiers (default)
  • 720p: Standard or higher
  • 1080p: Pro/Premium