Expressive TTS with voice cloning
Generate a video from an image and a motion prompt
Generate a video from an image with a text prompt