Video Generation Model

Seedance 1.5 Pro

ByteDance's cinematic image-to-video AI model that generates high-quality 1080p videos with natively synchronized audio from static images in a single pass.

Start Building with Seedance 1.5 Pro View All Models

Publisher

ByteDance

TypeVideo

Context Window1,000 tokens

Price$0.05-$0.63/video

Provider

WaveSpeed

Source Image

Try Seedance 1.5 Pro →

About Seedance 1.5 Pro

Image-to-video with natively synchronized audio

Seedance 1.5 Pro is an image-to-video generation model developed by ByteDance that transforms static images into cinematic video clips at up to 1080p resolution. It uses a dual-branch Diffusion-Transformer (DB-DiT) architecture to generate video and audio simultaneously in a single pass, producing millisecond-level lip-sync and environmental audio without requiring post-production editing. Videos can range from 5 to 10 seconds in duration and support aspect ratios including 16:9, 9:16, and 21:9.

What distinguishes Seedance 1.5 Pro is its native audio-visual synthesis, which generates speech, sound effects, and ambient audio in sync with the video rather than layering them separately afterward. It supports multilingual lip-sync across six languages and offers over 15 controllable camera movements — such as dolly zooms, tracking shots, and orbits — specified through text prompts. The model is well-suited for content creators, marketers, and developers working on dialogue-driven content, social media clips, and multilingual voiceover projects where visual consistency and synchronized audio are required.

Capabilities

What Seedance 1.5 Pro supports

Image-to-Video

Converts a static source image into a dynamic video clip at resolutions up to 1080p, with durations of 5 to 10 seconds per generation.

Native Audio Synthesis

Generates speech, sound effects, and ambient audio simultaneously with video in a single pass using a dual-branch Diffusion-Transformer architecture, eliminating the need for separate audio post-processing.

Multilingual Lip-Sync

Produces accurate lip-sync across six languages with dialect-specific support, maintaining character identity and mouth movement alignment throughout the clip.

Camera Movement Control

Supports over 15 professional camera movements — including dolly zooms, tracking shots, orbits, pans, and tilts — controllable via text prompts.

Aspect Ratio Selection

Allows selection of output aspect ratios including 16:9, 9:16, and 21:9 to match platform requirements such as landscape, portrait, or cinematic formats.

Resolution Options

Offers selectable output resolutions of 480p, 720p, and 1080p, with a 5-second 1080p clip generating in approximately 41 seconds.

Reproducible Generation

Accepts a seed value as input so that specific outputs can be reproduced or iterated upon consistently across generation runs.

Complex Prompt Following

Handles multi-subject, multi-action text prompts with precise instruction following, enabling detailed scene and motion descriptions in a single generation.

Ready to build with Seedance 1.5 Pro?

Get Started Free

FAQ

Common questions about Seedance 1.5 Pro

What input does Seedance 1.5 Pro require to generate a video?

The model takes a static image URL as its primary input, along with text prompts and configuration options such as resolution, aspect ratio, duration, and an optional seed value.

What is the context window for Seedance 1.5 Pro?

The model has a context window of 1,000 tokens, which applies to the text prompt input used to guide video generation.

What resolutions and durations does the model support?

Seedance 1.5 Pro supports output resolutions of 480p, 720p, and 1080p, with video durations ranging from 5 to 10 seconds. Aspect ratios include 16:9, 9:16, and 21:9.

Does the model generate audio automatically, or is it added separately?

Audio is generated natively in the same single pass as the video using the dual-branch Diffusion-Transformer architecture. Speech, sound effects, and ambient audio are synchronized with the video without requiring separate post-production steps.

What languages does the lip-sync feature support?

The model supports accurate lip-sync across six languages, with dialect-specific support included for each.

Is there a knowledge cutoff date for this model?

No training cutoff date is specified in the available metadata for Seedance 1.5 Pro.

Resources

Documentation & links

Model Overview & Playground – EachlabsPlayground

→

Configuration

Parameters & options

Aspect RatioSelect

Default: 16:9

21:916:94:31:13:49:16

ResolutionSelect

Default: 720p

720p480p

DurationNumber

Default: 5Range: 4–12

SeedSeed

A specific value that is used to guide the 'randomness' of the generation.

Range: -1–2147483647

Related models

Explore similar models

Start building with Seedance 1.5 Pro

No API keys required. Create AI-powered workflows with Seedance 1.5 Pro in minutes — free.

Get Started Free Explore All Models