Skip to main content
MindStudio
Pricing
Blog About
My Workspace
Image Generation Model

Wan 2.7

Alibaba's powerful multimodal AI model that generates cinematic 1080p video with native audio synchronization, multi-shot storytelling, and advanced image creation.

Publisher Wan
Type Image
Context Window 2,000 tokens
Training Data December 2025
Price $0.0001/image
Provider WaveSpeed
Source Image

Wan 2.7

**Wan 2.6** is a multimodal AI generation system developed by Alibaba Cloud, released in December 2025. While it includes capable image generation tools, its primary strength lies in video generation — producing clips up to **15 seconds long at 1080p resolution and 24 frames per second**. The model family spans text-to-video, image-to-video, reference-to-video, and image generation modes, making it a comprehensive creative suite. ### Key Capabilities - **Native audio generation**: Wan 2.6 generates synchronized audio — including dialogue, sound effects, and lip-sync — directly alongside video, eliminating the need for separate dubbing tools - **Multi-shot storytelling**: A single prompt can produce multi-scene narratives with automatic camera transitions and consistent characters across shots - **Reference-to-video**: Upload reference images or video to maintain subject appearance, style, and motion consistency across generations - **Simulated world physics**: The model accurately renders gravity, fluid dynamics, and object interactions for realistic action and product shots - **Image generation**: Supports text-to-image, image-to-image transformation, and image editing at up to 2048×2048 pixels ### Architecture & Performance Wan 2.6 uses a **Mixture-of-Experts (MoE) architecture** with 14 billion total parameters, activating only ~20% during generation for improved speed. Separate expert models handle high-noise and low-noise generation stages. The model supports prompts in both **English and Chinese**, with optional AI-powered prompt expansion for enhanced output quality. Wan 2.6 is best suited for content creators, marketers, filmmakers, and developers who need high-fidelity video and image generation with minimal post-production work — particularly those who want to produce ready-to-publish video content complete with audio from a single prompt.

Ready to build with Wan 2.7?

Get Started Free

Parameters & options

Mode Toggle Group
Default: text-to-image
Images Image URL Array

Input images to edit (1-3). Reference them as "Figure 1", "Figure 2", etc. in the prompt.

Width Number
Default: 1024 Range: 512–4096
Height Number
Default: 1024 Range: 512–4096
Thinking Mode Select

When enabled, the model reasons about prompt intent before generating, improving composition and prompt adherence.

Default: true
OffOn
Seed Seed
Range: -1–2147483647

Start building with Wan 2.7

No API keys required. Create AI-powered workflows with Wan 2.7 in minutes — free.