Image Generation Model

Wan 2.7

Alibaba's powerful multimodal AI model that generates cinematic 1080p video with native audio synchronization, multi-shot storytelling, and advanced image creation.

Start Building with Wan 2.7 View All Models

Publisher

Wan

TypeImage

Context Window2,000 tokens

Training DataDecember 2025

Price$0.03/image

Provider

WaveSpeed

Source Image

Try Wan 2.7 →

Overview

Wan 2.7

**Wan 2.6** is a multimodal AI generation system developed by Alibaba Cloud, released in December 2025. While it includes capable image generation tools, its primary strength lies in video generation — producing clips up to **15 seconds long at 1080p resolution and 24 frames per second**. The model family spans text-to-video, image-to-video, reference-to-video, and image generation modes, making it a comprehensive creative suite. ### Key Capabilities - **Native audio generation**: Wan 2.6 generates synchronized audio — including dialogue, sound effects, and lip-sync — directly alongside video, eliminating the need for separate dubbing tools - **Multi-shot storytelling**: A single prompt can produce multi-scene narratives with automatic camera transitions and consistent characters across shots - **Reference-to-video**: Upload reference images or video to maintain subject appearance, style, and motion consistency across generations - **Simulated world physics**: The model accurately renders gravity, fluid dynamics, and object interactions for realistic action and product shots - **Image generation**: Supports text-to-image, image-to-image transformation, and image editing at up to 2048×2048 pixels ### Architecture & Performance Wan 2.6 uses a **Mixture-of-Experts (MoE) architecture** with 14 billion total parameters, activating only ~20% during generation for improved speed. Separate expert models handle high-noise and low-noise generation stages. The model supports prompts in both **English and Chinese**, with optional AI-powered prompt expansion for enhanced output quality. Wan 2.6 is best suited for content creators, marketers, filmmakers, and developers who need high-fidelity video and image generation with minimal post-production work — particularly those who want to produce ready-to-publish video content complete with audio from a single prompt.

Ready to build with Wan 2.7?

Get Started Free

Resources