WAN 2.6 is live

Create Videos With WAN 2.6

The newest WAN 2.6 release brings improved realism, smoother edits, and curated creative pipelines designed for professional storytellers.

Configuration

Text prompt for video generation. Supports both Chinese and English, with a minimum of 1 character and a maximum of 5,000 characters.

The duration of the generated video in seconds

Video resolution tier
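
The prompt limits above can be checked before submitting a request. The following is a minimal sketch, not the playground's actual validation: the function name and constants are hypothetical, and it assumes "characters" means Unicode code points, so a Chinese character and an English letter each count as one.

```python
# Hypothetical constants taken from the limits stated in the Configuration text.
MIN_PROMPT_CHARS = 1
MAX_PROMPT_CHARS = 5000


def validate_prompt(prompt: str) -> str:
    """Return the prompt unchanged if its length is within the stated limits."""
    # len() counts Unicode code points, so each CJK character counts once.
    # Assumption: the service counts characters the same way.
    length = len(prompt)
    if length < MIN_PROMPT_CHARS:
        raise ValueError("prompt must contain at least 1 character")
    if length > MAX_PROMPT_CHARS:
        raise ValueError(
            f"prompt has {length} characters; the maximum is {MAX_PROMPT_CHARS}"
        )
    return prompt
```

For example, validate_prompt("一只猫在雨中奔跑") passes, while an empty string or a 5,001-character prompt raises ValueError.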


What is WAN 2.6?

Wan 2.6 is Alibaba’s latest AI video model, offering affordable multi-shot 1080p generation with stable characters and synchronized native audio.

It introduces stronger scene continuity, more consistent character performance, and improved control over camera movement and pacing, resulting in generated videos that appear meticulously crafted rather than disjointed.

Wan 2.6 Architecture Visualization

Features of WAN 2.6

Wan 2.6 seamlessly blends realism, generation speed, and character consistency.

Text to Video

WAN 2.6 can generate cinematic-quality videos directly from natural language.

Image to Video

Convert a single image into an animated video clip while preserving the subject's features and visual style.

Video to Video

Extract key features from reference videos and apply them to newly generated content, keeping characters consistent across different shots and related material.

Native 1080p

Generate cinema-grade 1080p resolution directly, ensuring crisp details without the artifacts of upscaling.

Lip-sync

The generated video includes native audio and maintains lip-sync accuracy.

Multi-scenario prompts

Multiple scenes can be described in a single prompt, making the output video more realistic.

How It Works

Create professional-grade videos in three simple steps using our intuitive playground interface.

1. Select & Input

Choose your creative mode—Text, Image, or Video—and provide your prompt or upload media.

2. Customize

Fine-tune your output with precise controls for duration (5s, 10s) and resolution (720p, 1080p).

3. Generate

Click generate and watch Wan 2.6's advanced physics engine bring your vision to life in seconds.
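
The three steps above can be sketched as a small request object that is validated before anything is generated. This is an illustrative sketch only: GenerationRequest, its field names, and the mode strings are hypothetical and do not describe a real WAN 2.6 API. The allowed durations and resolutions are the ones listed in step 2.

```python
from dataclasses import dataclass
from typing import Optional

# Values taken from the steps above; the identifiers themselves are hypothetical.
ALLOWED_MODES = {"text", "image", "video"}
ALLOWED_DURATIONS = {5, 10}  # seconds, as listed in step 2
ALLOWED_RESOLUTIONS = {"720p", "1080p"}


@dataclass(frozen=True)
class GenerationRequest:
    """Hypothetical container for the playground settings described above."""

    mode: str
    prompt: str = ""
    duration_seconds: int = 5
    resolution: str = "1080p"
    media_path: Optional[str] = None  # uploaded image or video, if any

    def __post_init__(self) -> None:
        # Step 1: a creative mode plus either a prompt or uploaded media.
        if self.mode not in ALLOWED_MODES:
            raise ValueError(f"unknown mode: {self.mode!r}")
        if self.mode == "text" and not self.prompt:
            raise ValueError("text mode needs a prompt")
        if self.mode != "text" and self.media_path is None:
            raise ValueError("image and video modes need uploaded media")
        # Step 2: duration and resolution controls.
        if self.duration_seconds not in ALLOWED_DURATIONS:
            raise ValueError(f"unsupported duration: {self.duration_seconds}s")
        if self.resolution not in ALLOWED_RESOLUTIONS:
            raise ValueError(f"unsupported resolution: {self.resolution}")
```

A request such as GenerationRequest(mode="text", prompt="A fox runs through snow", duration_seconds=10, resolution="720p") passes validation, while a 7-second duration or an image-mode request without media raises ValueError. Rejecting bad settings at construction time keeps the later generate step free of input checks.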

What specific upgrades were made in WAN 2.6?

Four upgrades that set Wan 2.6 apart for cinematic, controllable video generation.


Cinema-grade precision in multi-camera storytelling

Wan 2.6 introduces a redesigned narrative engine capable of generating multi-shot 1080p videos with seamless transitions, balanced pacing, and natural camera movements. It understands storyboard-style prompts and scene descriptions, enabling developers to create coherent visual narratives from text or image inputs. This makes the Wan 2.6 AI video generation model well suited to cinematic storytelling and creative short-form video production.


Reference-based Stable Identity and Speech Generation

The Wan 2.6 model introduces a powerful reference-based generation system that extracts appearance, motion style, and speech characteristics from reference clips. It consistently applies these attributes to new scenes, ensuring character and style consistency throughout the video.


Longer duration and stronger temporal stability

Wan 2.6 extends video duration to a maximum of 15 seconds while maintaining high-definition clarity and frame-to-frame consistency, ensuring lighting, clothing, and environmental details remain stable throughout motion. This provides developers with greater flexibility to build richer narrative content for commercial-grade AI video generation projects.


Integrated audio delivers lifelike high-definition sound

Wan 2.6 combines native audio creation and an advanced camera physics engine in a single workflow. It generates synchronized dialogue, background music, and ambient sound effects with precise lip-sync, while executing realistic pan, zoom, and tracking shots.

WAN Gallery

Shot with WAN 2.6

Browse curated clips produced during the private preview program.


WAN 2.6 vs. Other Video Models

See how Wan 2.6 stacks up against the competition in key performance metrics.

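Feature                 | Wan 2.6 (New)                | Wan 2.5       | Sora 2            | Veo 3.1                                | Kling 2.6
------------------------|------------------------------|---------------|-------------------|----------------------------------------|--------------
Input Types             | Text, Image, Video Reference | Text, Image   | Text, Image       | Text, Image                            | Text, Image
Typical Output Duration | Up to ~15 seconds            | ~8–10 seconds | Up to ~25 seconds | 8 seconds, supports extended durations | ~3–10 seconds
Resolution              | 1080p                        | 1080p         | 1080p             | 1080p                                  | 1080p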

FAQ about WAN 2.6

Some Information You Might Want to Know About WAN 2.6

The Wan 2.6 AI video generation tool supports text-to-video, image-to-video, and video-to-video workflows, producing up to 15 seconds of 1080p cinematic-quality video. With multi-camera storytelling, stable character identity, consistent lighting, and native audio synchronization, Wan 2.6 is suitable for creative, commercial, and narrative applications.
VideosGo

Experience
WAN 2.6

Bring cinematic pipelines, curated workflows, and pro showcases into your studio.