WAN 2.6 is live

Create Videos With WAN 2.6

The newest WAN 2.6 release with improved realism, smoother edits, and curated creative pipelines designed for professional storytellers.

Configuration

prompt *

Text prompts for video generation. With a minimum of 1 characters and a maximum of 5,000 characters.

duration

resolution

Cost: --Sign in to view credits

Preview

Ready to Generate

Your generated video will appear here

What is the
WAN 2.6

Wan 2.6 is Alibaba’s latest AI video model, offering affordable multi-shot 1080p generation with stable characters and synchronized native audio.

It introduces stronger scene continuity, more consistent character performance, and improved control over camera movement and pacing, resulting in generated videos that appear meticulously crafted rather than disjointed.

Features of WAN 2.6

Wan 2.6 seamlessly blends realism, generation speed, and character consistency.

Text to Video

WAN 2.6 can generate cinematic-quality videos directly from natural language.

Image to Video

Convert a single image into an animated image while preserving the subject's features and visual style.

Video to Video

Extract key features from reference videos for newly generated content, ensuring character continuity across different shots and related material.

Native 1080p

Generate cinema-grade 1080p resolution directly, ensuring crisp details without the artifacts of upscaling.

Lip-sync

The generated video includes native audio and maintains lip-sync accuracy.

Multi-scenario prompts

Multiple scenes can be set for a single shot, making the output video more realistic.

How It Works

Create professional-grade videos in three simple steps using our intuitive playground interface.

Select & Input

Choose your creative mode—Text, Image, or Video—and provide your prompt or upload media.

Customize

Fine-tune your output with precise controls for duration (5s, 10s) and resolution (720p, 1080p).

Generate

Click generate and watch Wan 2.6's advanced physics engine bring your vision to life in seconds.

What specific upgrades were made in WAN 2.6?

Three breakthroughs that set Wan 2.6 apart for cinematic, controllable video generation.

00:15

Cinema-grade precision in multi-camera storytelling

Wan 2.6 introduces a redesigned narrative engine capable of generating multi-shot 1080p videos featuring seamless transitions, balanced pacing, and natural camera movements. It comprehends storyboard-style prompts and scene descriptions, enabling developers to create coherent visual narratives based on textual or image inputs. This makes Wan 2.6 AI video generation model the ideal choice for cinematic-level storytelling and creative short-form video production.

Try Now

00:08

Reference-based Stable Identity and Speech Generation

The Wan 2.6 model introduces a powerful reference-based generation system that extracts appearance, motion style, and speech characteristics from reference clips. It consistently applies these attributes to new scenes, ensuring character and style consistency throughout the video.

Try Now

00:10

Supports longer duration and stronger time stability

Wan 2.6 extends video duration to a maximum of 15 seconds while maintaining high-definition clarity and frame-to-frame consistency, ensuring lighting, clothing, and environmental details remain stable throughout motion. This provides developers with greater flexibility to build richer narrative content for commercial-grade AI video generation projects.

Try Now

00:15

Integrated audio delivers lifelike high-definition sound.

Integrating native audio creation with an advanced camera physics engine into a single workflow enables the generation of synchronized dialogue, background music, and ambient sound effects, achieving precise lip-sync while executing realistic pan, zoom, and tracking shots.

Try Now

WAN 2.6 Gallery

Create with WAN 2.6

Browse curated clips produced during the private preview program.

WAN 2.6Santa Claus

WAN 2.6Monkey King

WAN 2.6Capybara

WAN 2.6Sing a song

WAN 2.6Train Beauty

WAN 2.6Walking the dog

WAN 2.6 vs. Other Video Models

See how Wan 2.6 stacks up against the competition in key performance metrics.

Feature

Wan 2.6New

Wan 2.5

Sora 2

Veo 3.1

Kling 2.6

Input Types

Text, Image, Video Reference

Text, Image

Typical Output Duration

Up to ~15 seconds

~8–10 seconds

Up to ~25 seconds

8 seconds, supports extended durations

~3–10 seconds

Resolution

1080p

FAQ about WAN 2.6

Some Information You Might Want to Know About WAN 2.6

Wan 2.6 AI video generation tool supports text-to-video, image-to-video, and video-to-video workflows, producing up to 15 seconds of 1080p cinematic-quality video output. With multi-camera storytelling, stable character recognition, consistent lighting effects, and native audio synchronization, Wan 2.6 is suitable for creative, commercial, and narrative applications.

VideosGo

Experience
WAN 2.6

Bring cinematic pipelines, curated workflows, and pro showcases into your studio.

Try it Now

Create Videos With WAN 2.6

What is theWAN 2.6

Features of WAN 2.6

Text to Video

Image to Video

Video to Video

Native 1080p

Lip-sync

Multi-scenario prompts

How It Works

Select & Input

Customize

Generate

What specific upgrades were made in WAN 2.6?

Cinema-grade precision in multi-camera storytelling

Reference-based Stable Identity and Speech Generation

Supports longer duration and stronger time stability

Integrated audio delivers lifelike high-definition sound.

Create with WAN 2.6

WAN 2.6 vs. Other Video Models

FAQ about WAN 2.6

ExperienceWAN 2.6

What is the
WAN 2.6

Experience
WAN 2.6