What is Wan 2.7 AI video generator?

Wan 2.7 is Alibaba's latest AI video generation model, releasing March 2026. It supports text-to-video, image-to-video, subject reference, voice reference, multi-image grid to video, and instruction editing. Built on Diffusion Transformer and MoE architecture with 27B+ parameters, it generates 1080p videos up to 15 seconds with native lip-sync support.

When is Wan 2.7 releasing?

Wan 2.7 is scheduled for release in March 2026. It follows Wan 2.6 (December 2025) and brings full upgrades in quality, audio, dynamics, style, and consistency across all generation modes.

What's new in Wan 2.7 compared to Wan 2.6?

Wan 2.7 adds subject reference, voice reference, multi-image grid to video, instruction editing, and video replication. It supports up to 5 video reference clips (vs 1 in 2.6) and full real-person image input. Quality, audio fidelity, and motion dynamics are upgraded across the board.

Is Wan 2.7 free to use?

Wan 2.1 is fully open-source under Apache 2.0 and runs locally on consumer GPUs with 6GB+ VRAM. For Wan 2.7, you can access it through Topview's free plan which includes monthly credits — no credit card required.

How do I use Wan 2.7 in Topview?

Open Topview Board at topview.ai/models, select the Wan model from the model selector, enter your prompt or upload reference materials, configure resolution and duration, then click generate. Your video is ready in minutes and can be exported as MP4, MOV, or WebM.

What resolution and duration does Wan 2.7 support?

Wan 2.7 supports up to 1080p resolution at 24fps with flexible duration from 2 to 15 seconds. Multiple aspect ratios are available including 16:9, 9:16, and 1:1 for different platform requirements.

How does Wan 2.7 compare to OpenAI Sora 2?

Wan 2.7 offers real-person input, up to 5 video reference clips, instruction editing, and open-source access via Wan 2.1 — features Sora 2 does not currently provide. Sora 2 supports longer sequences (25s) and excels at physics simulation. Both produce 1080p output. You can access both on Topview Board.

How does Wan 2.7 compare to Kling 3.0?

Kling 3.0 offers 4K/60fps output and 25-second generation. Wan 2.7 provides real-person input, 5 video reference clips, instruction editing, and open-source access. Wan 2.7 leads in lip-sync quality and style consistency. Both models are available on Topview Board.

Does Wan 2.7 support lip sync and audio?

Yes. Starting from Wan 2.5, the series features native multimodal architecture with full audio-visual synchronization. Wan 2.7 further enhances this with voice reference capability, allowing you to maintain a specific vocal style across generated content with precise lip movements.

Can I use Wan 2.7 for commercial projects?

Yes. Wan 2.1 is released under Apache 2.0, which allows unrestricted commercial use including advertising, film production, and content creation. For newer versions accessed through Topview, your subscription plan covers commercial usage rights.

Wan 2.7×

Topview

Wan 2.7 AI Video GeneratorText to Video & Image to Video

Alibaba's most advanced AI video model. Subject reference, voice reference, multi-image grid to video, instruction editing — all available on Topview Board.

Model

WAN 2.7

Upload Reference

@Image16

@Image17

Prompt1124/3500

[Duration]: 15 seconds [Camera]: Sony A7S III [Style & Type]: Middle East business consulting advertisement. Realistic style, high-brightness office lighting, using panning shots and fast multi-scene switching, professional and atmospheric. [Golden 3 Seconds]: A Western man in a suit sitting at a desk piled with documents, opens a notebook and looks at the camera, with huge subtitles appearing on screen. [Video Content]: The man wears black-rimmed glasses with a confident and professional expression. Actions include flipping through pages, walking in the office area @Image16, explaining with open hands, and walking confidently towards the camera at the end, finally popping up the brand CTA @Image17 [Pace & Atmosphere]: Fast-paced editing with dynamic background music. Smooth transitions, using large golden keywords to enhance visual impact, strong business atmosphere. [Dialogue]: For a long time we have been helping people setup their businesses here in the UAE. And now, it's time for us to help you with your numbers as well... we don't just set up your company, we keep it compliant and financially healthy.

Resolution

Aspect Ratio

Duration

Coming Soon

What Types of Videos Can You Make with Wan 2.7?

Wan 2.7 is not just for one format. It works especially well for video types that need stronger motion, cleaner scene consistency, more cinematic framing, or reference-driven storytelling.

Action & Motion-Heavy Videos

Use Wan 2.7 for fast-moving scenes where body motion, momentum, and timing need to stay readable. This works well for sports-style edits, cinematic action sequences, game trailers, and product demos with movement.

Prompt

"Cinematic action sequence with a female soldier in full tactical gear moving through snowy industrial ruins. Emphasize fast body motion, believable weapon recoil, airborne debris, cold atmospheric lighting, and a tense battlefield rhythm with clean shot-to-shot continuity."

Cinematic Character Shots

Wan 2.7 is a strong fit for dramatic portraits, fashion-style visuals, moody trailers, and music-video moments where framing, depth, and directional lighting need to feel more intentional and less flat.

Prompt

"Moody cinematic portrait of a woman with a black bob haircut and pearl necklace walking down a dim tiled staircase at night. Use a smooth backward tracking shot, serious expression, soft directional light, deep shadow contrast, and elegant framing with strong visual depth."

Product Story & Commercial Videos

Wan 2.7 can turn products into more premium-looking visual stories. Use it for ad creatives, launch teasers, beauty shots, product reveals, and commercial sequences that need controlled camera motion and cleaner visual polish.

Prompt

"Premium product commercial featuring a luxury skincare bottle on a reflective pedestal. Use a slow cinematic push-in, soft specular highlights, precise shadow control, glossy studio reflections, and an elegant reveal rhythm that feels polished, minimal, and high-end."

Reference-Driven Story Videos

Because Wan 2.7 supports subject reference, voice reference, and multi-image input, it is useful for creator-style storytelling, talking scenes, avatar-led explainers, and consistent multi-shot character narratives.

Prompt

"Reference-driven creator video of a young speaker talking directly to camera in a warm indoor studio. Keep facial identity consistent across cuts, preserve natural hand gestures and clean lip sync, and use soft cinematic lighting with documentary-style framing for an authentic storytelling feel."

What Is Wan 2.7?

Wan 2.7 is Alibaba's latest AI video generation model, scheduled for March 2026. Built on Diffusion Transformer (DiT) and Mixture of Experts (MoE) architecture with 27B+ parameters, it generates 1080p videos up to 15 seconds from text, images, or multi-image grids — with native lip-sync and subject consistency. The model uses a Causal 3D VAE with 4x8x8 compression ratio, activating only 14B of 27B total parameters per token to cut computation by 50% while improving complex scene generation.

Diffusion Transformer (DiT)

Advanced transformer-based diffusion model enabling superior temporal coherence and complex motion understanding.

Mixture of Experts (MoE)

27B total parameters with 14B activation, reducing computation by 50% while improving multi-character interactions.

Causal 3D VAE

Efficient spatiotemporal compression with 4x8x8 ratio, supporting arbitrary-length 1080p video encoding.

What's New in Wan 2.7

Six breakthrough upgrades over Wan 2.6 — from subject consistency to instruction-based editing.

Subject Reference

Upload a character reference to maintain visual identity across generated scenes.

Voice Reference

Provide a voice sample and the model preserves that vocal style in generated audio.

Multi-Image Grid to Video

Feed a 9-grid layout of images and Wan 2.7 synthesizes them into a coherent video sequence.

Instruction Editing

Edit generated videos with text instructions — change scenes, adjust pacing, swap elements without regenerating.

Video Replication

Upload a reference video and recreate its style, pacing, and composition with new content.

Quality Full Upgrade

Across-the-board improvements in visual quality, audio fidelity, motion dynamics, style consistency, and character coherence.

Wan 2.6 vs Wan 2.7

Feature	Wan 2.6	Wan 2.7
Subject Reference	Limited	Full support
Voice Reference	No	Yes
Multi-Image Grid	No	9-grid to video
Instruction Editing	No	Yes
Video Replication	No	Yes
Video Reference Clips	1	Up to 5
Real Person Input	Limited	Full support
Quality / Audio / Dynamics	Baseline	Full upgrade

How to Use Wan 2.7 in Topview (3 Steps)

Prompt input interface for AI video generation

Step 1

Enter a prompt

Describe the video you want using natural language.

Step 2

Generate Video

Click generate and watch Wan 2.7 bring your ideas to life in seconds.

Video download interface after generation

Step 3

Download the video

Export a clean MP4 when you're ready.

Wan 2.7 Core Capabilities

Everything you need to create professional AI videos — from text prompts to multi-reference generation.

Text to Video

Generate cinematic videos from text descriptions in English, Chinese, Japanese, Korean, and German.

Image to Video

Animate static images with natural motion, camera movement, and cinematic effects.

Subject + Voice Reference

Maintain character identity and vocal style across generated clips for brand consistency.

Multi-Image Grid to Video

Convert 9-grid image layouts into coherent video sequences with smooth transitions.

Instruction Editing

Modify generated videos via text commands — change scenes, adjust pacing, swap elements.

Native Lip Sync

Industry-leading audio-visual synchronization for talking-head and dialogue content.

Wan Model Evolution: From 2.1 to 2.7

Five generations of continuous innovation in AI video generation from Alibaba's Tongyi Lab.

Feb 2025

Wan 2.1

Open source (Apache 2.0), consumer GPU (6GB+), 14B params, VBench 86.22%

Jul 2025

Wan 2.2

MoE (27B total), 50% compute savings, character replacement, 60+ cinematic params

Oct 2025

Wan 2.5

Photo singing/dancing, 10s generation, audio-visual sync, native multimodal

Dec 2025

Wan 2.6

Full lip-sync, voice cloning, multi-shot narrative, 15s generation

Mar 2026Latest

Wan 2.7

Subject+voice reference, instruction editing, multi-image grid, quality full upgrade

Technical Specifications Comparison

Metric	Wan 2.1	Wan 2.2	Wan 2.5	Wan 2.6	Wan 2.7
Max Resolution	720p	720p	1080p	1080p	1080p
Max Duration	5s	5s	10s	15s	15s
Frame Rate	24fps	24fps	24fps	24fps	24fps
Parameters	14B	27B	27B+	27B+	27B+
VBench Score	86.22%	87.5%	—	89%+	89%+

Wan 2.7 vs Other AI Video Models

See how Wan 2.7 compares to the leading AI video generation models available today.

Metric	Wan 2.7Recommended	SeedDance 2.0	Sora 2	Kling 3.0	Veo 3.2	Gen-4.5
Max Duration	15s	15s	25s	25s	10s	10s
Resolution	1080p	1080p	1080p	4K/60fps	1080p	1080p
Open Source	Yes (2.1)	No	No	No	No	No
Real Person Input	Yes	No	—	—	—	—
Video Reference Clips	5	1	—	1	—	1
Instruction Editing	Yes	No	—	—	—	—
Lip Sync Quality	★★★★★	★★★★	★★★★	★★★★	★★★★	★★★

Why Use Wan 2.7 on Topview

Access Wan 2.7 alongside every leading AI video model — in one workspace, one subscription.

All-in-One Board

Access Wan 2.7 alongside Sora, Veo, Kling, Runway, and more on one canvas. No need to switch between platforms.

Real-Time Collaboration

Share your Board with teammates. Comment, annotate, and iterate on AI video outputs together in real time.

Single Subscription

One Topview plan covers all integrated models. No per-model pricing or separate API keys required.

Marketing Video Workflow

Topview's AI Video Agent combines Wan 2.7 with product URL extraction, viral template matching, and UGC generation.

Export Flexibility

Download as MP4, MOV, or WebM. Choose resolution and aspect ratio for TikTok, Reels, Shorts, or paid ads.

All-in-One Creation Workflow

From image to video to publishing, Topview lets you complete the whole workflow in one place instead of switching between separate tools.

Start Free — Try Wan 2.7 on Topview Board

Experience Wan 2.7's subject reference, voice reference, and multi-image grid to video on Topview Board. No credit card required.

Try Wan 2.7 Free — Open Topview Board

All AI video models in one workspace · Real-time collaboration · No credit card required

Frequently Asked Questions

What Is Wan 2.7?

Feature

Wan 2.6

Wan 2.7

Subject Reference

Limited

Full support

Voice Reference

Yes

Multi-Image Grid

9-grid to video

Instruction Editing

Yes

Video Replication

Yes

Video Reference Clips

Up to 5

Real Person Input

Limited

Full support

Quality / Audio / Dynamics

Baseline

Full upgrade

Metric

Wan 2.1

Wan 2.2

Wan 2.5

Wan 2.6

Wan 2.7

Max Resolution

720p

1080p

Max Duration

10s

15s

Frame Rate

24fps

Parameters

14B

27B

27B+

VBench Score

86.22%

87.5%

—

89%+

Metric

Wan 2.7Recommended

SeedDance 2.0

Sora 2

Kling 3.0

Veo 3.2

Gen-4.5

Max Duration

15s

25s

10s

Resolution

1080p

4K/60fps

1080p

Open Source

Yes (2.1)

Real Person Input

Yes

—

Video Reference Clips

—

Instruction Editing

Yes

—

Lip Sync Quality

★★★★★

★★★★

★★★