AI Playground: 110 AI models in one place to create anything.

Multi-reference video, unlimited potential

What you can create with WAN 2.7

Lock your first and last frame, feed up to 5 simultaneous references (up from 2 in WAN 2.6), or use the 9-grid system for multi-angle consistency. You decide the narrative arc — WAN 2.7 fills in the motion.


WAN 2.7 FAQ

What is WAN 2.7?

What is the 9-grid image-to-video feature?

You can feed 9 reference images in a 3×3 grid — multi-angle shots, sequential poses, or scene variants. WAN 2.7 uses all of them to keep the subject stable, the pose accurate, and the composition tight across the generated video. This enables consistency that previous models couldn't achieve.

How does instruction-based video editing work?

If a generated video is almost right, you can tell WAN 2.7 what to change in plain language — like 'change the background to a rooftop' or 'make it sunset lighting.' It edits the video without regenerating from scratch, preserving what already works.

What is video recreation?

Video recreation takes an existing video and generates variations while preserving the original motion and pacing. You can keep the same choreography but change the style, setting, or visual treatment.

How many reference images does WAN 2.7 support?

WAN 2.7 supports up to 5 simultaneous video references (up from 2 in WAN 2.6), plus the 9-grid system for even more detailed reference control. You can also combine references with first and last frame inputs — a capability no other video model offers at this scale.

Where can I use WAN 2.7 in Picsart?

WAN 2.7 is the default model in AI Video Generator and Fullscreen AI Video Generator, and is available in Flow and Playground. Integration with Storyline and Aura is coming soon. Every output can go straight into Picsart's editor for further refinement.

Can I lock both face and voice for consistent characters?

Yes. WAN 2.7's subject plus voice reference feature lets you anchor both the character's appearance and voice in one workflow, so the character looks and sounds consistent throughout your video.