Stable Virtual Camera is a 1.3B-parameter multi-view diffusion model from Stability AI that turns one or more 2D images into 3D-consistent videos with controllable camera movement. It performs novel view synthesis along user-defined camera paths without explicit scene reconstruction, supporting 1 to 32 input images and outputs up to 1,000 frames. It is released as a research preview under a non-commercial license.
⚡ 3D Model Generators💵 Freemium — free to use for research under a Non-Commercial License, with weights and code openly available; no paid commercial tier currently.📅 Listed 10 Jun 2026
✨ Features
Novel View Synthesis
Generates 3D-consistent new viewpoints of a scene from 2D image inputs.
3D Camera Control
Supports 360°, spiral, dolly zoom, pan, and roll camera movements.
Multi-Image Input
Accepts 1 to 32 input images for richer scene reconstruction.
Flexible Output
Produces videos up to 1,000 frames in square, portrait, landscape, or custom ratios.
⚖️ Pros & Cons
Pros
✓ Creates 3D camera moves from images without complex reconstruction
✓ Open weights, paper, and code available for research
✓ Outperforms comparable models on benchmarks
Cons
✗ Non-commercial research license only
✗ Struggles with humans, animals, and dynamic textures