Égaré quelque part dans l'espace latent où les pixels ont la mémoire de rêves non formulés.
TurboDiffusion: 100–200× Acceleration for Video Diffusion Models
TurboDiffusion: 100–200× Acceleration for Video Diffusion Models
EgoX: Egocentric Video Generation from a Single Exocentric Video
EgoX: Egocentric Video Generation from a Single Exocentric Video
Character Animation/motion transfer
Character Animation/motion transfer
One-to-All Animation: Alignment-Free
Character Animation and Image Pose Transfer
One-to-All Animation: Alignment-Free
Character Animation and Image Pose Transfer
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance.
Indicate a trajectory.
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance.
Indicate a trajectory.
EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture
EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture
Unified Video Editing with Temporal Reasoner
Unified Video Editing with Temporal Reasoner
MotionV2V: Editing Motion in a Video
MotionV2V: Editing Motion in a Video
One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer. Arxiv: 2511.22940
One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer. Arxiv: 2511.22940
Qwen-Edit-2509-Light-Migration Lora
Qwen-Edit-2509-Light-Migration Lora
Text to image based on Wan 2.2.
Text to image based on Wan 2.2.
HunyuanVideo-1.5: A lightweight video generation model
HunyuanVideo-1.5: A lightweight video generation model
ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation
ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation
LongCat-Image-Edit (Qwen based)
LongCat-Image-Edit (Qwen based)
Light-X is a video generation framework that jointly controls camera trajectory and illumination from monocular videos.
Light-X is a video generation framework that jointly controls camera trajectory and illumination from monocular videos.
This workflow uses Wan VACE (Wan 2.2 Fun VACE or Wan 2.1 VACE, your choice!) to smooth out awkward motion transitions between video clips.
This workflow uses Wan VACE (Wan 2.2 Fun VACE or Wan 2.1 VACE, your choice!) to smooth out awkward motion transitions between video clips.
First Frame Go
First Frame Go
RELIC: Interactive Video World Model with Long-Horizon Memory
RELIC: Interactive Video World Model with Long-Horizon Memory
MagicQuill V2: Precise and Interactive Image Editing with Layered Visual Cues.
MagicQuill V2: Precise and Interactive Image Editing with Layered Visual Cues.
MultiShotMaster: A Controllable Multi-Shot Video Generation Framework
MultiShotMaster: A Controllable Multi-Shot Video Generation Framework
TUNA leverages unified visual representations to enable image/video understanding, image/video generation, and image editing within a single framework. By Meta.
TUNA leverages unified visual representations to enable image/video understanding, image/video generation, and image editing within a single framework. By Meta.
iMontage: Image editing, In-context Generation, storyboards.
iMontage: Image editing, In-context Generation, storyboards.