V-RGBX: Video Editing with Accurate Controls over Intrinsic Properties Paper • 2512.11799 • Published 5 days ago • 29
EgoX: Egocentric Video Generation from a Single Exocentric Video Paper • 2512.08269 • Published 8 days ago • 84
Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics Paper • 2512.12602 • Published 3 days ago • 35
ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding Paper • 2512.13586 • Published 2 days ago • 83
LongVie 2: Multimodal Controllable Ultra-Long Video World Model Paper • 2512.13604 • Published 2 days ago • 58
Towards Scalable Pre-training of Visual Tokenizers for Generation Paper • 2512.13687 • Published 2 days ago • 78
SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder Paper • 2512.11749 • Published 5 days ago • 34
Achieving Olympia-Level Geometry Large Language Model Agent via Complexity Boosting Reinforcement Learning Paper • 2512.10534 • Published 6 days ago • 31
Long-horizon Reasoning Agent for Olympiad-Level Mathematical Problem Solving Paper • 2512.10739 • Published 6 days ago • 44
Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation Paper • 2512.10949 • Published 6 days ago • 41
BrainExplore: Large-Scale Discovery of Interpretable Visual Representations in the Human Brain Paper • 2512.08560 • Published 8 days ago • 37
StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation Paper • 2512.09363 • Published 7 days ago • 69
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance Paper • 2512.08765 • Published 8 days ago • 123
From Next-Token to Next-Block: A Principled Adaptation Path for Diffusion LLMs Paper • 2512.06776 • Published 10 days ago • 23
Preserving Source Video Realism: High-Fidelity Face Swapping for Cinematic Quality Paper • 2512.07951 • Published 9 days ago • 46