VisPlay: Self-Evolving Vision-Language Models from Images • Paper • 2511.15661 • Published 20 days ago • 42
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning • Paper • 2509.07980 • Published Sep 9 • 101
Self-Rewarding Vision-Language Model via Reasoning Decomposition • Paper • 2508.19652 • Published Aug 27 • 84