Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following Paper • 2511.21662 • Published 14 days ago • 10
Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning Paper • 2511.16043 • Published 20 days ago • 105
VisPlay: Self-Evolving Vision-Language Models from Images Paper • 2511.15661 • Published 21 days ago • 42
StatEval: A Comprehensive Benchmark for Large Language Models in Statistics Paper • 2510.09517 • Published Oct 10 • 6 • 4
StatEval: A Comprehensive Benchmark for Large Language Models in Statistics Paper • 2510.09517 • Published Oct 10 • 6
StatEval: A Comprehensive Benchmark for Large Language Models in Statistics Paper • 2510.09517 • Published Oct 10 • 6
StatEval: A Comprehensive Benchmark for Large Language Models in Statistics Paper • 2510.09517 • Published Oct 10 • 6 • 4
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search Paper • 2509.25454 • Published Sep 29 • 140
VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning Paper • 2510.01444 • Published Oct 1 • 19
VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning Paper • 2510.01444 • Published Oct 1 • 19
CLUE: Non-parametric Verification from Experience via Hidden-State Clustering Paper • 2510.01591 • Published Oct 2 • 26
Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models Paper • 2509.06949 • Published Sep 8 • 55
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning Paper • 2509.09674 • Published Sep 11 • 80
Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation Paper • 2509.15194 • Published Sep 18 • 33
Look Again, Think Slowly: Enhancing Visual Reflection in Vision-Language Models Paper • 2509.12132 • Published Sep 15 • 6
CDE: Curiosity-Driven Exploration for Efficient Reinforcement Learning in Large Language Models Paper • 2509.09675 • Published Sep 11 • 28
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning Paper • 2509.07980 • Published Sep 9 • 101
CDE: Curiosity-Driven Exploration for Efficient Reinforcement Learning in Large Language Models Paper • 2509.09675 • Published Sep 11 • 28