From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence • Paper • 2511.18538 • Published 16 days ago • 248
DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning • Paper • 2511.22570 • Published 12 days ago • 68
On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes • Paper • 2306.13649 • Published Jun 23, 2023 • 28
Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance • Paper • 2511.13254 • Published 22 days ago • 134
Seer: Online Context Learning for Fast Synchronous LLM Reinforcement Learning • Paper • 2511.14617 • Published 21 days ago • 1
The Path Not Taken: RLVR Provably Learns Off the Principals • Paper • 2511.08567 • Published 28 days ago • 32
Driven by Compression Progress: A Simple Principle Explains Essential Aspects of Subjective Beauty, Novelty, Surprise, Interestingness, Attention, Curiosity, Creativity, Art, Science, Music, Jokes • Paper • 0812.4360 • Published Dec 23, 2008 • 2
Depth Anything 3: Recovering the Visual Space from Any Views • Paper • 2511.10647 • Published 26 days ago • 93
Open Character Training: Shaping the Persona of AI Assistants through Constitutional AI • Paper • 2511.01689 • Published Nov 3 • 4
LeJEPA: Provable and Scalable Self-Supervised Learning Without the Heuristics • Paper • 2511.08544 • Published 28 days ago • 6
From Memorization to Reasoning in the Spectrum of Loss Curvature • Paper • 2510.24256 • Published Oct 28 • 2