Every Token Counts: Generalizing 16M Ultra-Long Context in Large Language Models Paper • 2511.23319 • Published 8 days ago • 21
Focused Chain-of-Thought: Efficient LLM Reasoning via Structured Input Information Paper • 2511.22176 • Published 9 days ago • 4
FedRE: A Representation Entanglement Framework for Model-Heterogeneous Federated Learning Paper • 2511.22265 • Published 9 days ago • 1
DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning Paper • 2511.22570 • Published 9 days ago • 63
PromptBridge: Cross-Model Prompt Transfer for Large Language Models Paper • 2512.01420 • Published 5 days ago • 8
ChronosObserver: Taming 4D World with Hyperspace Diffusion Sampling Paper • 2512.01481 • Published 5 days ago • 2
Guided Self-Evolving LLMs with Minimal Human Supervision Paper • 2512.02472 • Published 5 days ago • 47
C^2DLM: Causal Concept-Guided Diffusion Large Language Models Paper • 2511.22146 • Published 10 days ago • 3
Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains Paper • 2501.05707 • Published Jan 10 • 20
SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals Paper • 2406.04784 • Published Jun 7, 2024 • 2
Self-Improving Transformers Overcome Easy-to-Hard and Length Generalization Challenges Paper • 2502.01612 • Published Feb 3 • 1
Mind the Gap: Examining the Self-Improvement Capabilities of Large Language Models Paper • 2412.02674 • Published Dec 3, 2024
WebEvolver: Enhancing Web Agent Self-Improvement with Coevolving World Model Paper • 2504.21024 • Published Apr 23 • 2
Self-Improvement in Language Models: The Sharpening Mechanism Paper • 2412.01951 • Published Dec 2, 2024
ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent Paper • 2312.10003 • Published Dec 15, 2023 • 44
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners Paper • 2412.17256 • Published Dec 23, 2024 • 47
Self-Aware Feedback-Based Self-Learning in Large-Scale Conversational AI Paper • 2205.00029 • Published Apr 29, 2022