-
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence
Paper • 2511.18538 • Published • 245 -
Neural Machine Translation by Jointly Learning to Align and Translate
Paper • 1409.0473 • Published • 7 -
Attention Is All You Need
Paper • 1706.03762 • Published • 105 -
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Paper • 1810.04805 • Published • 24
Collections
Discover the best community collections!
Collections including paper arxiv:2506.02153
-
A Survey of Small Language Models
Paper • 2410.20011 • Published • 46 -
Small Language Models are the Future of Agentic AI
Paper • 2506.02153 • Published • 22 -
AgentLite: A Lightweight Library for Building and Advancing Task-Oriented LLM Agent System
Paper • 2402.15538 • Published • 6 -
Small Language Models: Survey, Measurements, and Insights
Paper • 2409.15790 • Published • 2
-
LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries
Paper • 2508.15760 • Published • 46 -
LiveMCPBench: Can Agents Navigate an Ocean of MCP Tools?
Paper • 2508.01780 • Published • 20 -
API-Bank: A Comprehensive Benchmark for Tool-Augmented LLMs
Paper • 2304.08244 • Published • 1 -
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs
Paper • 2508.16153 • Published • 158
-
The Leaderboard Illusion
Paper • 2504.20879 • Published • 72 -
SmolVLM: Redefining small and efficient multimodal models
Paper • 2504.05299 • Published • 200 -
Seedance 1.0: Exploring the Boundaries of Video Generation Models
Paper • 2506.09113 • Published • 104 -
Small Language Models are the Future of Agentic AI
Paper • 2506.02153 • Published • 22
-
Small Language Models are the Future of Agentic AI
Paper • 2506.02153 • Published • 22 -
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models
Paper • 2512.02556 • Published • 184 -
DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning
Paper • 2511.22570 • Published • 66 -
DeepSeek-OCR: Contexts Optical Compression
Paper • 2510.18234 • Published • 82
-
Why Language Models Hallucinate
Paper • 2509.04664 • Published • 193 -
BED-LLM: Intelligent Information Gathering with LLMs and Bayesian Experimental Design
Paper • 2508.21184 • Published • 2 -
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning
Paper • 2505.24726 • Published • 277 -
Small Language Models are the Future of Agentic AI
Paper • 2506.02153 • Published • 22
-
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence
Paper • 2511.18538 • Published • 245 -
Neural Machine Translation by Jointly Learning to Align and Translate
Paper • 1409.0473 • Published • 7 -
Attention Is All You Need
Paper • 1706.03762 • Published • 105 -
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Paper • 1810.04805 • Published • 24
-
Small Language Models are the Future of Agentic AI
Paper • 2506.02153 • Published • 22 -
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models
Paper • 2512.02556 • Published • 184 -
DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning
Paper • 2511.22570 • Published • 66 -
DeepSeek-OCR: Contexts Optical Compression
Paper • 2510.18234 • Published • 82
-
A Survey of Small Language Models
Paper • 2410.20011 • Published • 46 -
Small Language Models are the Future of Agentic AI
Paper • 2506.02153 • Published • 22 -
AgentLite: A Lightweight Library for Building and Advancing Task-Oriented LLM Agent System
Paper • 2402.15538 • Published • 6 -
Small Language Models: Survey, Measurements, and Insights
Paper • 2409.15790 • Published • 2
-
Why Language Models Hallucinate
Paper • 2509.04664 • Published • 193 -
BED-LLM: Intelligent Information Gathering with LLMs and Bayesian Experimental Design
Paper • 2508.21184 • Published • 2 -
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning
Paper • 2505.24726 • Published • 277 -
Small Language Models are the Future of Agentic AI
Paper • 2506.02153 • Published • 22
-
LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries
Paper • 2508.15760 • Published • 46 -
LiveMCPBench: Can Agents Navigate an Ocean of MCP Tools?
Paper • 2508.01780 • Published • 20 -
API-Bank: A Comprehensive Benchmark for Tool-Augmented LLMs
Paper • 2304.08244 • Published • 1 -
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs
Paper • 2508.16153 • Published • 158
-
The Leaderboard Illusion
Paper • 2504.20879 • Published • 72 -
SmolVLM: Redefining small and efficient multimodal models
Paper • 2504.05299 • Published • 200 -
Seedance 1.0: Exploring the Boundaries of Video Generation Models
Paper • 2506.09113 • Published • 104 -
Small Language Models are the Future of Agentic AI
Paper • 2506.02153 • Published • 22