K
ReneeA1
AI & ML interests
None yet
Organizations
None yet
agent RL
Tool-integrated Reinforcement Learning for Repo Deep Search
Paper • 2508.03012 • Published • 20
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
Paper • 2508.03680 • Published • 121
Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents
Paper • 2509.09265 • Published • 46
A Survey of Reinforcement Learning for Large Reasoning Models
Paper • 2509.08827 • Published • 189
coding LLM
models
0
None public yet
datasets
0
None public yet