Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
K's picture
1

K

ReneeA1

AI & ML interests

None yet

Organizations

None yet

Collections 2

coding LLM
  • GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

    Paper • 2508.06471 • Published Aug 8 • 192
agent RL
  • Tool-integrated Reinforcement Learning for Repo Deep Search

    Paper • 2508.03012 • Published Aug 5 • 20
  • Agent Lightning: Train ANY AI Agents with Reinforcement Learning

    Paper • 2508.03680 • Published Aug 5 • 121
  • Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents

    Paper • 2509.09265 • Published Sep 11 • 46
  • A Survey of Reinforcement Learning for Large Reasoning Models

    Paper • 2509.08827 • Published Sep 10 • 189
coding LLM
  • GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

    Paper • 2508.06471 • Published Aug 8 • 192
agent RL
  • Tool-integrated Reinforcement Learning for Repo Deep Search

    Paper • 2508.03012 • Published Aug 5 • 20
  • Agent Lightning: Train ANY AI Agents with Reinforcement Learning

    Paper • 2508.03680 • Published Aug 5 • 121
  • Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents

    Paper • 2509.09265 • Published Sep 11 • 46
  • A Survey of Reinforcement Learning for Large Reasoning Models

    Paper • 2509.08827 • Published Sep 10 • 189

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs