xuefengli

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

upvoted a collection 3 months ago

MathArena Benchmark

upvoted a paper 7 months ago

Efficient Agent Training for Computer Use

Organizations

Papers 3

arxiv:2411.16489

arxiv:2408.06941

arxiv:2406.12753

models 1

xuefengli/policy

datasets 0

None public yet