arxiv:2509.03059
Junxiao Yang
yangjunxiao2021
AI & ML interests
Alignment/AI safety
Recent Activity
upvoted a paper 25 days ago
HaluMem: Evaluating Hallucinations in Memory Systems of Agents
upvoted a paper about 1 month ago
The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution
upvoted a paper about 1 month ago
DeepAgent: A General Reasoning Agent with Scalable Toolsets