10 14 1

beiqing

zhangBeiQing

ZhangBeiQing

AI & ML interests

None yet

Recent Activity

liked a Space about 2 months ago

Apollo-LMMs/TimeScope

commented on a paper about 2 months ago

StreamMem: Query-Agnostic KV Cache Memory for Streaming Video Understanding

upvoted a paper about 2 months ago

StreamMem: Query-Agnostic KV Cache Memory for Streaming Video Understanding

View all activity

Organizations

None yet

liked a Space about 2 months ago

TimeScope

💻

Visualize accuracy curves for video models

commented a paper about 2 months ago

StreamMem: Query-Agnostic KV Cache Memory for Streaming Video Understanding

Paper • 2508.15717 • Published Aug 21 • 1 •

upvoted a paper about 2 months ago

StreamMem: Query-Agnostic KV Cache Memory for Streaming Video Understanding

Paper • 2508.15717 • Published Aug 21 • 1

commented a paper about 2 months ago

MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding

Paper • 2404.05726 • Published Apr 8, 2024 • 23 •

upvoted a paper about 2 months ago

MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding

Paper • 2404.05726 • Published Apr 8, 2024 • 23

commented a paper 2 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 263 •

commented a paper 3 months ago

Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future Directions

Paper • 2505.00675 • Published May 1 • 3 •

upvoted a paper 3 months ago

Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future Directions

Paper • 2505.00675 • Published May 1 • 3

commented a paper 3 months ago

Memory Decoder: A Pretrained, Plug-and-Play Memory for Large Language Models

Paper • 2508.09874 • Published Aug 13 • 10 •

upvoted 2 papers 3 months ago

Memory Decoder: A Pretrained, Plug-and-Play Memory for Large Language Models

Paper • 2508.09874 • Published Aug 13 • 10

Memory-R1: Enhancing Large Language Model Agents to Manage and Utilize Memories via Reinforcement Learning

Paper • 2508.19828 • Published Aug 27 • 7

commented 2 papers 4 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7 • 180 •

Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models

Paper • 2506.06395 • Published Jun 5 • 133 •

upvoted 4 papers 4 months ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30 • 142

Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA

Paper • 2505.21115 • Published May 27 • 140

Test-Time Scaling with Reflective Generative Model

Paper • 2507.01951 • Published Jul 2 • 107

MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization

Paper • 2507.14683 • Published Jul 19 • 134

commented a paper 4 months ago

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26 • 158 •

upvoted a paper 4 months ago

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26 • 158

upvoted a paper 5 months ago

SRFT: A Single-Stage Method with Supervised and Reinforcement Fine-Tuning for Reasoning

Paper • 2506.19767 • Published Jun 24 • 15

beiqing

AI & ML interests

Recent Activity

Organizations

zhangBeiQing's activity

TimeScope