R-PRM: Reasoning-Driven Process Reward Modeling
Shuaijie She
kevinpro
AI & ML interests
Reasoning, Chain of Thoughts, Alignment, Factual Consistency, Summarization
Recent Activity
liked
a dataset
6 days ago
nex-agi/agent-sft
liked
a dataset
6 days ago
opendatalab/AICC
liked
a dataset
16 days ago
nvidia/PhysicalAI-Autonomous-Vehicles