wangchenglong

wangclnlp

https://wangclnlp.github.io/wangchenglong.github.io/

AI & ML interests

None yet

Recent Activity

updated a collection about 1 month ago

Probing-RM

Collection

Probing Preference Representations: A Multi-Dimensional Evaluation and Analysis Method for Reward Models • 2 items • Updated Nov 20, 2025

New activity in ifnoc/MRMBench about 1 month ago

Update README.md

#2 opened about 1 month ago by wangclnlp

updated a collection about 2 months ago

Probing-RM

Collection

Probing Preference Representations: A Multi-Dimensional Evaluation and Analysis Method for Reward Models • 2 items • Updated Nov 20, 2025

upvoted 3 papers 4 months ago

Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Paper • 2509.07980 • Published Sep 9, 2025 • 101

GRAM-R^2: Self-Training Generative Foundation Reward Models for Reward Reasoning

Paper • 2509.02492 • Published Sep 2, 2025 • 1

GRAM: A Generative Foundation Reward Model for Reward Generalization

Paper • 2506.14175 • Published Jun 17, 2025 • 1

updated a collection 4 months ago

GRAM-RR

Collection

Self-Training Generative Foundation Reward Models for Reward Reasoning • 4 items • Updated Nov 8, 2025

updated 2 models 4 months ago

wangclnlp/GRAM-RR-LLaMA-3.2-3B-RewardModel

Text Generation • 3B • Updated Sep 4, 2025 • 18

wangclnlp/GRAM-RR-LLaMA-3.1-8B-RewardModel

Text Generation • 8B • Updated Sep 4, 2025 • 75 • 1

updated a dataset 4 months ago

wangclnlp/GRAM-RR-TrainingData

Updated Sep 4, 2025 • 13

published a dataset 4 months ago

wangclnlp/GRAM-RR-TrainingData

Updated Sep 4, 2025 • 13

published 2 models 4 months ago

wangclnlp/GRAM-RR-LLaMA-3.2-3B-RewardModel

Text Generation • 3B • Updated Sep 4, 2025 • 18

wangclnlp/GRAM-RR-LLaMA-3.1-8B-RewardModel

Text Generation • 8B • Updated Sep 4, 2025 • 75 • 1

updated a collection 4 months ago

GRAM-RR

Collection

Self-Training Generative Foundation Reward Models for Reward Reasoning • 4 items • Updated Nov 8, 2025

upvoted a collection 6 months ago

GRAM

Collection

Generative Foundation Reward Models for Reward Generalization • 8 items • Updated Jun 19, 2025 • 1

updated 2 models 6 months ago

NiuTrans/GRAM-Qwen3-4B-RewardModel

4B • Updated Jun 26, 2025 • 15 • 2

NiuTrans/GRAM-Qwen3-8B-RewardModel

8B • Updated Jun 26, 2025 • 10 • 4
