Konstantin Grotov's picture

3 2

Konstantin Grotov

konstantgr

·

AI & ML interests

None yet

Recent Activity

updated a model about 1 month ago

JetBrains-Research/Qwen3-30B-A3B-am

upvoted a paper about 1 month ago

The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation

authored a paper 2 months ago

PIPer: On-Device Environment Setup via Online Reinforcement Learning

View all activity

Organizations

updated a model about 1 month ago

JetBrains-Research/Qwen3-30B-A3B-am

31B • Updated Oct 29 • 5

upvoted a paper about 1 month ago

The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation

Paper • 2510.23393 • Published Oct 27 • 20

authored a paper 2 months ago

PIPer: On-Device Environment Setup via Online Reinforcement Learning

Paper • 2509.25455 • Published Sep 29 • 37

New activity in JetBrains-Research/PIPer-8B-RL-only 2 months ago

Improve model card: Add paper and code badges, update datasets metadata

#1 opened 2 months ago by

New activity in JetBrains-Research/PIPer-8B 2 months ago

Improve model card: Add paper and code links

#1 opened 2 months ago by

updated a collection 2 months ago

🦫 PIPer

All the resources for our paper "PIPer: On-Device Environment Setup via Online Reinforcement Learning"! • 9 items • Updated Oct 1 • 3

upvoted a paper 2 months ago

PIPer: On-Device Environment Setup via Online Reinforcement Learning

Paper • 2509.25455 • Published Sep 29 • 37

updated a collection 2 months ago

🦫 PIPer

All the resources for our paper "PIPer: On-Device Environment Setup via Online Reinforcement Learning"! • 9 items • Updated Oct 1 • 3

updated a dataset 2 months ago

JetBrains-Research/PIPer-envbench-zeroshot-rl

Viewer • Updated Oct 2 • 742 • 115 • 1

published a dataset 2 months ago

JetBrains-Research/PIPer-envbench-zeroshot-rl

Viewer • Updated Oct 2 • 742 • 115 • 1

updated a collection 2 months ago

🦫 PIPer

All the resources for our paper "PIPer: On-Device Environment Setup via Online Reinforcement Learning"! • 9 items • Updated Oct 1 • 3

updated a dataset 2 months ago

JetBrains-Research/PIPer-SFT-2500-sharegpt

Viewer • Updated Oct 2 • 2.5k • 71 • 1

published a dataset 2 months ago

JetBrains-Research/PIPer-SFT-2500-sharegpt

Viewer • Updated Oct 2 • 2.5k • 71 • 1

updated a collection 2 months ago

🦫 PIPer

All the resources for our paper "PIPer: On-Device Environment Setup via Online Reinforcement Learning"! • 9 items • Updated Oct 1 • 3

updated a dataset 2 months ago

JetBrains-Research/PIPer-eval

Preview • Updated Sep 30 • 94

published a dataset 2 months ago

JetBrains-Research/PIPer-eval

Preview • Updated Sep 30 • 94

updated a collection 2 months ago

🦫 PIPer

All the resources for our paper "PIPer: On-Device Environment Setup via Online Reinforcement Learning"! • 9 items • Updated Oct 1 • 3

updated a model 2 months ago

JetBrains-Research/Qwen3-8B-am

Text Generation • 8B • Updated Sep 30 • 30

updated a collection 2 months ago

🦫 PIPer

All the resources for our paper "PIPer: On-Device Environment Setup via Online Reinforcement Learning"! • 9 items • Updated Oct 1 • 3