2 9 68

By

ByRookie

AI & ML interests

None yet

Recent Activity

upvoted a paper 12 days ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

liked a Space about 1 month ago

HuggingFaceTB/smol-training-playbook

liked a dataset about 1 month ago

allenai/tulu-3-sft-mixture

View all activity

Organizations

upvoted a paper 12 days ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published 12 days ago • 54

liked a Space about 1 month ago

The Smol Training Playbook

📚

2.53k

The secrets to building world-class LLMs

liked a dataset about 1 month ago

allenai/tulu-3-sft-mixture

Viewer • Updated Dec 2, 2024 • 939k • 12.1k • 196

liked a model 2 months ago

Tengyunw/qwen3_30b_moe_eagle3

Updated Nov 5 • 2.49k • 11

liked a dataset 3 months ago

HuggingFaceFW/finepdfs

Updated 4 days ago • 36.1k • 681

liked a model 4 months ago

nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-FP8

Text Generation • 50B • Updated Oct 15 • 2.8k • 21

liked a dataset 4 months ago

nvidia/Nemotron-Post-Training-Dataset-v1

Viewer • Updated Aug 25 • 25.7M • 11.7k • 164

liked a model 4 months ago

MetaStoneTec/XBai-o4

33B • Updated Aug 6 • 48 • 193

New activity in nvidia/AceReason-1.1-SFT 6 months ago

will you release code rl dataset ?

🔥 3

#2 opened 6 months ago by

ByRookie

liked 2 datasets 6 months ago

zwhe99/DeepMath-103K

Viewer • Updated May 29 • 103k • 15.7k • 275

open-thoughts/OpenThoughts3-1.2M

Viewer • Updated Jun 9 • 1.2M • 15.8k • 187

upvoted a paper 6 months ago

ARM: Adaptive Reasoning Model

Paper • 2505.20258 • Published May 26 • 45

liked 2 datasets 7 months ago

a-m-team/AM-Thinking-v1-Distilled

Preview • Updated Jun 12 • 834 • 53

a-m-team/AM-Thinking-v1-RL-Dataset

Viewer • Updated May 21 • 54.8k • 243 • 17

liked a dataset 8 months ago

a-m-team/AM-DeepSeek-R1-Distilled-1.4M

Preview • Updated Mar 30 • 1.65k • 170

upvoted a paper 9 months ago

MARS: A Multi-Agent Framework Incorporating Socratic Guidance for Automated Prompt Optimization

Paper • 2503.16874 • Published Mar 21 • 44

liked a dataset 9 months ago

nvidia/Llama-Nemotron-Post-Training-Dataset

Viewer • Updated May 8 • 3.91M • 6.42k • 610

liked 2 models 9 months ago

Skywork/Skywork-R1V-38B

Image-Text-to-Text • 38B • Updated Aug 12 • 53.8k • 127

thu-coai/CharacterGLM-6B

Updated Apr 21, 2024 • 92 • 58

upvoted a paper 9 months ago

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published Mar 3 • 89

By

AI & ML interests

Recent Activity

Organizations

ByRookie's activity

The Smol Training Playbook

will you release code rl dataset ?