Distilled Reasoning Models with Activation Sparse
AI & ML interests
ML algorithms and systems
Reproduce Deepseek distilled models based on open-r1.
-
InfiniAILab/OpenR1-Qwen-3B-SFT-Instruct
Text Generation • 3B • Updated • 4 • 1 -
InfiniAILab/OpenR1-Qwen-7B-SFT-Instruct
Text Generation • 8B • Updated • 8 • 2 -
InfiniAILab/OpenR1-Qwen-7B-Math-Instruct
Text Generation • 8B • Updated • 15 -
InfiniAILab/OpenR1-Qwen-1.5B-SFT-Instruct
Text Generation • 2B • Updated • 7
Distilled Reasoning Models with Activation Sparse
Reproduce Deepseek distilled models based on open-r1.
-
InfiniAILab/OpenR1-Qwen-3B-SFT-Instruct
Text Generation • 3B • Updated • 4 • 1 -
InfiniAILab/OpenR1-Qwen-7B-SFT-Instruct
Text Generation • 8B • Updated • 8 • 2 -
InfiniAILab/OpenR1-Qwen-7B-Math-Instruct
Text Generation • 8B • Updated • 15 -
InfiniAILab/OpenR1-Qwen-1.5B-SFT-Instruct
Text Generation • 2B • Updated • 7
models
96
InfiniAILab/Autoregressive-7B-2
2B
•
Updated
•
10
InfiniAILab/Autoregressive-7B
1.0B
•
Updated
•
1
•
1
InfiniAILab/Multiverse-7B
1B
•
Updated
•
461
InfiniAILab/Autoregressive-1.5B-2
0.2B
•
Updated
•
1
InfiniAILab/Autoregressive-1.5B
0.2B
•
Updated
•
1
•
1
InfiniAILab/Autoregressive-1.5B-no-structure
0.2B
•
Updated
•
3
InfiniAILab/Multiverse-1.5B
0.2B
•
Updated
•
182
•
1
InfiniAILab/S1-claude-1K-32B-bs16-new-tokenizer
33B
•
Updated
•
4
InfiniAILab/S1-claude-1K-32B-bs16
33B
•
Updated
•
3
InfiniAILab/S1.1-1K-32B-bs16-new-tokenizer-parallel-7.1-v6-true-mix-prompt
33B
•
Updated
•
3
datasets
22
InfiniAILab/multiverse-sample
Updated
•
22
InfiniAILab/gsm_infinite_symbolic_32k
Updated
•
129
InfiniAILab/gsm_infinite_hard_128k
Viewer
•
Updated
•
12.3k
•
435
InfiniAILab/gsm_infinite_symbolic_16k
Updated
•
200
InfiniAILab/gsm_infinite_medium_128k
Viewer
•
Updated
•
12.7k
•
923
InfiniAILab/gsm_infinite_symbolic_8k
Updated
•
475
InfiniAILab/gsm_infinite_hard_64k
Viewer
•
Updated
•
12.3k
•
14
InfiniAILab/gsm_infinite_symbolic_0
Updated
•
390
InfiniAILab/gsm_infinite_medium_64k
Viewer
•
Updated
•
21.3k
•
64
InfiniAILab/gsm_infinite_symbolic_128k
Updated
•
108