Xinging/distill_r1_coig_neo_en_cleaned_uncertainty_threshold_18.73_format_sft_conversation_w_sys_4k Text Generation • 333k • Updated 26 days ago • 23
Xinging/distill_r1_coig_neo_en_cleaned_uncertainty_threshold_18.73_format_sft_conversation_4k Text Generation • 333k • Updated 26 days ago • 26
Xinging/distill_r1_coig_neo_en_cleaned_uncertainty_threshold_18.73_format_sft_conversation_4k_5_epochs Text Generation • 333k • Updated 26 days ago • 33
Xinging/user_simulator_uncertainty_threshold_40_sft_train_dataset Text Generation • 333k • Updated Sep 12 • 6 • 1
Xinging/distill_r1_coing_neo_cleaned_uncertainty_threshold_40_sft_conversation_train_dataset Text Generation • 333k • Updated Sep 12 • 3 • 1
Xinging/mistral-24b_sft_0.1_alpaca_gpt4_active_by_comprehensive_ntrain_5700_new_lora_adapter Updated Apr 10 • 4