O2iginal/L56-D1920-qwen_gdn_qwen2-e1-nh6-hd64-nvh30-A0-S4096-step1-rand2b-nolearn-o-token1B Updated Oct 13
O2iginal/L56-D1920-qwen_gdn_qwen2-e1-nh6-hd64-nvh30-A0-S512-step1-rand2b-nolearn-o-token1B Updated Oct 13
O2iginal/L56-D1920-qwen_gdn_qwen2-e1-nh6-hd64-nvh30-A0-S512-step1-rand2b-bsz8x16-token1B Updated Oct 13
O2iginal/L56-D1920-qwen_gdn_qwen2-e1-nh6-hd64-nvh30-A0-S4096-step1-rand2b-bsz4x4-token1B Updated Oct 13
O2iginal/L56-D1920-qwen_gdn_qwen2-e1-nh6-hd64-nvh30-A0-S512-step1-rand2b-nolearn-o-bsz16x8-token1B Updated Oct 13
QwerkyAI/Qwerky-Optimized-Llama3.2-Mamba-0.2-3B-Instruct Text Generation • 4B • Updated 22 days ago • 388 • 2
QwerkyAI/Qwerky-Optimized-Llama3.1-Mamba-0.2-8B-Instruct Text Generation • 9B • Updated 14 days ago • 290 • 3