DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models Paper • 2512.02556 • Published 8 days ago • 187
Transformers v5: Simple model definitions powering the AI ecosystem Article • 9 days ago • 231
view post Post 8251 Qwen3-Next can now be Run locally! (30GB RAM)Instruct GGUF: unsloth/Qwen3-Next-80B-A3B-Instruct-GGUFThe models come in Thinking and Instruct versions and utilize a new architecture, allowing it to have ~10x faster inference than Qwen32B.š Step-by-step Guide: https://docs.unsloth.ai/models/qwen3-nextThinking GGUF: unsloth/Qwen3-Next-80B-A3B-Thinking-GGUF See translation š„ 37 37 ā¤ļø 11 11 š 7 7 š¤ 3 3 + Reply
Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation Paper • 2511.14993 • Published 21 days ago • 222
Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations Paper • 2510.23607 • Published Oct 27 • 174
SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights Paper • 2509.22944 • Published Sep 26 • 79
A.S.E: A Repository-Level Benchmark for Evaluating Security in AI-Generated Code Paper • 2508.18106 • Published Aug 25 • 345
Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing Paper • 2509.08721 • Published Sep 10 • 660 • 53
Intern-S1: A Scientific Multimodal Foundation Model Paper • 2508.15763 • Published Aug 21 • 256
ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation Paper • 2506.18095 • Published Jun 22 • 66
Zonos: Generate audio from text with customizable emotions and settings • Running on Zero • Featured • 410