view article Article Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand 4 days ago • 49
view article Article Introducing swift-huggingface: The Complete Swift Client for Hugging Face 3 days ago • 20
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 40 items • Updated Jul 21 • 348
Qwen3 Collection Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 79 items • Updated 6 days ago • 240
Qwen3-VL Collection Qwen's new multimodal vision models in GGUF, safetensor, and dynamic Unsloth formats. • 56 items • Updated 6 days ago • 17
Ministral 3 - Additional Checkpoints Collection Different formats and Quantized versions of our Ministral 3 family; 14B/8B/3B Instruct/Reasoning GGUF, 3B Instruct ONNX and 14B/8B/3B Instruct BF16. • 13 items • Updated 6 days ago • 12
Open ASR Leaderboard: Towards Reproducible and Transparent Multilingual and Long-Form Speech Recognition Evaluation Paper • 2510.06961 • Published Oct 8 • 10
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 7 days ago • 224
view article Article Introducing AnyLanguageModel: One API for Local and Remote LLMs on Apple Platforms 18 days ago • 31