view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 7 days ago • 224
view article Article Improving Hugging Face Training Efficiency Through Packing with Flash Attention 2 +4 Aug 21, 2024 • 42