Add library_name, link to code and paper
#5 opened by nielsr (HF Staff)

README.md CHANGED
@@ -1,14 +1,17 @@
 ---
+language:
+- en
 license: other
+library_name: transformers
 license_name: qwen-research
 license_link: https://huggingface.co/Qwen/Qwen2.5-3B/blob/main/LICENSE
-language:
-- en
 pipeline_tag: text-generation
 ---
 
 # Qwen2.5-3B
 
+This repository contains the 3B Qwen2.5 checkpoint described in the paper [Making LLMs Better Many-to-Many Speech-to-Text Translators with Curriculum Learning](https://huggingface.co/papers/2409.19510).
+
 ## Introduction
 
 Qwen2.5 is the latest series of Qwen large language models. For Qwen2.5, we release a number of base language models and instruction-tuned language models ranging from 0.5 to 72 billion parameters. Qwen2.5 brings the following improvements upon Qwen2:
@@ -47,6 +50,8 @@ Detailed evaluation results are reported in this [📑 blog](https://qwenlm.gith
 
 For requirements on GPU memory and the respective throughput, see results [here](https://qwen.readthedocs.io/en/latest/benchmark/speed_benchmark.html).
 
+The Github repository for the paper is https://github.com/yxduir/LLM-SRT
+
 ## Citation
 
 If you find our work helpful, feel free to give us a cite.
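
For context (not part of the PR diff): the added `library_name: transformers` tag tells the Hub which library loads this checkpoint. A minimal, illustrative sketch of that usage with the standard `AutoModelForCausalLM` API follows; the prompt text is an arbitrary example.

```python
# Illustrative only: what the `library_name: transformers` metadata implies.
# Requires `transformers` (and `accelerate` for device_map="auto").
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen2.5-3B"  # the base checkpoint this card describes

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",   # use the dtype stored in the checkpoint
    device_map="auto",    # place weights on available GPU(s)/CPU
)

# This is a base (non-instruct) model, so plain text completion rather than chat.
inputs = tokenizer("Qwen2.5 is a series of large language models that", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```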