Steveeeeeeen HF Staff commited on
Commit
636c163
·
verified ·
1 Parent(s): 9718048

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +14 -15
README.md CHANGED
@@ -37,21 +37,20 @@ This model is part of the **Omnilingual ASR** family released by Meta AI. The or
37
  <!-- TODO : add new tokenizer, we'll get two tokenizer, add mssing speed numbers-->
38
  | Model Name | Features | Parameters | Download Size (FP32) | Inference VRAM¹ | Real-Time Factor¹ (relative speed)² |
39
  |---------------------|---------------|------------:|---------------:|---------------:|-----------:|
40
- | [`omniASR_W2V_300M`](https://dl.fbaipublicfiles.com/mms/omniASR-W2V-300M.pt) | SSL | 317_390_592 | 1.2 GiB | | |
41
- | [`omniASR_W2V_1B`](https://dl.fbaipublicfiles.com/mms/omniASR-W2V-1B.pt) | SSL | 965_514_752 | 3.6 GiB | | |
42
- | [`omniASR_W2V_3B`](https://dl.fbaipublicfiles.com/mms/omniASR-W2V-3B.pt) | SSL | 3_064_124_672 | 12.0 GiB | | |
43
- | [`omniASR_W2V_7B`](https://dl.fbaipublicfiles.com/mms/omniASR-W2V-7B.pt) | SSL | 6_488_487_168 | 25.0 GiB | | |
44
- | [`omniASR_CTC_300M`](https://dl.fbaipublicfiles.com/mms/omniASR-CTC-300M.pt) | ASR | 325_494_996 | 1.3 GiB | ~2 GiB | 0.001 (96x) |
45
- | [`omniASR_CTC_1B`](https://dl.fbaipublicfiles.com/mms/omniASR-CTC-1B.pt) | ASR | 975_065_300 | 3.7 GiB | ~3 GiB | 0.002 (48x) |
46
- | [`omniASR_CTC_3B`](https://dl.fbaipublicfiles.com/mms/omniASR-CTC-3B.pt) | ASR | 3_080_423_636 | 12.0 GiB | ~8 GiB | 0.003 (32x) |
47
- | [`omniASR_CTC_7B`](https://dl.fbaipublicfiles.com/mms/omniASR-CTC-7B.pt) | ASR | 6_504_786_132 | 25.0 GiB | ~15 GiB | 0.006 (16x) |
48
- | [`omniASR_LLM_300M`](https://dl.fbaipublicfiles.com/mms/omniASR-LLM-300M.pt) | ASR with optional language conditioning | 1_627_603_584 | 6.1 GiB | ~5 GiB | 0.090 (~1x) |
49
- | [`omniASR_LLM_1B`](https://dl.fbaipublicfiles.com/mms/omniASR-LLM-1B.pt) | ASR with optional language conditioning | 2_275_710_592 | 8.5 GiB | ~6 GiB | 0.091 (~1x) |
50
- | [`omniASR_LLM_3B`](https://dl.fbaipublicfiles.com/mms/omniASR-LLM-3B.pt) | ASR with optional language conditioning | 4_376_679_040 | 17.0 GiB | ~10 GiB | 0.093 (~1x) |
51
- | [`omniASR_LLM_7B`](https://dl.fbaipublicfiles.com/mms/omniASR-LLM-7B.pt) | ASR with optional language conditioning | 7_801_041_536 | 30.0 GiB | ~17 GiB | 0.092 (~1x) |
52
- | [`omniASR_LLM_7B_ZS`](https://dl.fbaipublicfiles.com/mms/omniASR-LLM-7B-ZS.pt) | Zero-Shot ASR | 7_810_900_608 | 30.0 GiB | ~20 GiB | 0.194 (~0.5x) |
53
- | [`omniASR_tokenizer`](https://dl.fbaipublicfiles.com/mms/omniASR_tokenizer.model) | Tokenizer for most of architectures (except omniASR_LLM_7B) | - | 100 KiB | - |
54
- | [`omniASR_tokenizer_v7`](https://dl.fbaipublicfiles.com/mms/omniASR_tokenizer_v7.model) | Tokenizer for omniASR_LLM_7B model | - | 100 KiB | - ||
55
 
56
  ¹ (batch=1, audio_len=30s, BF16, A100)
57
 
 
37
  <!-- TODO : add new tokenizer, we'll get two tokenizer, add mssing speed numbers-->
38
  | Model Name | Features | Parameters | Download Size (FP32) | Inference VRAM¹ | Real-Time Factor¹ (relative speed)² |
39
  |---------------------|---------------|------------:|---------------:|---------------:|-----------:|
40
+ | [`omniASR_W2V_300M`](https://huggingface.co/Steveeeeeeen/omniASR-W2V-300M) | SSL | 317_390_592 | 1.2 GiB | | |
41
+ | [`omniASR_W2V_1B`](https://huggingface.co/Steveeeeeeen/omniASR-W2V-1B) | SSL | 965_514_752 | 3.6 GiB | | |
42
+ | [`omniASR_W2V_3B`](https://huggingface.co/Steveeeeeeen/omniASR-W2V-3B) | SSL | 3_064_124_672 | 12.0 GiB | | |
43
+ | [`omniASR_W2V_7B`](https://huggingface.co/Steveeeeeeen/omniASR-W2V-7B) | SSL | 6_488_487_168 | 25.0 GiB | | |
44
+ | [`omniASR_CTC_300M`](https://huggingface.co/Steveeeeeeen/omniASR-CTC-300M) | ASR | 325_494_996 | 1.3 GiB | ~2 GiB | 0.001 (96x) |
45
+ | [`omniASR_CTC_1B`](https://huggingface.co/Steveeeeeeen/omniASR-CTC-1B) | ASR | 975_065_300 | 3.7 GiB | ~3 GiB | 0.002 (48x) |
46
+ | [`omniASR_CTC_3B`](https://huggingface.co/Steveeeeeeen/omniASR-CTC-3B) | ASR | 3_080_423_636 | 12.0 GiB | ~8 GiB | 0.003 (32x) |
47
+ | [`omniASR_CTC_7B`](https://huggingface.co/Steveeeeeeen/omniASR-CTC-7B) | ASR | 6_504_786_132 | 25.0 GiB | ~15 GiB | 0.006 (16x) |
48
+ | [`omniASR_LLM_300M`](https://huggingface.co/Steveeeeeeen/omniASR-LLM-300M) | ASR with optional language conditioning | 1_627_603_584 | 6.1 GiB | ~5 GiB | 0.090 (~1x) |
49
+ | [`omniASR_LLM_1B`](https://huggingface.co/Steveeeeeeen/omniASR-LLM-1B) | ASR with optional language conditioning | 2_275_710_592 | 8.5 GiB | ~6 GiB | 0.091 (~1x) |
50
+ | [`omniASR_LLM_3B`](https://huggingface.co/Steveeeeeeen/omniASR-LLM-3B) | ASR with optional language conditioning | 4_376_679_040 | 17.0 GiB | ~10 GiB | 0.093 (~1x) |
51
+ | [`omniASR_LLM_7B`](https://huggingface.co/Steveeeeeeen/omniASR-LLM-7B) | ASR with optional language conditioning | 7_801_041_536 | 30.0 GiB | ~17 GiB | 0.092 (~1x) |
52
+ | [`omniASR_LLM_7B_ZS`](https://huggingface.co/Steveeeeeeen/omniASR-LLM-7B-ZS) | Zero-Shot ASR | 7_810_900_608 | 30.0 GiB | ~20 GiB | 0.194 (~0.5x) |
53
+
 
54
 
55
  ¹ (batch=1, audio_len=30s, BF16, A100)
56