facebook/wav2vec2-base-960h
Automatic Speech Recognition
•
94.4M
•
Updated
•
1.93M
•
383
Generate spatial audio from images (and optionally text)
Paper Whisperer