Hugging Face, NVIDIA, Mistral AI, and the University of Cambridge launch the Open ASR Leaderboard, a public benchmark for ASR ...
Abstract: This paper reports how speech recognition accuracy can be improved using the speech few-shot in-context learning capabilities of a multimodal foundation model when applied to the speech of ...