Whisper for audio captioning Collection Whisper models finetuned on audio captioning instead of speech recognition. These model aim to briefly describe what happens in the audio scene. • 3 items • Updated Oct 30, 2023 • 2