Wav2Vec 2.0
A collection for the first release of Wav2Vec 2.0, a speech encoder that learns powerful representations from unlabelled audio data.
Automatic Speech Recognition • Updated • 110k • 134Note The Wav2Vec 2.0 "large" model pre-trained on 53k hours of un-labelled audio data from the LibriSpeech and LibriVox (LV) corpora, and fine-tuned on 960 hours of LibriSpeech ASR data. This is the most performant Wav2Vec 2.0 checkpoint from the initial release, obtaining 1.9/3.9% WER on the LibriSpeech test clean/other subsets respectively.
facebook/wav2vec2-large-960h
Automatic Speech Recognition • Updated • 78.1k • 26Note The Wav2Vec 2.0 "large" model pre-trained and fine-tuned on 960 hours of LibriSpeech ASR data.
facebook/wav2vec2-base-960h
Automatic Speech Recognition • Updated • 1.28M • • 307Note The Wav2Vec 2.0 "base" model pre-trained and fine-tuned on 960 hours of LibriSpeech ASR data.
facebook/wav2vec2-base-100h
Automatic Speech Recognition • Updated • 2.86k • 6Note The Wav2Vec 2.0 "base" model pre-trained on 960 hours of un-labelled LibriSpeech ASR data, and fine-tuned on 100 hours of labelled LibriSpeech ASR data.
facebook/wav2vec2-large-lv60
Updated • 13k • 6Note The Wav2Vec 2.0 "large" model pre-trained on 53k hours of un-labelled data from the LibriSpeech and LibriVox (LV) corpora.
facebook/wav2vec2-large
Updated • 4.41k • 4Note The Wav2Vec 2.0 "large" model pre-trained on 960 hours of un-labelled LibriSpeech ASR data.
facebook/wav2vec2-base
Updated • 1.23M • 79Note The Wav2Vec 2.0 "base" model pre-trained on 960 hours of un-labelled LibriSpeech ASR data.
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Paper • 2006.11477 • Published • 4Note The wav2vec 2.0 paper, accepted to NeurIPS 2020.