stereoplegic
's Collections
HyPoradise: An Open Baseline for Generative Speech Recognition with
Large Language Models
Paper
•
2309.15701
•
Published
•
2
CoLLD: Contrastive Layer-to-layer Distillation for Compressing
Multilingual Pre-trained Speech Encoders
Paper
•
2309.07707
•
Published
•
1
Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo
Labelling
Paper
•
2311.00430
•
Published
•
56
Reproducing Whisper-Style Training Using an Open-Source Toolkit and
Publicly Available Data
Paper
•
2309.13876
•
Published
•
1
Corpus Synthesis for Zero-shot ASR domain Adaptation using Large
Language Models
Paper
•
2309.10707
•
Published
•
1
Massive End-to-end Models for Short Search Queries
Paper
•
2309.12963
•
Published
•
1
Whispering LLaMA: A Cross-Modal Generative Error Correction Framework
for Speech Recognition
Paper
•
2310.06434
•
Published
•
4
MSTRE-Net: Multistreaming Acoustic Modeling for Automatic Lyrics
Transcription
Paper
•
2108.02625
•
Published
•
1
Beyond Universal Transformer: block reusing with adaptor in Transformer
for automatic speech recognition
Paper
•
2303.13072
•
Published
•
1
Continual Learning for Monolingual End-to-End Automatic Speech
Recognition
Paper
•
2112.09427
•
Published
•
1
ILASR: Privacy-Preserving Incremental Learning for Automatic Speech
Recognition at Production Scale
Paper
•
2207.09078
•
Published
•
1
Multilingual Byte2Speech Models for Scalable Low-resource Speech
Synthesis
Paper
•
2103.03541
•
Published
Bilingual End-to-End ASR with Byte-Level Subwords
Paper
•
2205.00485
•
Published