dkounadis
/

wav2small

@@ -20,11 +20,18 @@ tags:
 # Arousal - Dominance - Valence
 Speech Emotion Recognition model from combined use of [Wavlm](https://huggingface.co/3loi/SER-Odyssey-Baseline-WavLM-Multi-Attributes) / [wav2vec2](https://hf.rst.im/audeering/wav2vec2-large-robust-12-ft-emotion-msp-dim).
-Achieves `0.6760566` valence CCC on [MSP-Podcast](https://ecs.utdallas.edu/research/researchlabs/msp-lab/MSP-Podcast.html) Test 1. Used as teacher for [wav2small]().
-# Benchmarks
 <table style="width:500px">
   <tr><th colspan=6 align="center" >CCC MSP Podcast v1.7</th></tr>
   <tr><th colspan=3 align="center">Test 1</th><th colspan=3 align="center">Test 2</th></tr>

 # Arousal - Dominance - Valence
 Speech Emotion Recognition model from combined use of [Wavlm](https://huggingface.co/3loi/SER-Odyssey-Baseline-WavLM-Multi-Attributes) / [wav2vec2](https://hf.rst.im/audeering/wav2vec2-large-robust-12-ft-emotion-msp-dim).
+Achieves `0.6760566` valence CCC on [MSP-Podcast](https://ecs.utdallas.edu/research/researchlabs/msp-lab/MSP-Podcast.html) Test 1. Used as teacher for [wav2small](https://arxiv.org/abs/2408.13920v1).
+# [arXiv](https://arxiv.org/abs/2408.13920v1)
+```
+Wav2Small: Distilling Wav2Vec2 to 72K parameters for Low-Resource Speech emotion recognition.
+Dionyssos Kounadis-Bastian, Oliver Schrüfer, Anna Derington, Hagen Wierstorf, Florian Eyben, Felix Burkhardt, Björn Schuller.
+2024, arXiV Preprint
+```
 <table style="width:500px">
   <tr><th colspan=6 align="center" >CCC MSP Podcast v1.7</th></tr>
   <tr><th colspan=3 align="center">Test 1</th><th colspan=3 align="center">Test 2</th></tr>