This language-independent wav2vec2 classification model is based on this dataset.

The model distinguishes 16 sound classes:

teeth-chattering
teeth-grinding
tongue-clicking
nose-blowing
coughing
yawning
throat clearing
sighing
lip-popping
lip-smacking
panting
crying
laughing
sneezing
moaning
screaming

inference.py shows how the model can be used.
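inference.py remains the authoritative usage example. As a rough illustration of the last step of any classifier of this kind, the sketch below turns a vector of 16 raw scores (logits) into ranked class names. It is not taken from inference.py: the helper names `softmax` and `top_k` are hypothetical, the logits are made up, and the label order is assumed to follow the list above, which may differ from the mapping stored in the model's configuration.

```python
import math

# Assumption: label order follows the list in this card. The real mapping
# is defined by the model's configuration and may differ.
LABELS = [
    "teeth-chattering", "teeth-grinding", "tongue-clicking", "nose-blowing",
    "coughing", "yawning", "throat clearing", "sighing",
    "lip-popping", "lip-smacking", "panting", "crying",
    "laughing", "sneezing", "moaning", "screaming",
]


def softmax(logits):
    """Convert raw scores to probabilities (numerically stable form)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def top_k(logits, k=3):
    """Return the k most probable (label, probability) pairs."""
    if len(logits) != len(LABELS):
        raise ValueError(f"expected {len(LABELS)} logits, got {len(logits)}")
    probs = softmax(logits)
    ranked = sorted(zip(LABELS, probs), key=lambda pair: pair[1], reverse=True)
    return ranked[:k]


# Made-up logits for illustration only, not real model output.
example_logits = [0.0] * len(LABELS)
example_logits[4] = 5.0   # index of "coughing" under the assumed order
example_logits[13] = 3.0  # index of "sneezing" under the assumed order

for label, prob in top_k(example_logits):
    print(f"{label}: {prob:.3f}")
```

In practice the logits would come from running the model on an audio clip, and the label names should be read from the model's own configuration rather than hard-coded.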
