Improved student model training for acoustic event detection models

Anthea Cheung; Qingming Tang; Chieh-Chi Kao; Ming Sun; Chao Wang

Publication

Improved student model training for acoustic event detection models

By Anthea Cheung, Qingming Tang, Chieh-Chi Kao, Ming Sun, Chao Wang

2021

Download Copy BibTeX

Share

Download

Copy BibTeX

Share

We introduce several novel knowledge distillation techniques for training a single shallow model of three recurrent layers for acoustic event detection (AED). These techniques allow us to train a generic shallow student model without many convolutional layers, ensembling, or custom modules. Gradual incorporation of pseudo-labeled data, using strong and weak pseudo-labels to train our student model, event masking in the loss function, and a custom SpecAugment procedure with event-dependent time masking all contribute to a strong event-based F1-score of 42.7%, which matches the top submission score, compared to 34.7% when training with a generic knowledge distillation method. For comparison to state-of-the-art performance, we use the ensemble model of the top submission in the challenge as a fixed teacher model.

Improved student model training for acoustic event detection models

Latest news

Work with us