5 Papers at Interspeech 2022
15 June 2022, by Timo Gerkmann
We are happy to announce 5 papers on speech enhancement and emotion recognition for Interspeech 2022 in Korea. Hope to see you there!
Methods include
- Score-based Generative Models
- Bayesian Deep Neural Networks
- Deep Kalman Filtering
- End-to-end learning
Applications include
- Multichannel Speech Enhancement
- Single Channel Speech Enhancement
- Dereverberation
- Emotion Recognition
Please find more details under "publications" / Peer-reviewed Conferences
- Kristina Tesch, Nils-Hendrik Mohrmann, Timo Gerkmann, "On the Role of Spatial, Spectral, and Temporal Processing for DNN-based Non-linear Multi-channel Speech Enhancement", Interspeech, Incheon, Korea, Sep. 2022
- Navin Raj Prabhu, Guillaume Carbajal, Nale Lehmann-Willenbrock, Timo Gerkmann, "End-To-End Label Uncertainty Modeling for Speech-based Arousal Recognition Using Bayesian Neural Networks", Interspeech, Incheon, Korea, Sep. 2022
- Simon Welker, Julius Richter, Timo Gerkmann, "Speech Enhancement with Score-Based Generative Models in the Complex STFT Domain", Interspeech, Incheon, Korea, Sep. 2022
- Danilo de Oliveira, Tal Peer, Timo Gerkmann, "Efficient Transformer-based Speech Enhancement Using Long Frames and STFT Magnitudes", Interspeech, Incheon, Korea, Sep. 2022
- Jean-Marie Lemercier, Joachim Thiemann, Raphael Koning, Timo Gerkmann, "Neural Network-augmented Kalman Filtering for Robust Online Speech Dereverberation in Noisy Reverberant Environment", Interspeech, Incheon, Korea, Sep. 2022