5 Papers at Interspeech 2022

15 June 2022, by Timo Gerkmann

We are happy to announce 5 papers on speech enhancement and emotion recognition for Interspeech 2022 in Korea. Hope to see you there!

Methods include

Score-based Generative Models
Bayesian Deep Neural Networks
Deep Kalman Filtering
End-to-end learning

Applications include

Multichannel Speech Enhancement
Single Channel Speech Enhancement
Dereverberation
Emotion Recognition

Please find more details under "publications" / Peer-reviewed Conferences

Kristina Tesch, Nils-Hendrik Mohrmann, Timo Gerkmann, "On the Role of Spatial, Spectral, and Temporal Processing for DNN-based Non-linear Multi-channel Speech Enhancement", Interspeech, Incheon, Korea, Sep. 2022
Navin Raj Prabhu, Guillaume Carbajal, Nale Lehmann-Willenbrock, Timo Gerkmann, "End-To-End Label Uncertainty Modeling for Speech-based Arousal Recognition Using Bayesian Neural Networks", Interspeech, Incheon, Korea, Sep. 2022
Simon Welker, Julius Richter, Timo Gerkmann, "Speech Enhancement with Score-Based Generative Models in the Complex STFT Domain", Interspeech, Incheon, Korea, Sep. 2022
Danilo de Oliveira, Tal Peer, Timo Gerkmann, "Efficient Transformer-based Speech Enhancement Using Long Frames and STFT Magnitudes", Interspeech, Incheon, Korea, Sep. 2022
Jean-Marie Lemercier, Joachim Thiemann, Raphael Koning, Timo Gerkmann, "Neural Network-augmented Kalman Filtering for Robust Online Speech Dereverberation in Noisy Reverberant Environment", Interspeech, Incheon, Korea, Sep. 2022

Latest articles

30.01.2026|SP

6 Papers at ICASSP 2026

Excited to share that our team will be presenting six papers at ICASSP 2026. Congratulations and thanks to all co-authors for their excellent work.

• Are Modern Speech Enhancement Systems Vulnerable to Adversarial Attacks?
Rostislav Makarov, Lea Schönherr, Timo Gerkmann
[audio], [arxiv]

We show that...

Photo: Gerhard Richter

12.09.2025|SP

Dissertation Julius Richter

Julius Richter successfully defended his PhD degree with his thesis "Generative Speech Enhancement in Multimodal Applications". Thanks to committee members and in particular the external members Shinji Watanabe and Simon Leglaive!

Julius Richter's dissertation advances generative speech enhancement...