Julius Richter, MSc
Audio-Visual Signal Processing
Signal Processing (SP)
Address
Universität Hamburg
Department of Informatics
SP Research Group
Office
Room: F-127
Contact
Tel: +49 40 42883-2539
Research interests
Publications
- Jean-Marie Lemercier, Julius Richter, Simon Welker, Eloi Moliner, Vesa Välimäki, Timo Gerkmann, "Diffusion Models for Audio Restoration," IEEE Signal Processing Magazine, Jan. 2025, accepted. [arxiv]
- Julius Richter, Yi-Chiao Wu, Steven Krenn, Simon Welker, Bunlong Lay, Shinji Watanabe, Alexander Richard, Timo Gerkmann, "EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation," ISCA Interspeech, Kos, Greece, Sep. 2024. [arxiv] [audio] [code]
- Julius Richter, Simon Welker, Jean-Marie Lemercier, Bunlong Lay, Tal Peer, Timo Gerkmann, "Causal Diffusion Models for Generalized Speech Enhancement," IEEE Open Journal of Signal Processing, vol. 5, pp. 780-789, 2024. [doi] [audio]
- Bunlong Lay, Jean-Marie Lemercier, Julius Richter, Timo Gerkmann, "Single and Few-step Diffusion for Generative Speech Enhancement," IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Seoul, South Korea, Apr. 2024. [doi] [arxiv] [audio] [code]
- Julius Richter, Simone Frintrop, Timo Gerkmann, "Audio-Visual Speech Enhancement with Score-Based Generative Models," ITG Conference on Speech Communication, Aachen, Germany, Sep. 2023. [doi] [arxiv] [audio]
- Danilo de Oliveira, Julius Richter, Jean-Marie Lemercier, Tal Peer, Timo Gerkmann, "On the Behavior of Intrusive and Non-intrusive Speech Enhancement Metrics in Predictive and Generative Settings," ITG Conference on Speech Communication, Aachen, Germany, Sep. 2023. [doi] [arxiv]
- Bunlong Lay, Simon Welker, Julius Richter, Timo Gerkmann, "Reducing the Prior Mismatch of Stochastic Differential Equations for Diffusion-based Speech Enhancement," ISCA Interspeech, Dublin, Ireland, Aug. 2023. [doi] [arxiv] [audio] [code]
- Hector Martel, Julius Richter, Kai Li, Xiaolin Hu, Timo Gerkmann, "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Model," ISCA Interspeech, Dublin, Ireland, Aug. 2023. [doi] [arxiv] [code]
- Julius Richter, Simon Welker, Jean-Marie Lemercier, Bunlong Lay, Tal Peer, Timo Gerkmann, "Speech Signal Improvement Using Causal Generative Diffusion Models," IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Rhodes, Greece, Jun. 2023. [doi] [arxiv] [audio]
- Jean-Marie Lemercier, Julius Richter, Simon Welker, Timo Gerkmann, "Analysing Diffusion-based Generative Approaches versus Discriminative Approaches for Speech Restoration," IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Rhodes, Greece, Jun. 2023. [doi] [arxiv]
- Jean-Marie Lemercier, Julius Richter, Simon Welker, Timo Gerkmann, "StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation," IEEE/ACM Transactions on Audio, Speech, and Language Processing, accepted, 2023. [doi] [arxiv] [audio] [code]
- Julius Richter, Simon Welker, Jean-Marie Lemercier, Bunlong Lay, Timo Gerkmann, "Speech Enhancement and Dereverberation with Diffusion-Based Generative Models," IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 31, pp. 2351-2364, 2023. [doi] [arxiv] [audio] [code]
- Simon Welker, Julius Richter, Timo Gerkmann, "Speech Enhancement with Score-Based Generative Models in the Complex STFT Domain," ISCA Interspeech, Incheon, Korea, Sep. 2022. [doi] [arxiv] [audio] [code]
- Julius Richter, Jeanine Liebold, Timo Gerkmann, "Continuous Phoneme Recognition based on Audio-Visual Modality Fusion," IEEE World Congress on Computational Intelligence, Padua, Italy, Jul. 2022. [doi] [code]
- Guillaume Carbajal, Julius Richter, Timo Gerkmann, "Disentanglement Learning for Variational Autoencoders Applied to Audio-Visual Speech Enhancement," IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, NY, USA, Oct. 2021. [doi] [arxiv]
- Guillaume Carbajal, Julius Richter, Timo Gerkmann, "Guided Variational Autoencoder for Speech Enhancement With a Supervised Classifier," IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Toronto, Ontario, Canada, Jun. 2021. [doi] [arxiv]
- Julius Richter, Guillaume Carbajal, Timo Gerkmann, "Speech Enhancement with Stochastic Temporal Convolutional Networks," ISCA Interspeech, Shanghai, China, Oct. 2020. [doi] [audio]
- Quan Nguyen, Julius Richter, Mikko Lauri, Timo Gerkmann, Simone Frintrop, "Improving mix-and-separate training in audio-visual sound source separation with an object prior," ICPR 2020. [doi]