I am a 4+4 PhD Fellow (joint long master's thesis and Ph.D.) at the Department of Electronic Systems, Aalborg University and the Centre for Acoustic Signal Processing Research (CASPR). My research focuses on developing and evaluating the newest sequence modelling neural networks like Mamba and xLSTM for single-channel speech enhancement, and I'm advised by Prof. Zheng-Hua Tan, Prof. Jan Østergaard and Prof. Jesper Jensen.
Exploring Resolution-Wise Shared Attention in Hybrid Mamba-U-Nets for Improved Cross-Corpus Speech Enhancement
2026 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2026)
Our proposed RWSA-MambaUNet models significantly outperform state-of-the-art single-channel speech enhancement systems on out-of-domain datasets with a substantially lower computational complexity.
MambAttention: Mamba with Multi-Head Attention for Generalizable Single-Channel Speech Enhancement
IEEE Transactions on Audio, Speech and Language Processing
MambAttention significantly matches or outperforms discriminative and generative state-of-the-art single-channel speech enhancement systems on out-of-domain datasets. t-SNE plots reveal that our shared multi-head attention module encourages the model to learn dataset-invariant features.
xLSTM-SENet: xLSTM for Single-Channel Speech Enhancement
Interspeech 2025
xLSTM-SENet matches or outperforms state-of-the-art single-channel speech enhancement systems. Additionaly, we find that a correctly configured LSTM also matches SOTA Mamba- and Conformer-based systems of similar complexity in speech enhancement.
Detecting and Defending Against Adversarial Attacks on Automatic Speech Recognition via Diffusion Models
2025 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025) *Equal contribution. (Oral Presentation)
We show that pre-trained diffusion models can effectively detect and defend against targeted adversarial attacks on automatic speech recognition systems with success rates.
Invited speaker at Oticon A/S (part of Demant) in February 2026. The topic of the talk was generalization performance of neural architectures for speech enhancement.
Oral presentation of our paper: Detecting and Defending Against Adversarial Attacks on Automatic Speech Recognition via Diffusion Models at the IEEE International Conference on Acoustics, Speech, and Signal Processing (IEEE ICASSP), Hyderabad, 2025.
(Update 05/25: Extension with additional resources was awarded (Call H2-2025)) I was awarded a sizeable grant for computing time on the Danish national e-resources (Call H1-2025), which gives me access to the LUMI Supercomputer, 2025.
Won AAU SEMCON (7th Semester conference) out of 22 participating groups at the Department of Electronic Systems, Aalborg University, 2023.