CSTF-SENet: A Single-Channel Speech Enhancement Model with Cross-Scale Temporal-Frequency Transformer
Supervised Attention Multi-Scale Temporal Convolutional Network for Monaural Speech Enhancement in Real Scenarios
Two-stage UNet with multi-axis gated multilayer perceptron for monaural noisy-reverberant speech enhancement