Online direction of arrival estimation based on deep learning Q Li, X Zhang, H Li 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 34 | 2018 |
A robust text-independent speaker verification method based on speech separation and deep speaker F Zhao, H Li, X Zhang ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 19 | 2019 |
Using optimal ratio mask as training target for supervised speech separation S Xia, H Li, X Zhang 2017 Asia-Pacific Signal and Information Processing Association Annual …, 2017 | 15 | 2017 |
Exploiting spectro-temporal structures using NMF for DNN-based supervised speech separation S Nie, S Liang, H Li, XL Zhang, ZL Yang, WJ Liu, LK Dong 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 10 | 2016 |
Jointly Optimizing Activation Coefficients of Convolutive NMF Using DNN for Speech Separation. H Li, S Nie, X Zhang, H Zhang Interspeech, 550-554, 2016 | 9 | 2016 |
Speakerfilter: Deep learning-based target speaker extraction using anchor speech S He, H Li, X Zhang ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 8 | 2020 |
Beamformed Feature for Learning-based Dual-channel Speech Separation H Li, X Zhang, G Gao ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 3 | 2020 |
Frame-Level Signal-to-Noise Ratio Estimation Using Deep Learning. H Li, DL Wang, X Zhang, G Gao INTERSPEECH, 4626-4630, 2020 | 3 | 2020 |
DBNet: A Dual-branch Network Architecture Processing on Spectrum and Waveform for Single-channel Speech Enhancement K Zhang, S He, H Li, X Zhang arXiv preprint arXiv:2105.02436, 2021 | 2 | 2021 |
Improve Data Utilization with Two-stage Learning in CNN-LSTM-based Voice Activity Detection T Xu, H Li, H Zhang, X Zhang 2019 Asia-Pacific Signal and Information Processing Association Annual …, 2019 | 2 | 2019 |
Integrated Speech Enhancement Method Based on Weighted Prediction Error and DNN for Dereverberation and Denoising H Li, X Zhang, H Zhang, G Gao arXiv preprint arXiv:1708.08251, 2017 | 2 | 2017 |
Robust Speech Dereverberation Based on WPE and Deep Learning H Li, X Zhang, G Gao 2020 Asia-Pacific Signal and Information Processing Association Annual …, 2020 | 1 | 2020 |
Recurrent Neural Networks and Acoustic Features for Frame-Level Signal-to-Noise Ratio Estimation H Li, DL Wang, X Zhang, G Gao IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 2878-2887, 2021 | | 2021 |
Guided Training: A Simple Method for Single-channel Speaker Separation H Li, X Zhang, G Gao arXiv preprint arXiv:2103.14330, 2021 | | 2021 |
Speakerfilter-Pro: an improved target speaker extractor combines the time domain and frequency domain S He, H Li, X Zhang arXiv preprint arXiv:2010.13053, 2020 | | 2020 |
Dynamic-attention based Encoder-decoder model for Speaker Extraction with Anchor speech H Li, X Zhang, G Gao 2019 Asia-Pacific Signal and Information Processing Association Annual …, 2019 | | 2019 |