A comprehensive study of speech separation: spectrogram vs waveform separation F Bahmaninezhad, J Wu, R Gu, SX Zhang, Y Xu, M Yu, D Yu arXiv preprint arXiv:1905.07497, 2019 | 63 | 2019 |
End-to-end multi-channel speech separation R Gu, J Wu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu arXiv preprint arXiv:1905.06286, 2019 | 48 | 2019 |
Multi-modal multi-channel target speech separation R Gu, SX Zhang, Y Xu, L Chen, Y Zou, D Yu IEEE Journal of Selected Topics in Signal Processing 14 (3), 530-541, 2020 | 44 | 2020 |
Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information. R Gu, L Chen, SX Zhang, J Zheng, Y Xu, M Yu, D Su, Y Zou, D Yu Interspeech, 4290-4294, 2019 | 44 | 2019 |
Enhancing end-to-end multi-channel speech separation via spatial feature learning R Gu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 33 | 2020 |
Audio-visual multi-channel recognition of overlapped speech J Yu, B Wu, R Gu, SX Zhang, L Chen, YX Yu, D Su, D Yu, X Liu, H Meng arXiv preprint arXiv:2005.08571, 2020 | 17 | 2020 |
Complex neural spatial filter: Enhancing multi-channel target speech separation in complex domain R Gu, SX Zhang, Y Zou, D Yu IEEE Signal Processing Letters 28, 1370-1374, 2021 | 8 | 2021 |
Temporal-spatial neural filter: Direction informed end-to-end multi-channel target speech separation R Gu, Y Zou arXiv preprint arXiv:2001.00391, 2020 | 8 | 2020 |
Learning a robust DOA estimation model with acoustic vector sensor cues Y Zou, R Gu, D Wang, A Jiang, CH Ritz 2017 Asia-Pacific Signal and Information Processing Association Annual …, 2017 | 6 | 2017 |
Speaker-discriminative embedding learning via affinity matrix for short utterance speaker verification J Peng, R Gu, Y Zou, W Wang 2019 Asia-Pacific Signal and Information Processing Association Annual …, 2019 | 4 | 2019 |
ICSpk: Interpretable Complex Speaker Embedding Extractor from Raw Waveform. J Peng, X Qu, J Wang, R Gu, J Xiao, L Burget, J Cernocký Interspeech, 511-515, 2021 | 2 | 2021 |
Deep Speaker Embedding with Long Short Term Centroid Learning for Text-Independent Speaker Verification. J Peng, R Gu, Y Zou INTERSPEECH, 3246-3250, 2020 | 2 | 2020 |
Interaction data detection system to upgrade brick and mortar shops: Metrics allow offline shops to compete with online retailers X Su, R Gu, G Han, D Choi IEEE Consumer Electronics Magazine 6 (4), 57-63, 2017 | 2 | 2017 |
Interest degree of products analysis by RFID technology for offline shops marketing optimization X Su, R Gu, C Qi, X Zhang, D Choi 2016 IEEE Advanced Information Management, Communicates, Electronic and …, 2016 | 2 | 2016 |
Text anchor based metric learning for small-footprint keyword spotting L Wang, R Gu, N Chen, Y Zou arXiv preprint arXiv:2108.05516, 2021 | 1 | 2021 |
Logistic similarity metric learning via affinity matrix for text-independent speaker verification J Peng, R Gu, Y Zou 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 1 | 2019 |
Improving Dual-Microphone Speech Enhancement by Learning Cross-Channel Features with Multi-Head Attention X Xu, R Gu, Y Zou ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | | 2022 |
Learning Decoupling Features Through Orthogonality Regularization L Wang, R Gu, W Zhuang, P Gao, Y Wang, Y Zou ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | | 2022 |
Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction Z Zhao, R Gu, D Yang, J Tian, Y Zou arXiv preprint arXiv:2204.07375, 2022 | | 2022 |
Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches Z Zhao, D Yang, R Gu, H Zhang, Y Zou arXiv preprint arXiv:2204.01355, 2022 | | 2022 |