Zhiyong WU (吴志勇)

Cited by

	All	Since 2019
Citations	3309	2636
h-index	28	25
i10-index	95	79

800

400

200

600

200520062007200820092010201120122013201420152016201720182019202020212022202320249 10 17 22 28 32 24 32 27 48 75 74 108 139 177 300 430 623 790 302

Public access

View all

100 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Jia Jia (贾珈)Professor, 清华大学(Tsinghua University)Verified email at tsinghua.edu.cn
Shiyin KangXVerse Inc.Verified email at xverse.cn
Runnan LiBeijing University of Posts and TelecommunicationsVerified email at bupt.edu.cn
Xunying LiuChinese University of Hong KongVerified email at se.cuhk.edu.hk
Dan SuTencent AI LabVerified email at tencent.com
Xu LiTencent ARC Lab; The Chinese University of Hong KongVerified email at tencent.com
Jingbei LiTsinghua UniversityVerified email at jingbei.li
Yishuang NingTsinghua UniversityVerified email at mails.tsinghua.edu.cn
Songxiang LiumiHoYoVerified email at mihoyo.com
Jun Chen（陈鋆）Tsinghua UniversityVerified email at mails.tsinghua.edu.cn
Dong Yu (俞栋)Distinguished Scientist @ Tencent AI Lab, ACM/IEEE/ISCA FellowVerified email at global.tencent.com
Xixin WuThe Chinese University of Hong KongVerified email at se.cuhk.edu.hk
Dongyang DaiVerified email at tsinghua.org.cn
Changhe SongTsinghua UniversityVerified email at mails.tsinghua.edu.cn
Yuewen CaoThe Chinese University of Hong KongVerified email at se.cuhk.edu.hk
Hung-yi LeeNational Taiwan UniversityVerified email at ntu.edu.tw
Shaoguang MaoSenior Research SDE, Microsoft Research AsiaVerified email at microsoft.com
Shen ZhangTsinghua UnviersityVerified email at tsinghua.org.cn
Yixuan Zhou (周逸轩)PhD student, Tsinghua UniversityVerified email at mails.tsinghua.edu.cn
Sheng ZhaoMicrosoftVerified email at microsoft.com

Zhiyong WU (吴志勇)

Associate Professor, Tsinghua University

Verified email at sz.tsinghua.edu.cn - Homepage

Speech synthesis Deep learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
A review of deep learning based speech synthesis Y Ning, S He, Z Wu, C Xing, L Zhang Applied Sciences 9 (19), 4050, 2019	166	2019
Speech emotion recognition using capsule networks X Wu, S Liu, Y Cao, X Li, J Yu, D Dai, X Ma, S Hu, Z Wu, X Liu, H Meng ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	118	2019
Emotion recognition from variable-length speech segments using deep learning on spectrograms X Ma, Z Wu, J Jia, M Xu, H Meng, L Cai INTERSPEECH 2018, 3683-3687, 2018	99	2018
MFA-Conformer: Multi-scale Feature Aggregation Conformer for Automatic Speaker Verification Y Zhang, Z Lv, H Wu, S Zhang, P Hu, Z Wu, H Lee, H Meng Proc. Interspeech 2022, 306-310, 2022	92	2022
Multi-level fusion of audio and visual features for speaker identification Z Wu, L Cai, H Meng Advances in Biometrics, 493-499, 2006	92	2006
A deep recurrent approach for acoustic-to-articulatory inversion P Liu, Q Yu, Z Wu, S Kang, H Meng, L Cai 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015	86	2015
Question detection from acoustic features using recurrent neural network with gated recurrent unit Y Tang, Y Huang, Z Wu, H Meng, M Xu, L Cai 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016	81	2016
Dilated Residual Network with Multi-head Self-attention for Speech Emotion Recognition R Li, Z Wu, J Jia, S Zhao, H Meng ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	78	2019
FullSubNet+: Channel Attention Fullsubnet with Complex Spectrograms for Speech Enhancement J Chen, Z Wang, D Tuo, Z Wu, S Kang, H Meng ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	72	2022
Automatic lexical stress and pitch accent detection for L2 English speech using multi-distribution deep neural networks K Li, S Mao, X Li, Z Wu, H Meng Speech Communication 96, 28-36, 2018	63	2018
Real-time synthesis of Chinese visual speech and facial expressions using MPEG-4 FAP features in a three-dimensional avatar. Z Wu, S Zhang, L Cai, HM Meng INTERSPEECH, 1802-1805, 2006	61	2006
Emotion controllable speech synthesis using emotion-unlabeled dataset with the assistance of cross-domain speech emotion recognition X Cai, D Dai, Z Wu, X Li, J Li, H Meng ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	60	2021
Learning Discriminative Features from Spectrograms Using Center Loss for Speech Emotion Recognition D Dai, Z Wu, R Li, X Wu, J Jia, H Meng ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	58	2019
Speech-XLNet: Unsupervised Acoustic Model Pretraining For Self-Attention Networks X Song, G Wang, Y Huang, Z Wu, D Su, H Meng Proc. Interspeech 2020, 3765-3769, 2020	55	2020
Modelling high-dimensional sequences with LSTM-RTRBM: application to polyphonic music generation Q Lyu, Z Wu, J Zhu, H Meng Proceedings of the 24th International Conference on Artificial Intelligence …, 2015	51	2015
Towards Multi-Scale Style Control for Expressive Speech Synthesis X Li, C Song, J Li, Z Wu, J Jia, H Meng Proc. Interspeech 2021, 4673-4677, 2021	46	2021
Towards Discriminative Representation Learning for Speech Emotion Recognition R Li, Z Wu, J Jia, Y Bu, S Zhao, H Meng Proceedings of the 28th International Joint Conference on Artificial …, 2019	46	2019
One-Shot Voice Conversion with Global Speaker Embeddings H Lu, Z Wu, D Dai, R Li, S Kang, J Jia, H Meng Proc. Interspeech 2019, 669-673, 2019	44	2019
End-to-end Code-switched TTS with Mix of Monolingual Recordings Y Cao, X Wu, S Liu, J Yu, X Li, Z Wu, X Liu, H Meng ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	43	2019
Head and facial gestures synthesis using PAD model for an expressive talking avatar J Jia, Z Wu, S Zhang, HM Meng, L Cai Multimedia Tools and Applications 73 (1), 439-461, 2014	43	2014

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors