Follow
Wen-Chin Huang
Title
Cited by
Cited by
Year
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech
X Wang, J Yamagishi, M Todisco, H Delgado, A Nautsch, N Evans, ...
Computer Speech & Language 64, 101114, 2020
4112020
Mosnet: Deep learning based objective assessment for voice conversion
CC Lo, SW Fu, WC Huang, X Wang, J Yamagishi, Y Tsao, HM Wang
arXiv preprint arXiv:1904.08352, 2019
3182019
Voice Conversion Challenge 2020–-Intra-lingual semi-parallel and cross-lingual voice conversion–-}}
Z Yi, WC Huang, X Tian, J Yamagishi, RK Das, T Kinnunen, Z Ling, ...
Proc. Joint Workshop for the Blizzard Challenge and Voice Conversion …, 2020
236*2020
Generalization ability of MOS prediction networks
E Cooper, WC Huang, T Toda, J Yamagishi
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
1542022
The voicemos challenge 2022
WC Huang, E Cooper, Y Tsao, HM Wang, T Toda, J Yamagishi
arXiv preprint arXiv:2203.11389, 2022
1222022
Voice transformer network: Sequence-to-sequence voice conversion using transformer with text-to-speech pretraining
WC Huang, T Hayashi, YC Wu, H Kameoka, T Toda
arXiv preprint arXiv:1912.06813, 2019
1122019
SUPERB-SG: Enhanced speech processing universal performance benchmark for semantic and generative capabilities
HS Tsai, HJ Chang, WC Huang, Z Huang, K Lakhotia, S Yang, S Dong, ...
arXiv preprint arXiv:2203.06849, 2022
952022
Ldnet: Unified listener dependent modeling in mos prediction for synthetic speech
WC Huang, E Cooper, J Yamagishi, T Toda
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
732022
Predictions of subjective ratings and spoofing assessments of voice conversion challenge 2020 submissions
RK Das, T Kinnunen, WC Huang, Z Ling, J Yamagishi, Y Zhao, X Tian, ...
arXiv preprint arXiv:2009.03554, 2020
592020
The 2020 espnet update: new features, broadened applications, performance improvements, and future plans
S Watanabe, F Boyer, X Chang, P Guo, T Hayashi, Y Higuchi, T Hori, ...
2021 IEEE Data Science and Learning Workshop (DSLW), 1-6, 2021
562021
Voice conversion based on cross-domain features using variational auto encoders
WC Huang, HT Hwang, YH Peng, Y Tsao, HM Wang
2018 11th International Symposium on Chinese Spoken Language Processing …, 2018
542018
The singing voice conversion challenge 2023
WC Huang, LP Violeta, S Liu, J Shi, T Toda
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
512023
Unsupervised representation disentanglement using cross domain features and adversarial learning in variational autoencoder based voice conversion
WC Huang, H Luo, HT Hwang, CC Lo, YH Peng, Y Tsao, HM Wang
IEEE Transactions on Emerging Topics in Computational Intelligence 4 (4 …, 2020
512020
Pretraining Techniques for Sequence-to-Sequence Voice Conversion
WC Huang, T Hayashi, YC Wu, H Kameoka, T Toda
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 745 - 755, 2021
472021
The sequence-to-sequence baseline for the voice conversion challenge 2020: Cascading asr and tts
WC Huang, T Hayashi, S Watanabe, T Toda
arXiv preprint arXiv:2010.02434, 2020
472020
S3prl-vc: Open-source voice conversion framework with self-supervised speech representations
WC Huang, SW Yang, T Hayashi, HY Lee, S Watanabe, T Toda
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
462022
Any-to-one sequence-to-sequence voice conversion using self-supervised discrete speech representations
WC Huang, YC Wu, T Hayashi
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
382021
Many-to-many voice transformer network
H Kameoka, WC Huang, K Tanaka, T Kaneko, N Hojo, T Toda
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 656-670, 2020
382020
Investigating self-supervised pretraining frameworks for pathological speech recognition
LP Violeta, WC Huang, T Toda
arXiv preprint arXiv:2203.15431, 2022
352022
Speech recognition by simply fine-tuning BERT
WC Huang, CH Wu, SB Luo, KY Chen, HM Wang, T Toda
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
352021
The system can't perform the operation now. Try again later.
Articles 1–20