Ye Jia

Cited by

	All	Since 2019
Citations	5722	5659
h-index	21	21
i10-index	26	26

1600

800

400

1200

201820192020202120222023202445 300 640 1023 1352 1555 779

Public access

View all

1 article

0 articles

available

not available

Based on funding mandates

Co-authors

Yonghui WuGoogle BrainVerified email at google.com
Ron J WeissGoogleVerified email at google.com
Yu ZhangOpenAIVerified email at csail.mit.edu
Jonathan ShenGoogleVerified email at google.com
Heiga ZenPrincipal Scientist (Director), Google DeepMindVerified email at google.com
Zhifeng ChenGoogle Inc.Verified email at google.com
Quan WangSenior Staff Software Engineer @ Google; Instructor @ Udemy; Textbook Author; IEEE Senior MemberVerified email at google.com
Ignacio Lopez MorenoGoogle IncVerified email at google.com
Patrick NguyenResearch Scientist, Google, Inc.Verified email at google.com
Melvin JohnsonResearcher, GoogleVerified email at stanford.edu
RJ Skerry-RyanGoogle, Inc.Verified email at alum.mit.edu
Rob ClarkGoogleVerified email at google.com
Yuxuan WangByteDanceVerified email at cse.ohio-state.edu
Michelle Tadmor (Ramanovich)GoogleVerified email at google.com
Bhuvana RamabhadranManager, GoogleVerified email at google.com
Andrew RosenbergGoogleVerified email at google.com
Ankur BapnaSoftware Engineer, Google DeepmindVerified email at google.com
Yuan CaoGoogle DeepMindVerified email at google.com
Chung-Cheng ChiuAppleVerified email at apple.com
Wolfgang MachereyGoogle ResearchVerified email at google.com

Ye Jia

Meta

Verified email at google.com - Homepage

Speech synthesis Speech translation


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis Y Jia, Y Zhang, RJ Weiss, Q Wang, J Shen, F Ren, Z Chen, P Nguyen, ... Advances in Neural Information Processing Systems, 2018	923	2018
Style tokens: Unsupervised style modeling, control and transfer in end-to-end speech synthesis Y Wang, D Stanton, Y Zhang, RJS Ryan, E Battenberg, J Shor, Y Xiao, ... International conference on machine learning, 5180-5189, 2018	918	2018
Libritts: A corpus derived from librispeech for text-to-speech H Zen, V Dang, R Clark, Y Zhang, RJ Weiss, Y Jia, Z Chen, Y Wu arXiv preprint arXiv:1904.02882, 2019	787	2019
Voicefilter: Targeted voice separation by speaker-conditioned spectrogram masking Q Wang, H Muckenhirn, K Wilson, P Sridhar, Z Wu, J Hershey, ... arXiv preprint arXiv:1810.04826, 2018	408	2018
ASVspoof 2019: a large-scale public database of synthetized, converted and replayed speech X Wang, J Yamagishi, M Todisco, H Delgado, A Nautsch, N Evans, ... Computer Speech & Language, 101114, 2020	338	2020
Hierarchical generative modeling for controllable speech synthesis WN Hsu, Y Zhang, RJ Weiss, H Zen, Y Wu, Y Wang, Y Cao, Y Jia, Z Chen, ... arXiv preprint arXiv:1810.07217, 2018	270	2018
Improved noisy student training for automatic speech recognition DS Park, Y Zhang, Y Jia, W Han, CC Chiu, B Li, Y Wu, QV Le arXiv preprint arXiv:2005.09629, 2020	248	2020
Direct speech-to-speech translation with a sequence-to-sequence model Y Jia, RJ Weiss, F Biadsy, W Macherey, M Johnson, Z Chen, Y Wu Proc. Interspeech 2019, 1123--1127, 2019	224	2019
Lingvo: a modular and scalable framework for sequence-to-sequence modeling J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ... arXiv preprint arXiv:1902.08295, 2019	203	2019
Learning to speak fluently in a foreign language: Multilingual speech synthesis and cross-language voice cloning Y Zhang, RJ Weiss, H Zen, Y Wu, Z Chen, RJ Skerry-Ryan, Y Jia, ... arXiv preprint arXiv:1907.04448, 2019	184	2019
Leveraging weakly supervised data to improve end-to-end speech-to-text translation Y Jia, M Johnson, W Macherey, RJ Weiss, Y Cao, CC Chiu, N Ari, ... ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	173	2019
Speech recognition with augmented synthesized speech A Rosenberg, Y Zhang, B Ramabhadran, Y Jia, P Moreno, Y Wu, Z Wu 2019 IEEE automatic speech recognition and understanding workshop (ASRU …, 2019	131	2019
Parrotron: An end-to-end speech-to-speech conversion model and its applications to hearing-impaired speech and speech separation F Biadsy, RJ Weiss, PJ Moreno, D Kanevsky, Y Jia arXiv preprint arXiv:1904.04169, 2019	128	2019
Parallel tacotron: Non-autoregressive and controllable tts I Elias, H Zen, J Shen, Y Zhang, Y Jia, RJ Weiss, Y Wu ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	119	2021
mslam: Massively multilingual joint pre-training for speech and text A Bapna, C Cherry, Y Zhang, Y Jia, M Johnson, Y Cheng, S Khanuja, ... arXiv preprint arXiv:2202.01374, 2022	99	2022
Non-attentive tacotron: Robust and controllable neural tts synthesis including unsupervised duration modeling J Shen, Y Jia, M Chrzanowski, Y Zhang, I Elias, H Zen, Y Wu arXiv preprint arXiv:2010.04301, 2020	91	2020
Translatotron 2: High-quality direct speech-to-speech translation with voice preservation Y Jia, MT Ramanovich, T Remez, R Pomerantz International Conference on Machine Learning, 10120-10134, 2022	81*	2022
SLAM: A unified encoder for speech and language modeling via speech-text joint pre-training A Bapna, Y Chung, N Wu, A Gulati, Y Jia, JH Clark, M Johnson, J Riesa, ... arXiv preprint arXiv:2110.10329, 2021	79	2021
PnG BERT: Augmented BERT on phonemes and graphemes for neural TTS Y Jia, H Zen, J Shen, Y Zhang, Y Wu Proc. Interspeech 2021, 151--155, 2021	78	2021
Parallel Tacotron 2: A non-autoregressive neural TTS model with differentiable duration modeling I Elias, H Zen, J Shen, Y Zhang, Y Jia, RJ Skerry-Ryan, Y Wu arXiv preprint arXiv:2103.14574, 2021	62	2021

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors