Follow
Thomas Merritt
Thomas Merritt
Spotify
Verified email at spotify.com
Title
Cited by
Cited by
Year
Towards achieving robust universal neural vocoding
J Lorenzo-Trueba, T Drugman, J Latorre, T Merritt, B Putrycz, ...
arXiv preprint arXiv:1811.06292, 2018
1232018
From HMMs to DNNs: where do the improvements come from?
O Watts, GE Henter, T Merritt, Z Wu, S King
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
932016
Effect of data reduction on sequence-to-sequence neural TTS
J Latorre, J Lachowicz, J Lorenzo-Trueba, T Merritt, T Drugman, ...
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
792019
Deep neural network-guided unit selection synthesis
T Merritt, RAJ Clark, Z Wu, J Yamagishi, S King
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
762016
Low-resource expressive text-to-speech using data augmentation
G Huybrechts, T Merritt, G Comini, B Perz, R Shah, J Lorenzo-Trueba
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
702021
Measuring the perceptual effects of modelling assumptions in speech synthesis using stimuli constructed from repeated natural speech
GE Henter, T Merritt, M Shannon, C Mayo, S King
INTERSPEECH 2014 15th Annual Conference of the International Speech …, 2014
522014
Camp: a two-stage approach to modelling prosody in context
Z Hodari, A Moinet, S Karlapati, J Lorenzo-Trueba, T Merritt, A Joly, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
352021
In other news: A bi-style text-to-speech model for synthesizing newscaster voice with limited data
N Prateek, M Łajszczak, R Barra-Chicote, T Drugman, J Lorenzo-Trueba, ...
arXiv preprint arXiv:1904.02790, 2019
342019
Phrase break prediction for long-form reading TTS: Exploiting text structure information
V Klimkov, A Nadolski, A Moinet, B Putrycz, R Barra-Chicote, T Merritt, ...
332018
Non-autoregressive TTS with explicit duration modelling for low-resource highly expressive speech
R Shah, K Pokora, A Ezzerg, V Klimkov, G Huybrechts, B Putrycz, ...
arXiv preprint arXiv:2106.12896, 2021
282021
Comprehensive evaluation of statistical speech waveform synthesis
T Merritt, B Putrycz, A Nadolski, T Ye, D Korzekwa, W Dolecki, T Drugman, ...
2018 IEEE Spoken Language Technology Workshop (SLT), 325-331, 2018
222018
Attributing modelling errors in HMM synthesis by stepping gradually from natural to modelled speech
T Merritt, J Latorre, S King
2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015
222015
Creating new voices using normalizing flows
P Bilinski, T Merritt, A Ezzerg, K Pokora, S Cygert, K Yanagisawa, ...
arXiv preprint arXiv:2312.14569, 2023
212023
Text-free non-parallel many-to-many voice conversion using normalising flow
T Merritt, A Ezzerg, P Biliński, M Proszewska, K Pokora, R Barra-Chicote, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
192022
The CSTR entry to the Blizzard Challenge 2016
T Merritt, S Ronanki, Z Wu, O Watts
Blizzard Challenge 2016, 2016
182016
Deep neural network context embeddings for model selection in rich-context HMM synthesis
T Merritt, J Yamagishi, Z Wu, O Watts, S King
Interspeech 2015, 2015
152015
Investigating source and filter contributions, and their interaction, to statistical parametric speech synthesis.
T Merritt, T Raitio, S King
Interspeech, 1509-1513, 2014
152014
Investigating the shortcomings of HMM synthesis
T Merritt, S King
Proceedings of 8th ISCA Speech Synthesis Workshop, 185-190, 2013
142013
Varying speaking styles with neural textto-speech
T Wood, T Merritt
Alexa Blogs, Nov 19, 2018
122018
Expressive, variable, and controllable duration modelling in TTS
A Abbas, T Merritt, A Moinet, S Karlapati, E Muszynska, S Slangen, E Gatti, ...
arXiv preprint arXiv:2206.14165, 2022
112022
The system can't perform the operation now. Try again later.
Articles 1–20