Follow
Thomas Merritt
Thomas Merritt
Spotify
Verified email at spotify.com
Title
Cited by
Cited by
Year
Towards achieving robust universal neural vocoding
J Lorenzo-Trueba, T Drugman, J Latorre, T Merritt, B Putrycz, ...
arXiv preprint arXiv:1811.06292, 2018
1172018
From HMMs to DNNs: where do the improvements come from?
O Watts, GE Henter, T Merritt, Z Wu, S King
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
922016
Deep neural network-guided unit selection synthesis
T Merritt, RAJ Clark, Z Wu, J Yamagishi, S King
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
732016
Effect of data reduction on sequence-to-sequence neural TTS
J Latorre, J Lachowicz, J Lorenzo-Trueba, T Merritt, T Drugman, ...
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
722019
Low-resource expressive text-to-speech using data augmentation
G Huybrechts, T Merritt, G Comini, B Perz, R Shah, J Lorenzo-Trueba
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
632021
Measuring the perceptual effects of modelling assumptions in speech synthesis using stimuli constructed from repeated natural speech.
GE Henter, T Merritt, M Shannon, C Mayo, S King
Interspeech, 1504-1508, 2014
492014
In other news: A bi-style text-to-speech model for synthesizing newscaster voice with limited data
N Prateek, M Łajszczak, R Barra-Chicote, T Drugman, J Lorenzo-Trueba, ...
arXiv preprint arXiv:1904.02790, 2019
332019
Camp: a two-stage approach to modelling prosody in context
Z Hodari, A Moinet, S Karlapati, J Lorenzo-Trueba, T Merritt, A Joly, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
322021
Phrase break prediction for long-form reading TTS: Exploiting text structure information
V Klimkov, A Nadolski, A Moinet, B Putrycz, R Barra-Chicote, T Merritt, ...
312018
Non-autoregressive TTS with explicit duration modelling for low-resource highly expressive speech
R Shah, K Pokora, A Ezzerg, V Klimkov, G Huybrechts, B Putrycz, ...
arXiv preprint arXiv:2106.12896, 2021
262021
Attributing modelling errors in HMM synthesis by stepping gradually from natural to modelled speech
T Merritt, J Latorre, S King
2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015
222015
Comprehensive evaluation of statistical speech waveform synthesis
T Merritt, B Putrycz, A Nadolski, T Ye, D Korzekwa, W Dolecki, T Drugman, ...
2018 IEEE Spoken Language Technology Workshop (SLT), 325-331, 2018
202018
Creating new voices using normalizing flows
P Bilinski, T Merritt, A Ezzerg, K Pokora, S Cygert, K Yanagisawa, ...
arXiv preprint arXiv:2312.14569, 2023
192023
Text-free non-parallel many-to-many voice conversion using normalising flow
T Merritt, A Ezzerg, P Biliński, M Proszewska, K Pokora, R Barra-Chicote, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
162022
Investigating source and filter contributions, and their interaction, to statistical parametric speech synthesis.
T Merritt, T Raitio, S King
Interspeech, 1509-1513, 2014
162014
Deep neural network context embeddings for model selection in rich-context HMM synthesis.
T Merritt, J Yamagishi, Z Wu, O Watts, S King
INTERSPEECH, 2207-2211, 2015
152015
Investigating the shortcomings of HMM synthesis
T Merritt, S King
Eighth ISCA Workshop on Speech Synthesis, 2013
142013
Varying speaking styles with neural textto-speech
T Wood, T Merritt
Amazon Science, 2018
122018
A flexible front-end for HTS
M Aylett, R Dall, A Ghoshal, GE Henter, T Merritt
INTERSPEECH 2014 15th Annual Conference of the International Speech …, 2014
112014
Expressive, variable, and controllable duration modelling in TTS
A Abbas, T Merritt, A Moinet, S Karlapati, E Muszynska, S Slangen, E Gatti, ...
arXiv preprint arXiv:2206.14165, 2022
92022
The system can't perform the operation now. Try again later.
Articles 1–20