Shivam Mehta

Cited by

	All	Since 2019
Citations	111	110
h-index	7	6
i10-index	6	6

20182019202020212022202320241 1 3 10 33 63

Public access

View all

10 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Gustav Eje HenterKTH Royal Institute of Technology, Stockholm, SwedenVerified email at kth.se
Éva SzékelyAssistant Professor, KTH Royal Institute of TechnologyVerified email at kth.se
Jonas BeskowProfessor, KTH Speech, Music and HearingVerified email at kth.se
Harm LamerisKTHVerified email at kth.se
Simon AlexandersonKTH Royal Institute of TechnologyVerified email at kth.se
Ambika KirklandKTH, Stockholm UniversityVerified email at ling.su.se
joakim gustafsonProfessor in Speech Technology, KTHVerified email at speech.kth.se
Ruibo TuKTH Royal Institute of TechnologyVerified email at kth.se
Anna DeichlerKTH Royal Institute of TechnologyVerified email at kth.se
Birger MoëllPhD Student in Machine Learning, KTH Royal Institute of TechnologyVerified email at kth.se
Jim O'ReganKTH Royal Institute of TechnologyVerified email at tcd.ie
Siyang WangPhD Student, KTH Royal Institute of TechnologyVerified email at kth.se
Dr. Gaurav RajSharda UniversityVerified email at sharda.ac.in
Ivan SmetannikovITMO UniversityVerified email at niuitmo.ru
Olov EngwallProfessor in Speech Communication, KTH (Royal Institute of Technology); Stockholm, SwedenVerified email at kth.se
Agnes AxelssonPostdoctoral researcher, department of Intelligent Systems, Delft University of TechnologyVerified email at kth.se
Ronald CumbalUppsala UniversityVerified email at it.uu.se
Rajiv PunmiyaVerified email at catholic.ac.kr

Shivam Mehta

KTH Royal Institute of Technology & WASP AI

Verified email at kth.se - Homepage

Probabilistic Machine Learning Deep Learning Speech Synthesis Generative Models


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Neural HMMs are all you need (for high-quality attention-free TTS) S Mehta, É Székely, J Beskow, GE Henter ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	22	2022
Prosody-controllable spontaneous TTS with neural HMMs H Lameris, S Mehta, GE Henter, J Gustafson, É Székely ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	17	2023
OverFlow: Putting flows on top of neural transducers for better TTS S Mehta, A Kirkland, H Lameris, J Beskow, É Székely, GE Henter Proceedings of INTERSPEECH 2023, 4279--4283, 2023	14	2023
Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis S Mehta, S Wang, S Alexanderson, J Beskow, É Székely, GE Henter Proc. 12th ISCA Speech Synthesis Workshop (SSW2023), 150--156, 2023	14	2023
Matcha-TTS: A fast TTS architecture with conditional flow matching S Mehta, R Tu, J Beskow, É Székely, GE Henter arXiv preprint arXiv:2309.03199, 2023	12	2023
Diffusion-based co-speech gesture generation using joint text and audio representation A Deichler, S Mehta, S Alexanderson, J Beskow Proceedings of the 25th International Conference on Multimodal Interaction …, 2023	10	2023
Penetration testing as a test phase in web service testing a black box pen testing approach S Mehta, G Raj, D Singh Smart Computing and Informatics: Proceedings of the First International …, 2018	7	2018
Stuck in the MOS pit: A critical analysis of MOS test methodology in TTS evaluation A Kirkland, S Mehta, H Lameris, GE Henter, E Székely, J Gustafson 12th Speech Synthesis Workshop (SSW) 2023, 2023	6	2023
Speech data augmentation for improving phoneme transcriptions of aphasic speech using wav2vec 2.0 for the psst challenge B Moëll, J O'Regan, S Mehta, A Kirkland, H Lameris, J Gustafsson, ... 13th Language Resources and Evaluation Conference (LREC), 62-70, 2022	4	2022
Unified speech and gesture synthesis using flow matching S Mehta, R Tu, S Alexanderson, J Beskow, É Székely, GE Henter ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	3	2024
Stereotypical nationality representations in HRI: perspectives from international young adults R Cumbal, A Axelsson, S Mehta, O Engwall Frontiers in Robotics and AI 10, 1264614, 2023	1	2023
Finding the Blank with Sequence Labeling for English Learning S Mehta, I Smetannikov Proceedings of the 2020 1st International Conference on Control, Robotics …, 2020	1	2020
Should you use a probabilistic duration model in TTS? Probably! Especially for spontaneous speech S Mehta, H Lameris, R Punmiya, J Beskow, É Székely, GE Henter arXiv preprint arXiv:2406.05401, 2024		2024
Fake it to make it: Using synthetic data to remedy the data shortage in joint multimodal speech-and-gesture synthesis S Mehta, A Deichler, J O'regan, B Moëll, J Beskow, GE Henter, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024		2024
Learning fast with fewer data samples using Neural HMMs S Mehta, H Lameris, É Székely, J Beskow, GE Henter		2022
Spontaneous Neural HMM TTS with Prosodic Feature Modification H Lameris, S Mehta, GE Henter, A Kirkland, B Moëll, J O’Regan, ... Fonetik 2022, Stockholm 13-15 May, 202, 2022		2022

The system can't perform the operation now. Try again later.

Articles 1–16

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors