Takaaki Hori

Cited by

	All	Since 2019
Citations	10206	8180
h-index	44	39
i10-index	116	81

1900

950

475

1425

20042005200620072008200920102011201220132014201520162017201820192020202120222023202432 38 51 47 55 70 58 100 130 133 101 134 144 291 570 950 1350 1814 1766 1843 451

Public access

View all

4 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Shinji WatanabeCarnegie Mellon UniversityVerified email at cmu.edu
Jonathan Le RouxMERLVerified email at merl.com
John HersheyGoogle (formerly MERL, IBM, MSR, UCSD)Verified email at google.com
Atsushi NakamuraGraduate School of Natural Sciences, Nagoya City UniversityVerified email at ieee.org
Chiori HoriMERLVerified email at merl.com
Tomoki HayashiHuman Dataware Lab. Co., Ltd., Nagoya UniversityVerified email at g.sp.m.is.nagoya-u.ac.jp
Oba TakanobuManager, Service Innovation Depertment, NTT Docomo Inc.Verified email at nttdocomo.com
Tomohiro NakataniNTT Communication Science LaboratoriesVerified email at ieee.org
Masakiyo FujimotoSenior researcher, National Institute of Information and Communications TechnologyVerified email at nict.go.jp
Yotaro KuboGoogle SpeechVerified email at ieee.org
Takuya YoshiokaAssemblyAIVerified email at assemblyai.com
Keisuke KinoshitaResearch Scientist at GoogleVerified email at ieee.org
Shoko ArakiNTT Communication Science LaboratoriesVerified email at ieee.org
Marc DelcroixNTT Communication Science LaboratoriesVerified email at ieee.org
Miquel EspiApple Inc.Verified email at apple.com
James GlassMIT Computer Science and Artificial Intelligence LaboratoryVerified email at mit.edu
Akinori ItoTohoku UniversityVerified email at spcom.ecei.tohoku.ac.jp
Timothy J. HazenLinkedInVerified email at alum.mit.edu
Sadaoki FuruiToyota Technological Institute at ChicagoVerified email at ttic.edu

Takaaki Hori

Apple

Verified email at apple.com

Speech Recognition Spoken Language Processing Machine Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Espnet: End-to-end speech processing toolkit S Watanabe, T Hori, S Karita, T Hayashi, J Nishitoba, Y Unno, NEY Soplin, ... arXiv preprint arXiv:1804.00015, 2018	1455	2018
Joint CTC-attention based end-to-end speech recognition using multi-task learning S Kim, T Hori, S Watanabe 2017 IEEE international conference on acoustics, speech and signal …, 2017	1014	2017
Hybrid CTC/attention architecture for end-to-end speech recognition S Watanabe, T Hori, S Kim, JR Hershey, T Hayashi IEEE Journal of Selected Topics in Signal Processing 11 (8), 1240-1253, 2017	819	2017
A comparative study on transformer vs rnn in speech applications S Karita, N Chen, T Hayashi, T Hori, H Inaguma, Z Jiang, M Someki, ... 2019 IEEE automatic speech recognition and understanding workshop (ASRU …, 2019	750	2019
Attention-based multimodal fusion for video description C Hori, T Hori, TY Lee, Z Zhang, B Harsham, JR Hershey, TK Marks, ... Proceedings of the IEEE international conference on computer vision, 4193-4202, 2017	401	2017
Advances in joint CTC-attention based end-to-end speech recognition with a deep CNN encoder and RNN-LM T Hori, S Watanabe, Y Zhang, W Chan arXiv preprint arXiv:1706.02737, 2017	344	2017
Streaming automatic speech recognition with the transformer model N Moritz, T Hori, J Le ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	198	2020
Efficient WFST-based one-pass decoding with on-the-fly hypothesis rescoring in extremely large vocabulary continuous speech recognition T Hori, C Hori, Y Minami, A Nakamura IEEE Transactions on audio, speech, and language processing 15 (4), 1352-1365, 2007	186	2007
Language independent end-to-end architecture for joint language identification and speech recognition S Watanabe, T Hori, JR Hershey 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017	167	2017
Triggered attention for end-to-end speech recognition N Moritz, T Hori, J Le Roux ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	138	2019
End-to-end audio visual scene-aware dialog using multimodal attention-based video features C Hori, H Alamri, J Wang, G Wichern, T Hori, A Cherian, TK Marks, ... ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	134	2019
Multilingual sequence-to-sequence speech recognition: architecture, transfer learning, and language modeling J Cho, MK Baskar, R Li, M Wiesner, SH Mallidi, N Yalta, M Karafiat, ... 2018 IEEE Spoken Language Technology Workshop (SLT), 521-527, 2018	134	2018
End-to-end speech recognition with word-based RNN language models T Hori, J Cho, S Watanabe 2018 IEEE Spoken Language Technology Workshop (SLT), 389-396, 2018	134	2018
Joint CTC/attention decoding for end-to-end speech recognition T Hori, S Watanabe, JR Hershey Proceedings of the 55th Annual Meeting of the Association for Computational …, 2017	132	2017
Linear prediction-based dereverberation with advanced speech enhancement and recognition technologies for the REVERB challenge M Delcroix, T Yoshioka, A Ogawa, Y Kubo, M Fujimoto, N Ito, K Kinoshita, ... Reverb workshop, 2014	126	2014
Open-vocabulary spoken utterance retrieval using confusion networks T Hori, IL Hetherington, TJ Hazen, JR Glass 2007 IEEE International Conference on Acoustics, Speech and Signal …, 2007	121	2007
Multichannel end-to-end speech recognition T Ochiai, S Watanabe, T Hori, JR Hershey International conference on machine learning, 2632-2641, 2017	120	2017
Back-translation-style data augmentation for end-to-end ASR T Hayashi, S Watanabe, Y Zhang, T Toda, T Hori, R Astudillo, K Takeda 2018 IEEE Spoken Language Technology Workshop (SLT), 426-433, 2018	117	2018
Duration-controlled LSTM for polyphonic sound event detection T Hayashi, S Watanabe, T Toda, T Hori, J Le Roux, K Takeda IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (11 …, 2017	106	2017
Low-latency real-time meeting recognition and understanding using distant microphones and omni-directional camera T Hori, S Araki, T Yoshioka, M Fujimoto, S Watanabe, T Oba, A Ogawa, ... IEEE transactions on audio, speech, and language processing 20 (2), 499-513, 2011	106	2011

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors