Kenichi Kumatani
Title
Cited by
Cited by
Year
Microphone array processing for distant speech recognition: From close-talking microphones to far-field sensors
K Kumatani, J McDonough, B Raj
IEEE Signal Processing Magazine 29 (6), 127-140, 2012
1332012
Beamforming with a maximum negentropy criterion
K Kumatani, J McDonough, B Rauch, D Klakow, PN Garner, W Li
IEEE Transactions on audio, speech, and language processing 17 (5), 994-1008, 2009
83*2009
Generation of wake-up words
OA Bapat, K Kumatani
US Patent 9,373,321, 2016
562016
Adaptive beamforming with a minimum mutual information criterion
K Kumatani, T Gehrig, U Mayer, E Stoimenov, J McDonough, M Wolfel
Audio, Speech, and Language Processing, IEEE Transactions on 15 (8), 2527-2541, 2007
50*2007
Microphone array processing for distant speech recognition: Towards real-world deployment
K Kumatani, T Arakawa, K Yamamoto, J McDonough, B Raj, R Singh, ...
Proceedings of The 2012 Asia Pacific Signal and Information Processing …, 2012
492012
Channel selection based on multichannel cross-correlation coefficients for distant speech recognition
K Kumatani, J McDonough, JF Lehman, B Raj
2011 Joint Workshop on Hands-free Speech Communication and Microphone Arrays …, 2011
492011
Filter bank design based on minimization of individual aliasing terms for minimum mutual information subband adaptive beamforming
K Kumatani, J McDonough, S Schacht, D Klakow, PN Garner, W Li
2008 IEEE International Conference on Acoustics, Speech and Signal …, 2008
452008
Direct modeling of raw audio with dnns for wake word detection
K Kumatani, S Panchapagesan, M Wu, M Kim, N Strom, G Tiwari, ...
2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017
362017
To separate speech: A system for recognizing simultaneous speech
J McDonough, K Kumatani, T Gehrig, E Stoimenov, U Mayer, S Schacht, ...
Proceedings of the 4th international conference on Machine learning for …, 2007
31*2007
Improving noise robustness of automatic speech recognition via parallel data and teacher-student learning
L Mošner, M Wu, A Raju, SHK Parthasarathi, K Kumatani, S Sundaram, ...
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
292019
Advances in lecture recognition: The isl rt-06s evaluation system
C Fügen, M Wölfel, JW McDonough, S Ikbal, F Kraft, K Laskowski, ...
Ninth International Conference on Spoken Language Processing, 2006
262006
Time-delayed bottleneck highway networks using a dft feature for keyword spotting
J Guo, K Kumatani, M Sun, M Wu, A Raju, N Ström, A Mandal
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
252018
Adaptive beamforming with a maximum negentropy criterion
K Kumatani, J McDonough, D Klakow, PN Garner, W Li
Hands-Free Speech Communication and Microphone Arrays, 2008. HSCMA 2008, 180-183, 2008
232008
Improving hands-free speech recognition in a car through audio-visual voice activity detection
F Faubel, M Georges, K Kumatani, A Bruhn, D Klakow
2011 Joint Workshop on Hands-free Speech Communication and Microphone Arrays …, 2011
202011
Frequency domain multi-channel acoustic modeling for distant speech recognition
W Minhua, K Kumatani, S Sundaram, N Ström, B Hoffmeister
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
19*2019
Maximum kurtosis beamforming with the generalized sidelobe canceller
K Kumatani, J McDonough, B Rauch, PN Garner, W Li, J Dines
Ninth Annual Conference of the International Speech Communication Association, 2008
192008
Multi-modal temporal asynchronicity modeling by product HMMs for robust audio-visual speech recognition
S Nakamura, K Kumatani, S Tamura
Proceedings. Fourth IEEE International Conference on Multimodal Interfaces …, 2002
182002
Maximum negentropy beamforming with superdirectivity
K Kumatani, L Lu, J McDonough, A Ghoshal, D Klakow
2010 18th European Signal Processing Conference, 2067-2071, 2010
152010
Microphone array post-filter based on spatially-correlated noise measurements for distant speech recognition
K Kumatani, B Raj, R Singh, J McDonough
Thirteenth Annual Conference of the International Speech Communication …, 2012
132012
The ISL RT-06S speech-to-text system
C Fügen, S Ikbal, F Kraft, K Kumatani, K Laskowski, J McDonough, ...
Machine Learning for Multimodal Interaction, 407-418, 2006
132006
The system can't perform the operation now. Try again later.
Articles 1–20