Jing Shi
Cited by
Cited by
Recent developments on espnet toolkit boosted by conformer
P Guo, F Boyer, X Chang, T Hayashi, Y Higuchi, H Inaguma, N Kamo, C Li, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Vlp: A survey on vision-language pre-training
FL Chen, DZ Zhang, ML Han, XY Chen, J Shi, S Xu, B Xu
Machine Intelligence Research 20 (1), 38-56, 2023
An exploration of self-supervised pretrained representations for end-to-end speech recognition
X Chang, T Maekaku, P Guo, J Shi, YJ Lu, AS Subramanian, T Wang, ...
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
ESPnet-SE: End-to-end speech enhancement and separation toolkit designed for ASR integration
C Li, J Shi, W Zhang, AS Subramanian, X Chang, N Kamo, M Hira, ...
2021 IEEE Spoken Language Technology Workshop (SLT), 785-792, 2021
X-llm: Bootstrapping advanced large language models by treating multi-modalities as foreign languages
F Chen, M Han, H Zhao, Q Zhang, J Shi, S Xu, B Xu
arXiv preprint arXiv:2305.04160, 2023
The 2020 espnet update: new features, broadened applications, performance improvements, and future plans
S Watanabe, F Boyer, X Chang, P Guo, T Hayashi, Y Higuchi, T Hori, ...
2021 IEEE Data Science and Learning Workshop (DSLW), 1-6, 2021
Neural speaker diarization with speaker-wise chain rule
Y Fujita, S Watanabe, S Horiguchi, Y Xue, J Shi, K Nagamatsu
arXiv preprint arXiv:2006.01796, 2020
Speaker-conditional chain model for speech separation and extraction
J Shi, J Xu, Y Fujita, S Watanabe, B Xu
arXiv preprint arXiv:2006.14149, 2020
Modeling attention and memory for auditory selection in a cocktail party environment
J Xu, J Shi, G Liu, X Chen, B Xu
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
Listen, Think and Listen Again: Capturing Top-down Auditory Attention for Speaker-independent Speech Separation.
J Shi, J Xu, G Liu, B Xu
IJCAI, 4353-4360, 2018
Distilled binary neural network for monaural speech separation
X Chen, G Liu, J Shi, J Xu, B Xu
2018 International Joint Conference on Neural Networks (IJCNN), 1-8, 2018
Sequence to multi-sequence learning via conditional chain mapping for mixture signals
J Shi, X Chang, P Guo, S Watanabe, Y Fujita, J Xu, B Xu, L Xie
Advances in Neural Information Processing Systems 33, 3735-3747, 2020
Closing the gap between time-domain multi-channel speech enhancement on real and simulation conditions
W Zhang, J Shi, C Li, S Watanabe, Y Qian
2021 IEEE Workshop on Applications of Signal Processing to Audio and …, 2021
Discretization and re-synthesis: an alternative method to solve the cocktail party problem
J Shi, X Chang, T Hayashi, YJ Lu, S Watanabe, B Xu
arXiv preprint arXiv:2112.09382, 2021
Ensemble of feature sets and classification methods for stance detection
J Xu, S Zheng, J Shi, Y Yao, B Xu
Natural Language Understanding and Intelligent Applications: 5th CCF …, 2016
A Unified Framework for Low-Latency Speaker Extraction in Cocktail Party Environments.
Y Hao, J Xu, J Shi, P Zhang, L Qin, B Xu
Interspeech, 1431-1435, 2020
Hierarchical memory networks for answer selection on unknown words
J Xu, J Shi, Y Yao, S Zheng, B Xu
arXiv preprint arXiv:1609.08843, 2016
Train from scratch: Single-stage joint training of speech separation and recognition
J Shi, X Chang, S Watanabe, B Xu
Computer Speech & Language 76, 101387, 2022
Training noisy single-channel speech separation with noisy oracle sources: A large gap and a small step
M Maciejewski, J Shi, S Watanabe, S Khudanpur
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Unsupervised and pseudo-supervised vision-language alignment in visual dialog
F Chen, D Zhang, X Chen, J Shi, S Xu, B Xu
Proceedings of the 30th ACM International Conference on Multimedia, 4142-4153, 2022
The system can't perform the operation now. Try again later.
Articles 1–20