MS MARCO: A human generated machine reading comprehension dataset T Nguyen, M Rosenberg, X Song, J Gao, S Tiwary, R Majumder, L Deng choice 2640, 660, 2016 | 1160 | 2016 |
Ms marco: A human generated machine reading comprehension dataset P Bajaj, D Campos, N Craswell, L Deng, J Gao, X Liu, R Majumder, ... arXiv preprint arXiv:1611.09268, 2016 | 452 | 2016 |
Using deepspeed and megatron to train megatron-turing nlg 530b, a large-scale generative language model S Smith, M Patwary, B Norick, P LeGresley, S Rajbhandari, J Casper, ... arXiv preprint arXiv:2201.11990, 2022 | 242 | 2022 |
InfoXLM: An information-theoretic framework for cross-lingual language model pre-training Z Chi, L Dong, F Wei, N Yang, S Singhal, W Wang, X Song, XL Mao, ... arXiv preprint arXiv:2007.07834, 2020 | 211 | 2020 |
Coco-lm: Correcting and contrasting text sequences for language model pretraining Y Meng, C Xiong, P Bajaj, P Bennett, J Han, X Song Advances in Neural Information Processing Systems 34, 23102-23114, 2021 | 120 | 2021 |
Transformer-xh: Multi-evidence reasoning with extra hop attention C Zhao, C Xiong, C Rosset, X Song, P Bennett, S Tiwary | 97 | 2020 |
Neural ranking models with multiple document fields H Zamani, B Mitra, X Song, N Craswell, S Tiwary Proceedings of the eleventh ACM international conference on web search and …, 2018 | 84 | 2018 |
Xlm-e: Cross-lingual language model pre-training via electra Z Chi, S Huang, L Dong, S Ma, B Zheng, S Singhal, P Bajaj, X Song, ... arXiv preprint arXiv:2106.16138, 2021 | 67 | 2021 |
Leading conversational search by suggesting useful questions C Rosset, C Xiong, X Song, D Campos, N Craswell, S Tiwary, P Bennett Proceedings of the web conference 2020, 1160-1170, 2020 | 66 | 2020 |
Ms marco: A human generated machine reading comprehension dataset DF Campos, T Nguyen, M Rosenberg, X Song, J Gao, S Tiwary, ... ArXiv, abs/1611.09268, 2016 | 57 | 2016 |
Pushing the limits of narrow precision inferencing at cloud scale with microsoft floating point B Darvish Rouhani, D Lo, R Zhao, M Liu, J Fowers, K Ovtcharov, ... Advances in neural information processing systems 33, 10271-10281, 2020 | 53 | 2020 |
Generic intent representation in web search H Zhang, X Song, C Xiong, C Rosset, PN Bennett, N Craswell, S Tiwary Proceedings of the 42nd International ACM SIGIR Conference on Research and …, 2019 | 47 | 2019 |
Knowledge-aware language model pretraining C Rosset, C Xiong, M Phan, X Song, P Bennett, S Tiwary arXiv preprint arXiv:2007.00655, 2020 | 46 | 2020 |
Deltalm: Encoder-decoder pre-training for language generation and translation by augmenting pretrained multilingual encoders S Ma, L Dong, S Huang, D Zhang, A Muzio, S Singhal, HH Awadalla, ... arXiv preprint arXiv:2106.13736, 2021 | 43 | 2021 |
Language is not all you need: Aligning perception with language models S Huang, L Dong, W Wang, Y Hao, S Singhal, S Ma, T Lv, L Cui, ... arXiv preprint arXiv:2302.14045, 2023 | 41 | 2023 |
Ms marco: A human-generated machine reading comprehension dataset.(2016) T Nguyen, M Rosenberg, X Song, J Gao, S Tiwary, R Majumder, L Deng arXiv preprint arXiv:1611.09268, 2016 | 28 | 2016 |
Consistency regularization for cross-lingual fine-tuning B Zheng, L Dong, S Huang, W Wang, Z Chi, S Singhal, W Che, T Liu, ... arXiv preprint arXiv:2106.08226, 2021 | 27 | 2021 |
An axiomatic approach to regularizing neural ranking models C Rosset, B Mitra, C Xiong, N Craswell, X Song, S Tiwary Proceedings of the 42nd international ACM SIGIR conference on research and …, 2019 | 24 | 2019 |
Multilingual machine translation systems from Microsoft for WMT21 shared task J Yang, S Ma, H Huang, D Zhang, L Dong, S Huang, A Muzio, S Singhal, ... arXiv preprint arXiv:2111.02086, 2021 | 22 | 2021 |
MS MARCO: A human generated machine reading comprehension dataset. CoRR abs/1611.09268 (2016) T Nguyen, M Rosenberg, X Song, J Gao, S Tiwary, R Majumder, L Deng arXiv preprint arXiv:1611.09268, 2016 | 19 | 2016 |