Ms marco: A human generated machine reading comprehension dataset DF Campos, T Nguyen, M Rosenberg, X Song, J Gao, S Tiwary, ... ArXiv, abs/1611.09268 29, 2016 | 2317* | 2016 |
Overview of the TREC 2019 deep learning track N Craswell, B Mitra, E Yilmaz, D Campos, EM Voorhees arXiv preprint arXiv:2003.07820, 2020 | 543 | 2020 |
XGLUE: A new benchmark dataset for cross-lingual pre-training, understanding and generation Y Liang, N Duan, Y Gong, N Wu, F Guo, W Qi, M Gong, L Shou, D Jiang, ... arXiv preprint arXiv:2004.01401, 2020 | 302 | 2020 |
The optimal bert surgeon: Scalable and accurate second-order pruning for large language models E Kurtic, D Campos, T Nguyen, E Frantar, M Kurtz, B Fineran, M Goin, ... arXiv preprint arXiv:2203.07259, 2022 | 102 | 2022 |
Leading conversational search by suggesting useful questions C Rosset, C Xiong, X Song, D Campos, N Craswell, S Tiwary, P Bennett Proceedings of the web conference 2020, 1160-1170, 2020 | 95 | 2020 |
Ms marco: A human generated machine reading comprehension dataset DF Campos, T Nguyen, M Rosenberg, X Song, J Gao, S Tiwary, ... ArXiv, abs/1611.09268 29, 2016 | 95 | 2016 |
Orcas: 18 million clicked query-document pairs for analyzing search N Craswell, D Campos, B Mitra, E Yilmaz, B Billerbeck Proceedings of the 29th ACM International Conference on Information …, 2020 | 83 | 2020 |
Open domain web keyphrase extraction beyond language modeling L Xiong, C Hu, C Xiong, D Campos, A Overwijk arXiv preprint arXiv:1911.02671, 2019 | 68 | 2019 |
Ms marco: Benchmarking ranking models in the large-data regime N Craswell, B Mitra, E Yilmaz, D Campos, J Lin Proceedings of the 44th international ACM SIGIR conference on research and …, 2021 | 67 | 2021 |
TREC deep learning track: Reusable test collections in the large data regime N Craswell, B Mitra, E Yilmaz, D Campos, EM Voorhees, I Soboroff Proceedings of the 44th international ACM SIGIR conference on research and …, 2021 | 45 | 2021 |
Overview of the TREC 2019 deep learning track. CoRR abs/2003.07820 (2020) N Craswell, B Mitra, E Yilmaz, D Campos, EM Voorhees | 30 | 2020 |
Curriculum learning for language modeling D Campos arXiv preprint arXiv:2108.02170, 2021 | 25 | 2021 |
Significant improvements over the state of the art? a case study of the ms marco document ranking leaderboard J Lin, D Campos, N Craswell, B Mitra, E Yilmaz Proceedings of the 44th international ACM SIGIR conference on research and …, 2021 | 23 | 2021 |
Overview of the TREC 2020 deep learning track. CoRR abs/2102.07662 (2021) N Craswell, B Mitra, E Yilmaz, D Campos arXiv preprint arXiv:2102.07662, 2021 | 23 | 2021 |
On the reliability of test collections for evaluating systems of different types E Yilmaz, N Craswell, B Mitra, D Campos proceedings of the 43rd International ACM SIGIR Conference on Research and …, 2020 | 21 | 2020 |
Reading COmprehension Dataset P Bajaj, D Campos, N Craswell, L Deng, J Gao, X Liu, R Majumder, ... | 18 | 2016 |
IMG2SMI: translating molecular structure images to simplified molecular-input line-entry system D Campos, H Ji arXiv preprint arXiv:2109.04202, 2021 | 11 | 2021 |
Arctic-Embed: Scalable, Efficient, and Accurate Text Embedding Models L Merrick, D Xu, G Nuti, D Campos arXiv preprint arXiv:2405.05374, 2024 | 10 | 2024 |
Fostering coopetition while plugging leaks: The design and implementation of the MS MARCO leaderboards J Lin, D Campos, N Craswell, B Mitra, E Yilmaz Proceedings of the 45th international ACM SIGIR conference on research and …, 2022 | 8 | 2022 |
Sparse* BERT: sparse models generalize to new tasks and domains D Campos, A Marques, T Nguyen, M Kurtz, CX Zhai arXiv preprint arXiv:2205.12452, 2022 | 8* | 2022 |