Orca: Progressive learning from complex explanation traces of GPT-4 | S Mukherjee, A Mitra, G Jawahar, S Agarwal, H Palangi, A Awadallah | arXiv preprint arXiv:2306.02707, 2023 | 139 | 2023 |
AdaMix: Mixture-of-adaptations for parameter-efficient model tuning | Y Wang, S Agarwal, S Mukherjee, X Liu, J Gao, AH Awadallah, J Gao | arXiv preprint arXiv:2205.12410, 2022 | 49 | 2022 |
Orca 2: Teaching small language models how to reason | A Mitra, L Del Corro, S Mahajan, A Codas, C Simoes, S Agarwal, X Chen, ... | arXiv preprint arXiv:2311.11045, 2023 | 46 | 2023 |
SkipDecode: Autoregressive skip decoding with batching and caching for efficient LLM inference | L Del Corro, A Del Giorno, S Agarwal, B Yu, A Awadallah, S Mukherjee | arXiv preprint arXiv:2307.02628, 2023 | 23 | 2023 |