PaLM 2 Technical Report R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ... arXiv preprint arXiv:2305.10403, 2023 | 854 | 2023 |
Scaling up models and data with t5x and seqio A Roberts, HW Chung, G Mishra, A Levskaya, J Bradbury, D Andor, ... Journal of Machine Learning Research 24 (377), 1-8, 2023 | 116 | 2023 |
Gemma: Open Models Based on Gemini Research and Technology G Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ... arXiv preprint arXiv:2403.08295, 2024 | 54 | 2024 |
Neural generation meets real people: Towards emotionally engaging mixed-initiative conversations A Paranjape, A See, K Kenealy, H Li, A Hardy, P Qi, KR Sadagopan, ... arXiv preprint arXiv:2008.12348, 2020 | 48 | 2020 |
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models A Singh, JD Co-Reyes, R Agarwal, A Anand, P Patil, PJ Liu, J Harrison, ... arXiv preprint arXiv:2312.06585, 2023 | 21 | 2023 |
Transformers and pointer-generator networks for abstractive summarization J Deaton, A Jacobs, K Kenealy, A See | 8 | 2019 |
Neural Generation Meets Real People: Building a Social, Informative Open-Domain Dialogue Agent EA Chi, A Paranjape, A See, C Chiam, K Kenealy, SK Lim, A Hardy, ... arXiv preprint arXiv:2207.12021, 2022 | 7 | 2022 |
Transfer Learning for Text Diffusion Models K Han, K Kenealy, A Barua, N Fiedel, N Constant arXiv preprint arXiv:2401.17181, 2024 | 1 | 2024 |
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models A Botev, S De, SL Smith, A Fernando, GC Muraru, R Haroun, L Berrada, ... arXiv preprint arXiv:2404.07839, 2024 | | 2024 |
PaLM 2 Technical Report.(2023) R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ... arXiv preprint arXiv:2305.10403, 2023 | | 2023 |