Aidan Gomez

Cited by

	All	Since 2019
Citations	124091	122039
h-index	16	16
i10-index	20	20

43000

21500

10750

32250

20182019202020212022202320241443 5567 11096 18834 28730 42893 14917

Public access

View all

3 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Noam ShazeerCharacter.aiVerified email at character.ai
Jakob UszkoreitInceptiveVerified email at uszkoreit.net
Łukasz KaiserOpenAI & CNRSVerified email at openai.com
Yarin GalAssociate Professor, University of OxfordVerified email at cs.ox.ac.uk
Pascal NotinDepartment of Computer Science, University of OxfordVerified email at cs.ox.ac.uk
Ivan ZhangFOR.aiVerified email at ivanzhang.ca
Siddhartha Rao KamalakaraCohereVerified email at for.ai
Roger GrosseAssociate Professor, University of TorontoVerified email at cs.toronto.edu
Geoffrey HintonEmeritus Prof. Computer Science, University of TorontoVerified email at cs.toronto.edu
Bharat VenkiteshMachine Learning at CohereVerified email at cohere.ai
Jeff DeanGoogle Chief Scientist, Google Research and Google DeepMindVerified email at google.com
Nicholas Frosstcofounder of cohere.ai

Aidan Gomez

Cohere

Verified email at cohere.ai - Homepage

Artificial Intelligence Deep Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Attention is all you need A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ... Advances in neural information processing systems 30, 2017	120946	2017
Tensor2tensor for neural machine translation A Vaswani, S Bengio, E Brevdo, F Chollet, AN Gomez, S Gouws, L Jones, ... arXiv preprint arXiv:1803.07416, 2018	611	2018
The reversible residual network: Backpropagation without storing activations AN Gomez, M Ren, R Urtasun, RB Grosse Advances in neural information processing systems 30, 2017	543	2017
Disease variant prediction with deep generative models of evolutionary data J Frazer, P Notin, M Dias, A Gomez, JK Min, K Brock, Y Gal, DS Marks Nature 599 (7883), 91-95, 2021	390	2021
One model to learn them all L Kaiser, AN Gomez, N Shazeer, A Vaswani, N Parmar, L Jones, ... arXiv preprint arXiv:1706.05137, 2017	381	2017
Depthwise Separable Convolutions for Neural Machine Translation L Kaiser, AN Gomez, F Chollet International Conference on Learning Representations, 2018	350	2018
A systematic comparison of bayesian deep learning robustness in diabetic retinopathy tasks A Filos, S Farquhar, AN Gomez, TGJ Rudner, Z Kenton, L Smith, ... arXiv preprint arXiv:1912.10481, 2019	128*	2019
Tranception: protein fitness prediction with autoregressive transformers and inference-time retrieval P Notin, M Dias, J Frazer, JM Hurtado, AN Gomez, D Marks, Y Gal International Conference on Machine Learning, 16990-17017, 2022	118	2022
Learning Sparse Networks Using Targeted Dropout AN Gomez, I Zhang, S Rao Kamalakara, D Madaan, K Swersky, Y Gal, ... arXiv preprint arXiv:1905.13678, 2019	116	2019
The difficulty of training sparse neural networks U Evci, F Pedregosa, A Gomez, E Elsen arXiv preprint arXiv:1906.10732, 2019	91	2019
Self-attention between datapoints: Going beyond individual input-output pairs in deep learning J Kossen, N Band, C Lyle, AN Gomez, T Rainforth, Y Gal Advances in Neural Information Processing Systems 34, 28742-28756, 2021	87	2021
Prioritized training on points that are learnable, worth learning, and not yet learnt S Mindermann, JM Brauner, MT Razzak, M Sharma, A Kirsch, W Xu, ... International Conference on Machine Learning, 15630-15649, 2022	84	2022
Unsupervised cipher cracking using discrete GANs AN Gomez, S Huang, I Zhang, BM Li, M Osama, L Kaiser arXiv preprint arXiv:1801.04883, 2018	79	2018
Wat zei je? detecting out-of-distribution translations with variational transformers TZ Xiao, AN Gomez, Y Gal arXiv preprint arXiv:2006.08344, 2020	34*	2020
Targeted dropout AN Gomez, I Zhang, K Swersky, Y Gal, GE Hinton	33	2018
Attention-based sequence transduction neural networks NM Shazeer, AN Gomez, LM Kaiser, JD Uszkoreit, LO Jones, NJ Parmar, ... US Patent 10,452,978, 2019	31	2019
Interlocking backpropagation: Improving depthwise model-parallelism AN Gomez, O Key, K Perlin, S Gou, N Frosst, J Dean, Y Gal Journal of Machine Learning Research 23 (171), 1-28, 2022	16	2022
Depthwise separable convolutions for neural machine translation AN Gomez, LM Kaiser, F Chollet US Patent 10,853,590, 2020	12	2020
Robustness to pruning predicts generalization in deep neural networks L Kuhn, C Lyle, AN Gomez, J Rothfuss, Y Gal arXiv preprint arXiv:2103.06002, 2021	10	2021
Multi-task multi-modal machine learning system NM Shazeer, AN Gomez, LM Kaiser, JD Uszkoreit, LO Jones, NJ Parmar, ... US Patent 10,789,427, 2020	10	2020

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors