Aidan Gomez
Department of Computer Science, University of Oxford
Verified email at cohere.ai
Title
Cited by
Year
Attention is all you need
A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ...
Advances in neural information processing systems, 5998-6008, 2017
Cited by: 29596, Year: 2017
Tensor2Tensor for neural machine translation
A Vaswani, S Bengio, E Brevdo, F Chollet, AN Gomez, S Gouws, L Jones, ...
arXiv preprint arXiv:1803.07416, 2018
Cited by: 420, Year: 2018
The reversible residual network: Backpropagation without storing activations
AN Gomez, M Ren, R Urtasun, RB Grosse
Proceedings of the 31st International Conference on Neural Information …, 2017
Cited by: 284, Year: 2017
One model to learn them all
L Kaiser, AN Gomez, N Shazeer, A Vaswani, N Parmar, L Jones, ...
arXiv preprint arXiv:1706.05137, 2017
Cited by: 267, Year: 2017
Depthwise Separable Convolutions for Neural Machine Translation
L Kaiser, AN Gomez, F Chollet
International Conference on Learning Representations, 2018
Cited by: 184, Year: 2018
Learning Sparse Networks Using Targeted Dropout
AN Gomez, I Zhang, S Rao Kamalakara, D Madaan, K Swersky, Y Gal, ...
arXiv preprint arXiv:1905.13678, 2019
Cited by: 76*, Year: 2019
Unsupervised cipher cracking using discrete GANs
AN Gomez, S Huang, I Zhang, BM Li, M Osama, L Kaiser
arXiv preprint arXiv:1801.04883, 2018
Cited by: 48, Year: 2018
A systematic comparison of Bayesian deep learning robustness in diabetic retinopathy tasks
A Filos, S Farquhar, AN Gomez, TGJ Rudner, Z Kenton, L Smith, ...
arXiv preprint arXiv:1912.10481, 2019
Cited by: 47*, Year: 2019
The difficulty of training sparse neural networks
U Evci, F Pedregosa, A Gomez, E Elsen
arXiv preprint arXiv:1906.10732, 2019
Cited by: 32, Year: 2019
Wat zei je? Detecting out-of-distribution translations with variational transformers
TZ Xiao, AN Gomez, Y Gal
arXiv preprint arXiv:2006.08344, 2020
Cited by: 11*, Year: 2020
Large-scale clinical interpretation of genetic variants using evolutionary data and deep learning
J Frazer, P Notin, M Dias, A Gomez, K Brock, Y Gal, D Marks
bioRxiv, 2020
Cited by: 5, Year: 2020
Attention-based sequence transduction neural networks
NM Shazeer, AN Gomez, LM Kaiser, JD Uszkoreit, LO Jones, NJ Parmar, ...
US Patent 10,452,978, 2019
Cited by: 2, Year: 2019
Self-Attention Between Datapoints: Going Beyond Individual Input-Output Pairs in Deep Learning
J Kossen, N Band, C Lyle, AN Gomez, T Rainforth, Y Gal
arXiv preprint arXiv:2106.02584, 2021
Cited by: 1, Year: 2021
Robustness to Pruning Predicts Generalization in Deep Neural Networks
L Kuhn, C Lyle, AN Gomez, J Rothfuss, Y Gal
arXiv preprint arXiv:2103.06002, 2021
Cited by: 1, Year: 2021
Depthwise separable convolutions for neural machine translation
AN Gomez, LM Kaiser, F Chollet
US Patent 10,853,590, 2020
Cited by: 1, Year: 2020
Predicting Twitter Engagement With Deep Language Models
M Volkovs, Z Cheng, M Ravaut, H Yang, K Shen, JP Zhou, A Wong, ...
Proceedings of the Recommender Systems Challenge 2020, 38-43, 2020
Cited by: 1, Year: 2020
Improving compute efficacy frontiers with SliceOut
P Notin, AN Gomez, J Yoo, Y Gal
arXiv preprint arXiv:2007.10909, 2020
Cited by: 1*, Year: 2020
Prioritized training on points that are learnable, worth learning, and not yet learned
S Mindermann, M Razzak, W Xu, A Kirsch, M Sharma, A Morisot, ...
arXiv preprint arXiv:2107.02565, 2021
Year: 2021
Depthwise separable convolutions for neural machine translation
AN Gomez, LM Kaiser, F Chollet
US Patent App. 17/100,169, 2021
Year: 2021
Interlocking Backpropagation: Improving depthwise model-parallelism
AN Gomez, O Key, S Gou, N Frosst, J Dean, Y Gal
arXiv preprint arXiv:2010.04116, 2020
Year: 2020