Yuhuai(Tony) Wu

Cited by

	All	Since 2019
Citations	15165	14460
h-index	34	33
i10-index	44	44

6000

3000

1500

4500

20162017201820192020202120222023202440 148 474 815 1584 1989 2750 5148 2129

Public access

View all

17 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Roger GrosseAssociate Professor, University of TorontoVerified email at cs.toronto.edu
Jimmy BaUniversity of TorontoVerified email at cs.toronto.edu
Christian SzegedyResearcherVerified email at szegedy.org
Yoshua BengioProfessor of computer science, University of Montreal, Mila, IVADO, CIFARVerified email at umontreal.ca
Ruslan SalakhutdinovUPMC Professor, Machine Learning Department, CMUVerified email at cs.cmu.edu
Behnam NeyshaburSenior Staff Research Scientist, DeepMindVerified email at google.com
David DuvenaudAssociate Professor, University of TorontoVerified email at cs.toronto.edu
Pieter AbbeelUC Berkeley | CovariantVerified email at cs.berkeley.edu
Albert Q. JiangUniversity of Cambridge | Mistral AIVerified email at mistral.ai
Percy LiangAssociate Professor of Computer Science, Stanford UniversityVerified email at cs.stanford.edu
Saizheng Zhang
Oriol VinyalsResearch Scientist at Google DeepMindVerified email at google.com

Yuhuai(Tony) Wu

Co-Founder of xAI

Verified email at x.ai - Homepage

Machine Learning Machine Reasoning Theorem Proving


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Grandmaster level in StarCraft II using multi-agent reinforcement learning O Vinyals, I Babuschkin, WM Czarnecki, M Mathieu, A Dudzik, J Chung, ... Nature 575 (7782), 350-354, 2019	4609*	2019
On the opportunities and risks of foundation models R Bommasani, DA Hudson, E Adeli, R Altman, S Arora, S von Arx, ... arXiv preprint arXiv:2108.07258, 2021	2714	2021
Openai baselines P Dhariwal, C Hesse, O Klimov, A Nichol, M Plappert, A Radford, ...	1829*	2017
Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation Y Wu, E Mansimov, RB Grosse, S Liao, J Ba Advances in Neural Information Processing Systems, 5283-5292, 2017	787	2017
Palm 2 technical report R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ... arXiv preprint arXiv:2305.10403, 2023	765	2023
Holistic evaluation of language models P Liang, R Bommasani, T Lee, D Tsipras, D Soylu, M Yasunaga, Y Zhang, ... arXiv preprint arXiv:2211.09110, 2022	598	2022
Solving quantitative reasoning problems with language models A Lewkowycz, A Andreassen, D Dohan, E Dyer, H Michalewski, ... Advances in Neural Information Processing Systems 35, 3843-3857, 2022	421	2022
Backpropagation through the void: Optimizing control variates for black-box gradient estimation W Grathwohl, D Choi, Y Wu, G Roeder, D Duvenaud ICLR2018, 2017	311	2017
STaR: Bootstrapping reasoning with reasoning E Zelikman, Y Wu, ND Goodman arXiv preprint arXiv:2203.14465, 2022	269*	2022
On the quantitative analysis of decoder-based generative models Y Wu, Y Burda, R Salakhutdinov, R Grosse 5th International Conference on Learning Representations (ICLR 2017), 2016	265	2016
Sticking the landing: Simple, lower-variance gradient estimators for variational inference G Roeder, Y Wu, DK Duvenaud Advances in Neural Information Processing Systems 30, 2017	257*	2017
Architectural complexity measures of recurrent neural networks S Zhang, Y Wu, T Che, Z Lin, R Memisevic, RR Salakhutdinov, Y Bengio Advances in neural information processing systems 29, 2016	190	2016
STDP-compatible approximation of backpropagation in an energy-based model Y Bengio, T Mesnard, A Fischer, S Zhang, Y Wu Neural computation 29 (3), 555-577, 2017	182*	2017
On multiplicative integration with recurrent neural networks Y Wu, S Zhang, Y Zhang, Y Bengio, RR Salakhutdinov Advances in neural information processing systems 29, 2016	179	2016
The Importance of Sampling in Meta-Reinforcement Learning B Stadie, G Yang, R Houthooft, P Chen, Y Duan, Y Wu, P Abbeel, ... Advances in Neural Information Processing Systems, 9299-9309, 2018	164*	2018
Memorizing Transformers Y Wu, MN Rabe, DL Hutchins, C Szegedy International Conference on Learning Representations 2022, 2022	162	2022
Understanding Short-Horizon Bias in Stochastic Meta-Optimization Y Wu, M Ren, R Liao, RB Grosse Sixth International Conference on Learning Representations (ICLR 2018), 2018	132	2018
Invariant Causal Representation Learning for Out-of-Distribution Generalization C Lu, Y Wu, JM Hernández-Lobato, B Schölkopf International Conference on Learning Representations, 2022	119*	2022
Exploring length generalization in large language models C Anil, Y Wu, A Andreassen, A Lewkowycz, V Misra, V Ramasesh, ... Advances in Neural Information Processing Systems 35, 38546-38556, 2022	104	2022
Autoformalization with large language models Y Wu, AQ Jiang, W Li, M Rabe, C Staats, M Jamnik, C Szegedy Advances in Neural Information Processing Systems 35, 32353-32368, 2022	94	2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors