Yi Wu
Researcher, OpenAI Inc.
Verified email at cs.berkeley.edu
Title | Cited by | Year
Multi-agent actor-critic for mixed cooperative-competitive environments
R Lowe, Y Wu, A Tamar, J Harb, P Abbeel, I Mordatch
Advances in Neural Information Processing Systems, 6379-6390, 2017
Cited by: 384 | Year: 2017
Value iteration networks
A Tamar, Y Wu, G Thomas, S Levine, P Abbeel
Advances in Neural Information Processing Systems, 2154-2162, 2016
Cited by: 262 | Year: 2016
Building generalizable agents with a realistic and rich 3d environment
Y Wu, Y Wu, G Gkioxari, Y Tian
arXiv preprint arXiv:1801.02209, 2018
Cited by: 98 | Year: 2018
Adversarial training for relation extraction
Y Wu, D Bamman, S Russell
Proceedings of the 2017 Conference on Empirical Methods in Natural Language …, 2017
Cited by: 51 | Year: 2017
Swift: Compiled inference for probabilistic programming languages
Y Wu, L Li, S Russell, R Bodik
arXiv preprint arXiv:1606.09242, 2016
Cited by: 26* | Year: 2016
Robust multi-agent reinforcement learning via minimax deep deterministic policy gradient
S Li, Y Wu, X Cui, H Dong, F Fang, S Russell
AAAI Conference on Artificial Intelligence (AAAI), 2019
Cited by: 12 | Year: 2019
Dual-space analysis of the sparse linear model
Y Wu, DP Wipf
Advances in Neural Information Processing Systems, 1745-1753, 2012
Cited by: 12 | Year: 2012
Understanding and evaluating sparse linear discriminant analysis
Y Wu, D Wipf, JM Yun
Artificial Intelligence and Statistics, 1070-1078, 2015
Cited by: 11 | Year: 2015
Deep reinforcement learning for green security games with real-time information
Y Wang, ZR Shi, L Yu, Y Wu, R Singh, L Joppa, F Fang
Proceedings of the AAAI Conference on Artificial Intelligence 33, 1401-1408, 2019
Cited by: 9* | Year: 2019
Meta-learning MCMC proposals
T Wang, Y Wu, D Moore, SJ Russell
Advances in Neural Information Processing Systems, 4146-4156, 2018
Cited by: 5* | Year: 2018
Discrete-Continuous Mixtures in Probabilistic Programming: Generalized Semantics and Inference Algorithms
Y Wu, S Srivastava, N Hay, S Du, S Russell
arXiv preprint arXiv:1806.02027, 2018
Cited by: 3* | Year: 2018
A nearly-black-box online algorithm for joint parameter and state estimation in temporal models
YB Erol, Y Wu, L Li, S Russell
Thirty-First AAAI Conference on Artificial Intelligence, 2017
Cited by: 3 | Year: 2017
BFiT: From possible-world semantics to random-evaluation semantics in open universe
Y Wu, L Li, SJ Russell
Neural Information Processing Systems, Probabilistic Programming workshop, 2014
Cited by: 2 | Year: 2014
Bayesian Relational Memory for Semantic Visual Navigation
Y Wu, Y Wu, A Tamar, S Russell, G Gkioxari, Y Tian
arXiv preprint arXiv:1909.04306, 2019
Cited by: 1 | Year: 2019
Near-Linear Time Local Polynomial Nonparametric Estimation
Y Wang, Y Wu, SS Du
arXiv preprint arXiv:1802.09578, 2018
Cited by: 1 | Year: 2018
Emergent tool use from multi-agent autocurricula
B Baker, I Kanitscheider, T Markov, Y Wu, G Powell, B McGrew, ...
arXiv preprint arXiv:1909.07528, 2019
Year: 2019
Learning and Planning with a Semantic Model
Y Wu, Y Wu, A Tamar, S Russell, G Gkioxari, Y Tian
arXiv preprint arXiv:1809.10842, 2018
Year: 2018