Dustin Morrill
Computing Science Graduate Student, University of Alberta
Verified email at ualberta.ca
Title
Cited by
Year
DeepStack: Expert-level artificial intelligence in heads-up no-limit poker
M Moravčík, M Schmid, N Burch, V Lisý, D Morrill, N Bard, T Davis, ...
Science 356 (6337), 508-513, 2017
Cited by: 646, Year: 2017
OpenSpiel: A framework for reinforcement learning in games
M Lanctot, E Lockhart, JB Lespiau, V Zambaldi, S Upadhyay, J Pérolat, ...
arXiv preprint arXiv:1908.09453, 2019
Cited by: 62, Year: 2019
Solving games with functional regret estimation
K Waugh, D Morrill, JA Bagnell, M Bowling
Twenty-ninth AAAI conference on artificial intelligence, 2015
Cited by: 48, Year: 2015
Computing approximate equilibria in sequential adversarial games by exploitability descent
E Lockhart, M Lanctot, J Pérolat, JB Lespiau, D Morrill, F Timbers, K Tuyls
arXiv preprint arXiv:1903.05614, 2019
Cited by: 37, Year: 2019
Neural replicator dynamics: Multiagent learning via hedging policy gradients
D Hennes, D Morrill, S Omidshafiei, R Munos, J Perolat, M Lanctot, ...
Proceedings of the 19th International Conference on Autonomous Agents and …, 2020
Cited by: 19*, Year: 2020
AIVAT: A new variance reduction technique for agent evaluation in imperfect information games
N Burch, M Schmid, M Moravcik, D Morrill, M Bowling
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
Cited by: 12, Year: 2018
Using regret estimation to solve games compactly
DR Morrill
Cited by: 10, Year: 2016
OpenSpiel: A Framework for Reinforcement Learning in Games. CoRR abs/1908.09453 (2019)
M Lanctot, E Lockhart, JB Lespiau, V Zambaldi, S Upadhyay, J Pérolat, ...
arXiv preprint cs.LG/1908.09453, 2019
Cited by: 9, Year: 2019
Hindsight and Sequential Rationality of Correlated Play
D Morrill, R D'Orazio, R Sarfati, M Lanctot, JR Wright, A Greenwald, ...
arXiv preprint arXiv:2012.05874, 2020
Cited by: 5, Year: 2020
The advantage regret-matching actor-critic
A Gruslys, M Lanctot, R Munos, F Timbers, M Schmid, J Perolat, D Morrill, ...
arXiv preprint arXiv:2008.12234, 2020
Cited by: 5, Year: 2020
Neural replicator dynamics
D Hennes, D Morrill, S Omidshafiei, R Munos, J Perolat, M Lanctot, ...
arXiv preprint arXiv:1906.00190, 2019
Cited by: 4, Year: 2019
Alternative Function Approximation Parameterizations for Solving Games: An Analysis of f-Regression Counterfactual Regret Minimization
R D'Orazio, D Morrill, JR Wright, M Bowling
arXiv preprint arXiv:1912.02967, 2019
Cited by: 3, Year: 2019
Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games
D Morrill, R D'Orazio, M Lanctot, JR Wright, M Bowling, A Greenwald
arXiv preprint arXiv:2102.06973, 2021
Cited by: 2, Year: 2021
Bounds for approximate regret-matching algorithms
R D'Orazio, D Morrill, JR Wright
arXiv preprint arXiv:1910.01706, 2019
Cited by: 1, Year: 2019
Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games Supplementary
D Morrill, R D’Orazio, M Lanctot, JR Wright, M Bowling, AR Greenwald
Clavis Aurea?
M Moravčík, M Schmid, N Burch, V Lisý, D Morrill, N Bard, T Davis