Yao Liu

Cited by

	All	Since 2019
Citations	665	658
h-index	9	9
i10-index	9	9

180

135

20182019202020212022202320246 45 75 165 151 156 66

Public access

View all

8 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Emma BrunskillAssociate Professor of Computer Science, Stanford UniversityVerified email at cs.stanford.edu
Omer GottesmanAmazonVerified email at amazon.com
Finale Doshi-VelezProfessor, HarvardVerified email at seas.harvard.edu
Alekh AgarwalGoogleVerified email at google.com
Adith SwaminathanMicrosoft ResearchVerified email at microsoft.com
Pierre-Luc BaconUniversity of MontrealVerified email at mila.quebec
Zhaohan Daniel GuoDeepMindVerified email at google.com
Allen NieStanford UniversityVerified email at stanford.edu
Rasool FakoorAmazon Web ServicesVerified email at amazon.com
Yannis Flet-BerliacPostdoc, Stanford UniversityVerified email at stanford.edu
Shoham SabachAssociate Professor, Technion, Faculty of Data and Decision SciencesVerified email at technion.ac.il
Kavosh AsadiResearch Scientist, Amazon Web ServicesVerified email at amazon.com
Liwei WangProfessor, Peking UniversityVerified email at cis.pku.edu.cn
Dipendra MisraMicrosoft Research New YorkVerified email at microsoft.com
Robert SchapireMicrosoft ResearchVerified email at microsoft.com
Miroslav DudikMicrosoft ResearchVerified email at microsoft.com
Philip ThomasUniversity of Massachusetts AmherstVerified email at cs.umass.edu
Pratik ChaudhariUniversity of PennsylvaniaVerified email at seas.upenn.edu
Zuxin LiuCarnegie Mellon UniversityVerified email at cs.cmu.edu
Jesse ZhangPhD Student, USCVerified email at usc.edu

Yao Liu

Amazon

Verified email at stanford.edu - Homepage

Reinforcement Learning Machine Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Provably good batch reinforcement learning without great exploration Y Liu, A Swaminathan, A Agarwal, E Brunskill Advances in Neural Information Processing Systems 33, 1264–1274, 2020	202	2020
Off-Policy Policy Gradient with Stationary Distribution Correction Y Liu, A Swaminathan, A Agarwal, E Brunskill Proceedings of The 35th Uncertainty in Artificial Intelligence Conference …, 2019	170*	2019
Representation balancing mdps for off-policy policy evaluation Y Liu, O Gottesman, A Raghu, M Komorowski, A Faisal, F Doshi-Velez, ... Advances in Neural Information Processing Systems 31, 2644--2653, 2018	74	2018
Interpretable off-policy evaluation in reinforcement learning by highlighting influential transitions O Gottesman, J Futoma, Y Liu, S Parbhoo, L Celi, E Brunskill, ... International Conference on Machine Learning, 3658-3667, 2020	51	2020
Understanding the curse of horizon in off-policy evaluation via conditional importance sampling Y Liu, PL Bacon, E Brunskill International Conference on Machine Learning, 6184-6193, 2020	39	2020
Behaviour policy estimation in off-policy policy evaluation: Calibration matters A Raghu, O Gottesman, Y Liu, M Komorowski, A Faisal, F Doshi-Velez, ... arXiv preprint arXiv:1807.01066, 2018	39	2018
Combining parametric and nonparametric models for off-policy evaluation O Gottesman, Y Liu, S Sussex, E Brunskill, F Doshi-Velez In International Conference on Machine Learning, 2366-2375, 2019	30	2019
When Simple Exploration is Sample Efficient: Identifying Sufficient Conditions for Random Exploration to Yield PAC RL Algorithms Y Liu, E Brunskill The 14th European Workshop on Reinforcement Learning, 2018	23	2018
Pac continuous state online multitask reinforcement learning with identification Y Liu, Z Guo, E Brunskill Proceedings of the 2016 International Conference on Autonomous Agents …, 2016	18	2016
Reinforcement learning tutor better supported lower performers in a math task S Ruan, A Nie, W Steenbergen, J He, JQ Zhang, M Guo, Y Liu, ... Machine Learning, 1-26, 2024	7	2024
All-action policy gradient methods: A numerical integration approach B Petit, L Amdahl-Culleton, Y Liu, J Smith, PL Bacon arXiv preprint arXiv:1910.09093, 2019	5	2019
Nonlinear Dimensionality Reduction by Local Orthogonality Preserving Alignment T Lin, Y Liu, B Wang, LW Wang, HB Zha Journal of Computer Science and Technology 31 (3), 512-524, 2016	3*	2016
Offline policy optimization with eligible actions Y Liu, Y Flet-Berliac, E Brunskill Uncertainty in Artificial Intelligence, 1253-1263, 2022	2	2022
Provably sample-efficient RL with side information about latent dynamics Y Liu, D Misra, M Dudík, RE Schapire Advances in Neural Information Processing Systems 35, 33482-33493, 2022	1	2022
Stitched trajectories for off-policy learning S Sussex, O Gottesman, Y Liu, S Murphy, E Brunskill, F Doshi-Velez ICML Workshop, 2018	1	2018
Budgeting counterfactual for offline RL Y Liu, P Chaudhari, R Fakoor Advances in Neural Information Processing Systems 36, 2024		2024
TD Convergence: An Optimization Perspective K Asadi, S Sabach, Y Liu, O Gottesman, R Fakoor Advances in Neural Information Processing Systems 36, 2024		2024
TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models Z Liu, J Zhang, K Asadi, Y Liu, D Zhao, S Sabach, R Fakoor arXiv preprint arXiv:2310.05905, 2023		2023
Model Selection for Off-Policy Policy Evaluation Y Liu, PS Thomas, E Brunskill The Multi-disciplinary Conference on Reinforcement Learning and Decision Making, 2017		2017

The system can't perform the operation now. Try again later.

Articles 1–19

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors