Wenhao Yang

Cited by

	All	Since 2019
Citations	2576	2572
h-index	7	7
i10-index	7	7

880

440

220

660

20192020202120222023202417 146 366 649 865 526

Public access

View all

7 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Zhihua ZhangProfessor of Computer Science, Shanghai Jiao Tong UniversityVerified email at zju.edu.cn
Shusen WangMetaVerified email at meta.com
Xiang LiUniversity of PennsylvaniaVerified email at upenn.edu
Liangyu ZhangPhD student at Peking UniversityVerified email at pku.edu.cn
Tadashi KozunoOMRON SINIC XVerified email at alumni.oist.jp
Hao JinPeking UniversityVerified email at pku.edu.cn
Michael I. JordanProfessor of Electrical Engineering and Computer Sciences and Professor of Statistics, UC BerkeleyVerified email at cs.berkeley.edu
Jiadong Liangpeking universityVerified email at pku.edu.cn
Scott M. JordanPostdoctoral Fellow, University of AlbertaVerified email at ualberta.ca
Michal ValkoLlama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMindVerified email at meta.com
Matthieu GeistCohere (ex Google, on leave of Professor, Université de Lorraine)Verified email at univ-lorraine.fr
Mohammad Gheshlaghi AzarCohereVerified email at google.com
Rémi MunosGoogle DeepMindVerified email at inria.fr
Csaba SzepesvariDeepMind & University of AlbertaVerified email at cs.ualberta.ca
Pierre MénardOvGU MagdeburgVerified email at inria.fr
Toshinori KitamuraThe University of TokyoVerified email at weblab.t.u-tokyo.ac.jp
Nino VieillardGoogle DeepMindVerified email at google.com
Olivier PietquinCohere | ex Google DeepMind (On leave - Professor at University of Lille)Verified email at univ-lille.fr
Jincheng MeiResearch Scientist, Google BrainVerified email at google.com
Yunhao TangResearch Scientist, DeepMindVerified email at columbia.edu

Wenhao Yang

Stanford University

Verified email at stanford.edu - Homepage

Reinforcement Learning Optimization Statistics


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
On the Convergence of FedAvg on Non-IID Data X Li, K Huang, W Yang, S Wang, Z Zhang arXiv preprint arXiv:1907.02189, 2019	2270	2019
Communication-efficient local decentralized SGD methods X Li, W Yang, S Wang, Z Zhang arXiv preprint arXiv:1910.09126, 2019	111*	2019
Toward theoretical understandings of robust Markov decision processes: Sample complexity and asymptotics W Yang, L Zhang, Z Zhang The Annals of Statistics 50 (6), 3223-3248, 2022	58	2022
Federated Reinforcement Learning with Environment Heterogeneity H Jin, Y Peng, W Yang, S Wang, Z Zhang International Conference on Artificial Intelligence and Statistics, 18-37, 2022	53	2022
A regularized approach to sparse optimal policy in reinforcement learning W Yang, X Li, Z Zhang Advances in Neural Information Processing Systems 32, 2019	36*	2019
A Statistical Analysis of Polyak-Ruppert Averaged Q-Learning X Li, W Yang, J Liang, Z Zhang, MI Jordan International Conference on Artificial Intelligence and Statistics, 2207-2261, 2023	17*	2023
Robust Markov Decision Processes without Model Estimation W Yang, H Wang, T Kozuno, SM Jordan, Z Zhang arXiv preprint arXiv:2302.01248, 2023	10*	2023
KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal T Kozuno, W Yang, N Vieillard, T Kitamura, Y Tang, J Mei, P Ménard, ... arXiv preprint arXiv:2205.14211, 2022	7	2022
Semiparametrically efficient off-policy evaluation in linear Markov decision processes C Xie, W Yang, Z Zhang International Conference on Machine Learning, 38227-38257, 2023	4	2023
Finding the Near Optimal Policy via Adaptive Reduced Regularization in MDPs W Yang, X Li, G Xie, Z Zhang arXiv preprint arXiv:2011.00213, 2020	3	2020
Regularization and variance-weighted regression achieves minimax optimality in linear MDPs: theory and practice T Kitamura, T Kozuno, Y Tang, N Vieillard, M Valko, W Yang, J Mei, ... International Conference on Machine Learning, 17135-17175, 2023	2	2023
Semi-infinitely Constrained Markov Decision Processes L Zhang, Y Peng, W Yang, Z Zhang Advances in Neural Information Processing Systems 35, 16808-16820, 2022	2	2022
Statistical Estimation of Confounded Linear MDPs: An Instrumental Variable Approach M Lu, W Yang, L Zhang, Z Zhang arXiv preprint arXiv:2209.05186, 2022	2	2022
Estimation and Inference in Distributional Reinforcement Learning L Zhang, Y Peng, J Liang, W Yang, Z Zhang arXiv preprint arXiv:2309.17262, 2023	1	2023
Semi-Infinitely Constrained Markov Decision Processes and Provably Efficient Reinforcement Learning L Zhang, Y Peng, W Yang, Z Zhang IEEE Transactions on Pattern Analysis & Machine Intelligence, 1-14, 2023		2023

The system can't perform the operation now. Try again later.

Articles 1–15

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors