Wenhao Yang
Title
Cited by
Year
On the Convergence of FedAvg on Non-IID Data
X Li, K Huang, W Yang, S Wang, Z Zhang
arXiv preprint arXiv:1907.02189, 2019
Cited by: 2196; Year: 2019
Communication-efficient local decentralized SGD methods
X Li, W Yang, S Wang, Z Zhang
arXiv preprint arXiv:1910.09126, 2019
Cited by: 112*; Year: 2019
Toward theoretical understandings of robust Markov decision processes: Sample complexity and asymptotics
W Yang, L Zhang, Z Zhang
The Annals of Statistics 50 (6), 3223-3248, 2022
Cited by: 52; Year: 2022
Federated Reinforcement Learning with Environment Heterogeneity
H Jin, Y Peng, W Yang, S Wang, Z Zhang
International Conference on Artificial Intelligence and Statistics, 18-37, 2022
Cited by: 51; Year: 2022
A regularized approach to sparse optimal policy in reinforcement learning
W Yang, X Li, Z Zhang
Advances in Neural Information Processing Systems 32, 2019
Cited by: 36*; Year: 2019
A Statistical Analysis of Polyak-Ruppert Averaged Q-Learning
X Li, W Yang, J Liang, Z Zhang, MI Jordan
International Conference on Artificial Intelligence and Statistics, 2207-2261, 2023
Cited by: 16*; Year: 2023
Robust Markov Decision Processes without Model Estimation
W Yang, H Wang, T Kozuno, SM Jordan, Z Zhang
arXiv preprint arXiv:2302.01248, 2023
Cited by: 9*; Year: 2023
KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal
T Kozuno, W Yang, N Vieillard, T Kitamura, Y Tang, J Mei, P Ménard, ...
arXiv preprint arXiv:2205.14211, 2022
Cited by: 6; Year: 2022
Finding the Near Optimal Policy via Adaptive Reduced Regularization in MDPs
W Yang, X Li, G Xie, Z Zhang
arXiv preprint arXiv:2011.00213, 2020
Cited by: 3; Year: 2020
Regularization and variance-weighted regression achieves minimax optimality in linear MDPs: theory and practice
T Kitamura, T Kozuno, Y Tang, N Vieillard, M Valko, W Yang, J Mei, ...
International Conference on Machine Learning, 17135-17175, 2023
Cited by: 2; Year: 2023
Semi-infinitely Constrained Markov Decision Processes
L Zhang, Y Peng, W Yang, Z Zhang
Advances in Neural Information Processing Systems 35, 16808-16820, 2022
Cited by: 2; Year: 2022
Estimation and Inference in Distributional Reinforcement Learning
L Zhang, Y Peng, J Liang, W Yang, Z Zhang
arXiv preprint arXiv:2309.17262, 2023
Cited by: 1; Year: 2023
Semiparametrically efficient off-policy evaluation in linear Markov decision processes
C Xie, W Yang, Z Zhang
International Conference on Machine Learning, 38227-38257, 2023
Cited by: 1; Year: 2023
Semi-Infinitely Constrained Markov Decision Processes and Provably Efficient Reinforcement Learning
L Zhang, Y Peng, W Yang, Z Zhang
IEEE Transactions on Pattern Analysis & Machine Intelligence, 1-14, 2023
Year: 2023
Statistical Estimation of Confounded Linear MDPs: An Instrumental Variable Approach
M Lu, W Yang, L Zhang, Z Zhang
arXiv preprint arXiv:2209.05186, 2022
Year: 2022