Follow
Wanpeng Zhang
Wanpeng Zhang
Ph.D. Candidate, Peking University
Verified email at stu.pku.edu.cn - Homepage
Title
Cited by
Cited by
Year
Sample efficient reinforcement learning via model-ensemble exploration and exploitation
Y Yao, L Xiao, Z An, W Zhang, D Luo
2021 IEEE International Conference on Robotics and Automation (ICRA), 4202-4208, 2021
202021
Model-based opponent modeling
X Yu, J Jiang, W Zhang, H Jiang, Z Lu
Advances in Neural Information Processing Systems 35, 28208-28221, 2022
192022
Robust model-based reinforcement learning for autonomous greenhouse control
W Zhang, X Cao, Y Yao, Z An, X Xiao, D Luo
Asian Conference on Machine Learning, 1208-1223, 2021
182021
igrow: A smart agriculture solution to autonomous greenhouse control
X Cao, Y Yao, L Li, W Zhang, Z An, Z Zhang, L Xiao, S Guo, X Cao, M Wu, ...
Proceedings of the AAAI Conference on Artificial Intelligence 36 (11), 11837 …, 2022
142022
A simulator-based planning framework for optimizing autonomous greenhouse control strategy
Z An, X Cao, Y Yao, W Zhang, L Li, Y Wang, S Guo, D Luo
Proceedings of the International Conference on Automated Planning and …, 2021
92021
Entity divider with language grounding in multi-agent reinforcement learning
Z Ding, W Zhang, J Yue, X Wang, T Huang, Z Lu
International Conference on Machine Learning, 8103-8119, 2023
72023
Efficient and stable information directed exploration for continuous reinforcement learning
M Chen, X Xiao, W Zhang, X Gao
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
52022
Self-paced probabilistic principal component analysis for data with outliers
B Zhao, X Xiao, W Zhang, B Zhang, G Gan, S Xia
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
42020
AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback
W Zhang, Z Lu
arXiv preprint arXiv:2309.17176, 2023
3*2023
MBDP: A Model-based Approach to Achieve both Robustness and Sample Efficiency via Double Dropout Planning
W Zhang, X Xiao, Y Yao, M Chen, D Luo
arXiv preprint arXiv:2108.01295, 2021
12021
Tackling Non-Stationarity in Reinforcement Learning via Causal-Origin Representation
W Zhang, Y Li, B Yang, Z Lu
arXiv preprint arXiv:2306.02747, 2023
2023
Method, device and equipment for determining parameters and storage medium
W Zhang, D Luo, X Xiao
CN Patent CN112,527,104 A, 2021
2021
The system can't perform the operation now. Try again later.
Articles 1–12