Follow
Yawen Duan
Title
Cited by
Cited by
Year
Ai alignment: A comprehensive survey
J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang, Y Duan, Z He, J Zhou, ...
arXiv preprint arXiv:2310.19852, 2023
2282023
Harms from increasingly agentic algorithmic systems
A Chan, R Salganik, A Markelius, C Pang, N Rajkumar, D Krasheninnikov, ...
Proceedings of the 2023 ACM Conference on Fairness, Accountability, and …, 2023
112*2023
TransNAS-Bench-101: Improving transferability and Generalizability of Cross-Task Neural Architecture Search
Y Duan, X Chen, H Xu, Z Chen, X Liang, T Zhang, Z Li
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
712021
Adversarial Policies Beat Superhuman Go AIs
TT Wang, A Gleave, T Tseng, K Pelrine, N Belrose, J Miller, MD Dennis, ...
Proceedings of the 40th International Conference on Machine Learning, 35655 …, 2023
61*2023
CATCH: Context-based Meta Reinforcement Learning for Transferrable Architecture Search
X Chen, Y Duan, Z Chen, H Xu, Z Chen, X Liang, T Zhang, Z Li
Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020
232020
On the fragility of learned reward functions
L McKinney, Y Duan, D Krueger, A Gleave
arXiv preprint arXiv:2301.03652, 2023
212023
Libra-leaderboard: Towards responsible ai through a balanced leaderboard of safety and capability
H Li, X Han, Z Zhai, H Mu, H Wang, Z Zhang, Y Geng, S Lin, R Wang, ...
arXiv preprint arXiv:2412.18551, 2024
12024
The system can't perform the operation now. Try again later.
Articles 1–7