Follow
Ben Garfinkel
Ben Garfinkel
Director, Centre for the Governance of AI; Research Fellow, University of Oxford
Verified email at philosophy.ox.ac.uk
Title
Cited by
Cited by
Year
The malicious use of artificial intelligence: Forecasting, prevention, and mitigation
M Brundage, S Avin, J Clark, H Toner, P Eckersley, B Garfinkel, A Dafoe, ...
arXiv preprint arXiv:1802.07228, 2018
11602018
Model evaluation for extreme risks
T Shevlane, S Farquhar, B Garfinkel, M Phuong, J Whittlestone, J Leung, ...
arXiv preprint arXiv:2305.15324, 2023
1332023
How does the offense-defense balance scale?
B Garfinkel, A Dafoe
Emerging Technologies and International Stability, 247-274, 2021
912021
Democratising AI: Multiple meanings, goals, and methods
E Seger, A Ovadya, D Siddarth, B Garfinkel, A Dafoe
Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society, 715-722, 2023
632023
Towards best practices in AGI safety and governance: A survey of expert opinion
J Schuett, N Dreksler, M Anderljung, D McCaffary, L Heim, E Bluemke, ...
arXiv preprint arXiv:2305.07153, 2023
432023
The windfall clause: Distributing the benefits of AI for the common good
C O'Keefe, P Cihon, B Garfinkel, C Flynn, J Leung, A Dafoe
Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society, 327-331, 2020
392020
Open-sourcing highly capable foundation models: An evaluation of risks, benefits, and alternative methods for pursuing open-source objectives
E Seger, N Dreksler, R Moulange, E Dardaman, J Schuett, K Wei, ...
arXiv preprint arXiv:2311.09227, 2023
362023
Beyond privacy trade-offs with structured transparency
A Trask, E Bluemke, T Collins, BGE Drexler, CG Cuervas-Mons, I Gabriel, ...
arXiv preprint arXiv:2012.08347, 2020
302020
& Amodei, D.(2018)
M Brundage, S Avin, J Clark, H Toner, P Eckersley, B Garfinkel, A DAFOE, ...
The malicious use of artificial intelligence: Forecasting, prevention, and …, 1802
201802
The malicious use of artificial intelligence: forecasting, prevention, and mitigation. Future of Humanity Institute, University of Oxford, Centre for the Study of Existential …
M Brundage, S Avin, J Clark, H Toner, P Eckersley, B Garfinkel, D Amodei
Center for a New American Security, Electronic Frontier Foundation, OpenAI 1 …, 2018
192018
The windfall clause: Distributing the benefits of AI, Centre for the governance of AI research report
C O’Keefe, P Cihon, C Flynn, B Garfinkel, J Leung, A Dafoe
Future of Humanity Institute, University of Oxford. https://www. fhi. ox. ac …, 2020
142020
Open problems in technical ai governance
A Reuel, B Bucknall, S Casper, T Fist, L Soder, O Aarne, L Hammond, ...
arXiv preprint arXiv:2407.14981, 2024
132024
AI policy levers: A review of the US government’s tools to shape AI research, development, and deployment
SC Fischer, J Leung, M Anderljung, C O’keefe, S Torges, SM Khan, ...
Retrieved June 1, 2022, 2021
92021
From principles to rules: A regulatory approach for frontier AI
J Schuett, M Anderljung, A Carlier, L Koessler, B Garfinkel
arXiv preprint arXiv:2407.07300, 2024
82024
Exploring the Relevance of Data Privacy-Enhancing Technologies for AI Governance Use Cases
E Bluemke, T Collins, B Garfinkel, A Trask
arXiv preprint arXiv:2303.08956, 2023
82023
Contact tracing apps can help stop coronavirus. But they can hurt privacy
T Shevlane, B Garfinkel, A Dafoe
The Washington Post, 2020
72020
On the impossibility of supersized machines
B Garfinkel, M Brundage, D Filan, C Flynn, J Luketina, M Page, ...
arXiv preprint arXiv:1703.10987, 2017
62017
The impact of artificial intelligence
B Garfinkel
The Oxford handbook of AI governance, 2022
52022
Towards best practices in AGI safety and governance
J Schuett, N Dreksler, M Anderljung, D McCaffary, L Heim, E Bluemke, ...
Surv. Expert Opin., 2023
42023
Open-sourcing highly capable foundation models
E Seger, N Dreksler, R Moulange, E Dardaman, J Schuett, K Wei, ...
Research paper, Centre for the Governance of AI, 2023
32023
The system can't perform the operation now. Try again later.
Articles 1–20