Follow
John Burden
Title
Cited by
Cited by
Year
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models
A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ...
arXiv preprint arXiv:2206.04615, 2022
11702022
Harms from increasingly agentic algorithmic systems
A Chan, R Salganik, A Markelius, C Pang, N Rajkumar, D Krasheninnikov, ...
Proceedings of the 2023 ACM Conference on Fairness, Accountability, and …, 2023
652023
Rethink reporting of evaluation results in AI
R Burnell, W Schellaert, J Burden, TD Ullman, F Martinez-Plumed, ...
Science 380 (6641), 136-138, 2023
602023
Safe reinforcement learning for sepsis treatment
Y Jia, J Burden, T Lawton, I Habli
2020 IEEE International conference on healthcare informatics (ICHI), 1-7, 2020
282020
Safety-driven design of machine learning for sepsis treatment
Y Jia, T Lawton, J Burden, J McDermid, I Habli
Journal of Biomedical Informatics 117, 103762, 2021
162021
Exploring AI safety in degrees: generality, capability and control
J Burden, J Hernández-Orallo
Proceedings of the workshop on artificial intelligence safety (safeai 2020 …, 2020
142020
How general-purpose is a language model? Usefulness and safety with human prompters in the wild
PAM Casares, BS Loe, J Burden, J Hernández-Orallo
Proceedings of the AAAI Conference on Artificial Intelligence 36 (5), 5295-5303, 2022
92022
Your prompt is my command: on assessing the human-centred generality of multimodal models
W Schellaert, F Martínez-Plumed, K Vold, J Burden, PAM Casares, ...
Journal of Artificial Intelligence Research 77, 377-394, 2023
62023
Evaluating AI evaluation: Perils and prospects
J Burden
arXiv preprint arXiv:2407.09221, 2024
52024
An international consortium for evaluations of societal-scale risks from advanced AI
R Gruetzemacher, A Chan, K Frazier, C Manning, Š Los, J Fox, ...
arXiv preprint arXiv:2310.14455, 2023
52023
Inferring Capabilities from Task Performance with Bayesian Triangulation
J Burden, K Voudouris, R Burnell, D Rutar, L Cheke, J Hernández-Orallo
arXiv preprint arXiv:2309.11975, 2023
52023
Evaluating object permanence in embodied agents using the animal-ai environment
K Voudouris, N Donnelly, D Rutar, R Burnell, J Burden, ...
https://ceur-ws. org/Vol-3169/paper2. pdf, 2022
52022
Not a Number: Identifying Instance Features for Capability-Oriented Evaluation
R Burnell, J Burden, D Rutar, K Voudouris, L Cheke, J Hernández-Orallo
Proceedings of the Thirty-First International Joint Conference on …, 2022
52022
Using uniform state abstractions for reward shaping with reinforcement learning
J Burden, D Kudenko
Workshop on adaptive learning agents (ALA) at the federated AI meeting 18, 2018
52018
Animal-AI 3: What's New & Why You Should Care
K Voudouris, I Alhas, W Schellaert, M Crosby, J Holmes, J Burden, ...
arXiv preprint arXiv:2312.11414, 2023
42023
Predictable Artificial Intelligence
L Zhou, PA Moreno-Casares, F Martínez-Plumed, J Burden, R Burnell, ...
arXiv preprint arXiv:2310.06167, 2023
32023
Latent Property State Abstraction For Reinforcement learning
J Burden, SK Siahroudi, D Kudenko
Proceedings of the AAMAS Workshop on Adaptive Learning Agents (ALA), 2021
32021
Uniform state abstraction for reinforcement learning
J Burden, D Kudenko
ECAI 2020, 1031-1038, 2020
32020
Oases of Cooperation: An Empirical Evaluation of Reinforcement Learning in the Iterated Prisoner's Dilemma.
P Barnett, J Burden
SafeAI@ AAAI, 2022
22022
9. From Turing’s Speculations to an Academic Discipline: A History of AI Existential Safety
J Burden, S Clarke, J Whittlestone
Open Book Publishers, 2023
12023
The system can't perform the operation now. Try again later.
Articles 1–20