Title | Authors | Venue | Cited by | Year
Ethical and social risks of harm from language models | L Weidinger, J Mellor, M Rauh, C Griffin, J Uesato, PS Huang, M Cheng, ... | arXiv preprint arXiv:2112.04359, 2021 | 966 | 2021
Taxonomy of risks posed by language models | L Weidinger, J Uesato, M Rauh, C Griffin, PS Huang, J Mellor, A Glaese, ... | Proceedings of the 2022 ACM Conference on Fairness, Accountability, and …, 2022 | 552 | 2022
Science in the age of large language models | A Birhane, A Kasirzadeh, D Leslie, S Wachter | Nature Reviews Physics 5 (5), 277-280, 2023 | 219 | 2023
Airline crew scheduling: models, algorithms, and data sets | A Kasirzadeh, M Saddoune, F Soumis | EURO Journal on Transportation and Logistics 6 (2), 111-137, 2017 | 181 | 2017
The use and misuse of counterfactuals in ethical machine learning | A Kasirzadeh, A Smart | Proceedings of 2021 ACM Conference on Fairness, Accountability, and Transparency, 2021 | 128 | 2021
In conversation with artificial intelligence: aligning language models with human values | A Kasirzadeh, I Gabriel | Philosophy & Technology 36 (2), 27, 2023 | 117 | 2023
Typology of risks of generative text-to-image models | C Bird, E Ungless, A Kasirzadeh | Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society, 396-410, 2023 | 85 | 2023
Foundational challenges in assuring alignment and safety of large language models | U Anwar, A Saparov, J Rando, D Paleka, M Turpin, P Hase, ES Lubana, ... | arXiv preprint arXiv:2404.09932, 2024 | 83 | 2024
User tampering in reinforcement learning recommender systems | A Kasirzadeh, C Evans | Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society, 58-69, 2023 | 51* | 2023
Algorithmic and human decision making: for a double standard of transparency | M Günther, A Kasirzadeh | AI & SOCIETY 37 (1), 375-381, 2022 | 41 | 2022
Algorithmic fairness and structural injustice: Insights from feminist political philosophy | A Kasirzadeh | Proceedings of the 2022 AAAI/ACM Conference on AI, Ethics, and Society, 349-356, 2022 | 32 | 2022
A review of modern recommender systems using generative models (gen-recsys) | Y Deldjoo, Z He, J McAuley, A Korikov, S Sanner, A Ramisa, R Vidal, ... | Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and …, 2024 | 31 | 2024
Reasons, values, stakeholders: a philosophical framework for explainable artificial intelligence | A Kasirzadeh | Proceedings of 2021 ACM Conference on Fairness, Accountability, and Transparency, 2021 | 27 | 2021
Fairness and data protection impact assessments | A Kasirzadeh, D Clifford | Proceedings of the 2021 AAAI/ACM Conference on AI, Ethics, and Society, 146-153, 2021 | 14 | 2021
Two Types of AI Existential Risk: Decisive and Accumulative | A Kasirzadeh | arXiv preprint arXiv:2401.07836, 2024 | 12 | 2024
Counter countermathematical explanations | A Kasirzadeh | Erkenntnis, 2537–2560, 2023 | 10 | 2023
The Ethical Gravity Thesis: Marrian levels and the persistence of algorithmic bias in automated decision-making systems | A Kasirzadeh, C Klein | Proceedings of the 2021 AAAI/ACM Conference on AI, Ethics, and Society, 618-626, 2021 | 9 | 2021
CIVICS: Building a Dataset for Examining Culturally-Informed Values in Large Language Models | G Pistilli, A Leidinger, Y Jernite, A Kasirzadeh, AS Luccioni, M Mitchell | arXiv preprint arXiv:2405.13974, 2024 | 8 | 2024
A new role for mathematics in empirical sciences | A Kasirzadeh | Philosophy of Science 88, 2021 | 7 | 2021
Discipline and Label: A WEIRD Genealogy and Social Theory of Data Annotation | A Smart, D Wang, E Monk, M Díaz, A Kasirzadeh, E Van Liemt, ... | arXiv preprint arXiv:2402.06811, 2024 | 6 | 2024