-
782In this report, we argue that there is a realistic possibility that some AI systems will be conscious and/or robustly agentic in the near future. That means that the prospect of AI welfare and moral patienthood — of AI systems with their own interests and moral significance — is no longer an issue only for sci-fi or the distant future. It is an issue for the near future, and AI companies and other actors have a responsibility to start taking it seriously. We also recommend three early step…Read more
-
2079What is AI safety? What do we want it to be?Philosophical Studies 182 (7): 1495-1518. 2025.The field of AI safety seeks to prevent or reduce the harms caused by AI systems. A simple and appealing account of what is distinctive of AI safety as a field holds that this feature is constitutive: a research project falls within the purview of AI safety just in case it aims to prevent or reduce the harms caused by AI systems. Call this appealingly simple account The Safety Conception of AI safety. Despite its simplicity and appeal, we argue that The Safety Conception is in tension with at le…Read more
-
1895What is it for a Machine Learning Model to Have a Capability?British Journal for the Philosophy of Science. forthcoming.What can contemporary machine learning (ML) models do? Given the proliferation of ML models in society, answering this question matters to a variety of stakeholders, both public and private. The evaluation of models' capabilities is rapidly emerging as a key subfield of modern ML, buoyed by regulatory attention and government grants. Despite this, the notion of an ML model possessing a capability has not been interrogated: what are we saying when we say that a model is able to do something? And …Read more
-
1861Operationalising Representation in Natural Language ProcessingBritish Journal for the Philosophy of Science. 2023.Despite its centrality in the philosophy of cognitive science, there has been little prior philosophical work engaging with the notion of representation in contemporary NLP practice. This paper attempts to fill that lacuna: drawing on ideas from cognitive science, I introduce a framework for evaluating the representational claims made about components of neural NLP models, proposing three criteria with which to evaluate whether a component of a model represents a property and operationalising th…Read more
-
68Proxy Selection in Transitive Proxy VotingSocial Choice and Welfare 58 69-99. 2022.Transitive proxy voting (or "liquid democracy") is a novel form of collective decision making, often framed as an attractive hybrid of direct and representative democracy. Although the ideas behind liquid democracy have garnered widespread support, there have been relatively few attempts to model it formally. This paper makes three main contributions. First, it proposes a new social choice-theoretic model of liquid democracy, which is distinguished by taking a richer formal perspective on the pr…Read more
-
1880AI language models cannot replace human research participantsAI and Society 39 (5): 2603-2605. 2024.In a recent letter, Dillion et. al (2023) make various suggestions regarding the idea of artificially intelligent systems, such as large language models, replacing human subjects in empirical moral psychology. We argue that human subjects are in various ways indispensable.
-
325Everettian Quantum Mechanics and the Metaphysics of ModalityBritish Journal for the Philosophy of Science 72 (4): 939-964. 2021.This article sits at a point of intersection between the philosophy of physics and the metaphysics of modality. There are clear similarities between Everettian quantum mechanics and various modal metaphysical theories, but there have hitherto been few attempts at exploring how the two topics relate. In this article, I build on a series of recent papers by Wilson ([2011], [2012], [2013]), who argues that Everettian quantum mechanics’ connections with traditional modal metaphysics are vital in def…Read more