-
545The individuation problem for large language models asks which entities associated with them, if any, should be identified as minds. We approach this problem through mechanistic interpretability, engaging in particular with recent empirical work on persona vectors, persona space, and emergent misalignment. We argue that three views are the strongest candidates: the virtual instance view and two new views we introduce, the (virtual) instance-persona view and the model-persona view. First, we arg…Read more
-
732Testing for consciousness in current AIIn Walter Sinnott-Armstrong & Liad Mudrik (eds.), Tests of Consciousness: How to tell whether a human, other animal or AI is conscious and what they are conscious of. forthcoming.
-
546Are any machines conscious today?In Calum Chace (ed.), Perspectives on Machine Consciousness. forthcoming.
-
871Desire in AIIn Alex Gregory (ed.), The Routledge Handbook on the Philosophy of Desire, Routledge. forthcoming.Chatbots seem to express desires and many AI systems arguably pursue goals, such as winning games or helping users. Furthermore, many AI systems are trained by reinforcement learning and similar forms of learning are associated with human and animal desires. This chapter considers arguments for and against the attribution of desires to various AI systems and discusses some implications. Cases from AI can also help us to evaluate possible sets of conditions for desire.
-
23Higher-order representation in AIPhilosophy and the Mind Sciences 7 (1). 2026.Higher-order representations are those that are about other representations. Humans and some other animals form higher-order mental representations concerning representations in our own minds, through the operation of processes of metacognition and introspection. These have been linked with a wide range of mental capacities and attributes, including consciousness. Recent research on large language models (LLMs) has explored their knowledge of their own ‘minds’, sometimes suggesting that these mo…Read more
-
188Rapid progress in artificial intelligence (AI) capabilities has drawn fresh attention to the prospect of consciousness in AI. There is an urgent need for rigorous methods to assess AI systems for consciousness, but significant uncertainty about relevant issues in consciousness science. We present a method for assessing AI systems for consciousness that involves exploring what follows from existing or future neuroscientific theories of consciousness. Indicators derived from such theories can be u…Read more
-
70AI AssertionErgo: An Open Access Journal of Philosophy 12 (n/a): 968-988. 2025.Modern generative AI systems have shown the capacity to produce remarkably fluent language, prompting debates both about their semantic understanding and, less prominently, about whether they can perform speech acts. This paper addresses the latter question, focusing on assertion. We argue that to be capable of assertion, an entity must meet two requirements: it must produce outputs with descriptive functions, and it must be capable of being sanctioned by agents with which it interacts. The seco…Read more
-
770In this report, we argue that there is a realistic possibility that some AI systems will be conscious and/or robustly agentic in the near future. That means that the prospect of AI welfare and moral patienthood — of AI systems with their own interests and moral significance — is no longer an issue only for sci-fi or the distant future. It is an issue for the near future, and AI companies and other actors have a responsibility to start taking it seriously. We also recommend three early step…Read more
-
154The agency in language agentsInquiry: An Interdisciplinary Journal of Philosophy. forthcoming.Language agents are AI systems that combine large language models with other elements to facilitate interaction with an environment. They include LLM-based chatbots but can have a wide range of additional features to support learning, reasoning and decision-making. Goldstein and Kirk-Giannini. Citationm.s. [AI wellbeing] argue that some language agents have beliefs and desires, but it is not obvious that they are agents at all, since they select outputs by querying language models. This paper in…Read more
-
178Reinforcement learning and artificial agencyMind and Language 39 (1): 22-38. 2024.There is an apparent connection between reinforcement learning and agency. Artificial entities controlled by reinforcement learning algorithms are standardly referred to as agents, and the mainstream view in the psychology and neuroscience of agency is that humans and other animals are reinforcement learners. This article examines this connection, focusing on artificial reinforcement learning systems and assuming that there are various forms of agency. Artificial reinforcement learning systems s…Read more
-
294Sharing Our Concepts with MachinesErkenntnis 88 (7): 3079-3095. 2021.As AI systems become increasingly competent language users, it is an apt moment to consider what it would take for machines to understand human languages. This paper considers whether either language models such as GPT-3 or chatbots might be able to understand language, focusing on the question of whether they could possess the relevant concepts. A significant obstacle is that systems of both kinds interact with the world only through text, and thus seem ill-suited to understanding utterances co…Read more
-
409AI AssertionErgo: An Open Access Journal of Philosophy. 2023.Modern generative AI systems have shown the capacity to produce remarkably fluent language, prompting debates both about their semantic understanding and, less prominently, about whether they can perform speech acts. This paper addresses the latter question, focusing on assertion. We argue that to be capable of assertion, an entity must meet two requirements: it must produce outputs with descriptive functions, and it must be capable of being sanctioned by agents with which it interacts. The seco…Read more
-
166Machine Learning, Functions and GoalsCroatian Journal of Philosophy 22 (66): 351-370. 2022.Machine learning researchers distinguish between reinforcement learning and supervised learning and refer to reinforcement learning systems as “agents”. This paper vindicates the claim that systems trained by reinforcement learning are agents while those trained by supervised learning are not. Systems of both kinds satisfy Dretske’s criteria for agency, because they both learn to produce outputs selectively in response to inputs. However, reinforcement learning is sensitive to the instrumental v…Read more
-
109Affective Experience and Evidence for Animal ConsciousnessPhilosophical Topics 48 (1): 109-127. 2020.Affective experience in nonhuman animals is of great interest for both theoretical and practical reasons. This paper highlights research by the psychologists Anthony Dickinson and Bernard Balleine which provides particularly good evidence of conscious affective experience in rats. This evidence is compelling because it implicates a sophisticated system for goal-directed action selection, and demonstrates a contrast between apparently conscious and unconscious evaluative representations with simi…Read more
-
120Cognitive Models Are Distinguished by Content, Not FormatPhilosophy of Science 88 (1): 83-102. 2021.Cognitive scientists often describe the mind as constructing and using models of aspects of the environment, but it is not obvious what makes something a model as opposed to a mere representation....
-
127Directive ContentPacific Philosophical Quarterly 102 (1): 2-26. 2020.Representations may have descriptive content, directive content, or both, but little explicit attention has been given to the problem of distinguishing representations of these three kinds. We do not know, for instance, what determines whether a given representation is a directive instructing its consumer to perform some action or has descriptive content to the effect that the action in question has a certain value. This paper considers what it takes for a representation to have directive conten…Read more
-
153Representation and the active consumerSynthese 197 (10): 4533-4550. 2020.One of the central tasks for naturalistic theories of representation is to say what it takes for something to be a representation, and some leading theories have been criticised for being too liberal. Prominent discussions of this problem have proposed a producer-oriented solution; it is argued that representations must be produced by systems employing perceptual constancy mechanisms. However, representations may be produced by simple transducers if they are consumed in the right way. It is char…Read more
-
115Why Hunger is not a DesireReview of Philosophy and Psychology 8 (3): 617-635. 2017.This paper presents an account of the nature of desire, informed by psychology and neuroscience, which entails that hunger is not a desire. The account is contrasted with Schroeder’s well-known empirically-informed theory of desire. It is argued that one significant virtue of the present account, in comparison with Schroeder’s theory, is that it draws a sharp distinction between desires and basic drives, such as the drive for food. One reason to draw this distinction is that experiments on incen…Read more
Patrick Butlin
Eleos AI Research