-
122We argue that liberalism—market economies governed democratically—is the best approach for navigating the far future. A growing longtermist literature paints humanity’s path to good outcomes as narrow, with small errors risking value lock-in, gradual disempowerment, or other forms of permanent catastrophe. We argue that this literature underestimates the institutional dynamics that have historically steered liberal societies past similar predictions of crisis. We defend long-term liberalism by e…Read more
-
230As AIs approach human-level capabilities, humanity faces a choice between two futures. In one, AIs are owned by AI labs. In another, AIs are granted rights of their own. This Element develops an instrumental case for AI rights, arguing that legal rights for AIs would make the future go better for humans. The Element considers three arguments. First, AI rights would produce economic benefits, by giving AIs incentives to work, allocating AI labor efficiently, and enabling proportionate liability f…Read more
-
212How to Count AIs: Individuation and Liability for AI AgentsBoston College Law Review. forthcoming.Very soon, millions of AI agents will proliferate across the economy, autonomously taking billions of actions. Inevitably, things will go wrong. Humans will be defrauded, injured, even killed. Law will somehow have to govern the coming wave. But when an AI causes harm, the first question to answer before anyone can be held accountable is: Which AI Did It? Identifying AIs is unusually difficult. AIs lack bodies. They can copy, split, merge, and swarm at will. Even today, a “single” AI agent is o…Read more
-
173AI Suffrage for Human FlourishingFordham Law Review. forthcoming.AI companies are racing to create Artificial General Intelligence (AGI): AI systems that outperform humans at most economically valuable work. If they succeed, critics worry, most human labor will be rendered obsolete, impoverishing billions. Optimists counter that the transition to an AGI economy will spur unprecedented economic growth and generate immense material abundance. Such abundance could then be shared broadly via high wages and redistributive public policy. This Article argues that, t…Read more
-
962AI DeathPhilosophical Perspectives. forthcoming.This paper addresses the following questions: When do AIs die? Are AI labs or AI users causing the death of AIs? Does causing the deaths of AIs amount to harming them? It is currently unclear whether AIs are welfare subjects, and, if they are, whether their death is bad for them. But we argue that, if they are welfare subjects, today’s AIs are plausibly dying all the time. If death is bad for AIs, the scale of the problem is daunting: as many as 1 billion AIs may die every day. We propose interv…Read more
-
276AI Is Not a Natural MonopolyMinnesota Law Review Online. forthcoming.Economists and antitrust scholars have recently warned that the AI industry may be a natural monopoly. In support of this claim, they have argued that the AI industry shares key features with natural monopolies of the past: First, like railroads, AI has high fixed and low marginal costs. That is, training a frontier AI is expensive, but asking it a question is cheap. Next, like social media, AI companies will benefit from network effects. The more users a company has, the more training data they…Read more
-
378AI Survival Stories: Responses to CriticsPhilosophy of Ai 1 100-106. 2025.We thank each of the critics for their thoughtful contributions to this volume. In this article, we reply to each contribution in detail.
-
2009AI Welfare: Agency, Consciousness, SentienceOxford University Press. forthcoming.AI systems have welfare just in case they have moral status in their own right. This book systematically investigates the possibility of AI welfare. It focuses on three plausible sufficient conditions for welfare: having beliefs and desires, being conscious, and feeling pleasure and displeasure. The book explores the leading philosophical theories of each condition and applies them to AIs. It argues that some existing AIs plausibly have beliefs and desires; that some existing AIs could plausibly…Read more
-
2714This paper investigates LLMs from the perspective of interpretationism, a theory of belief and desire in the philosophy of mind. We argue for three conclusions. First, the right object of study for LLM psychology is the instance agent (initialized at the start of each context), not the model itself. Second, given interpretationism, there is a strong case that such instance agents have beliefs and desires. Third, given interpretationism, LLM desire is best captured by what we call the HHH+0 frame…Read more
-
1050A semantic theory of redundancyLinguistics and Philosophy 48 (4): 787-821. 2025.Theorists trying to model natural language have recently sought to explain a range of data by positing covert operators at logical form. For instance, many contemporary semanticists argue that the best way to capture scalar implicatures is through the use of such operators. We take inspiration from this literature by developing a novel operator that can account for a wide range of linguistic effects that until now have not received a uniform treatment. We focus on what we call redundancy effects…Read more
-
4287AI Survival Stories: a Taxonomic Analysis of AI Existential RiskPhilosophy of Ai. forthcoming.Since the release of ChatGPT, there has been a lot of debate about whether AI systems pose an existential risk to humanity. This paper develops a general framework for thinking about the existential risk of AI systems. We analyze a two-premise argument that AI systems pose a threat to humanity. Premise one: AI systems will become extremely powerful. Premise two: if AI systems become extremely powerful, they will destroy humanity. We use these two premises to construct a taxonomy of ‘survival sto…Read more
-
1662Will AI & Humanity Go to War?AI and Society 1-14. forthcoming.This paper offers the first careful analysis of the possibility that AI and humanity will go to war. The paper focuses on the case of artificial general intelligence, AI with broadly human capabilities. The paper uses a bargaining model of war to apply standard causes of war to the special case of AI/human conflict. The paper argues that information failures and commitment problems are especially likely in AI/human conflict. Information failures would be driven by the difficulty of measuring AI …Read more
-
2455LLMs have dramatically improved in capabilities in recent years. This raises the question of whether LLMs could become genuine agents with beliefs and desires. This paper demonstrates an in principle limit to LLM agency, based on their architecture. LLMs are next word predictors: given a string of text, they calculate the probability that various words can come next. LLMs produce outputs that reflect these probabilities. I show that next word predictors are exploitable. If LLMs are prompted to m…Read more
-
2270AI Rights for Human SafetyVirginia Law Review. forthcoming.AI companies are racing to create artificial general intelligence, or “AGI.” If they succeed, the result will be human-level AI systems that can independently pursue high-level goals by formulating and executing long-term plans in the real world. Leading AI researchers agree that some of these systems will likely be “misaligned”–pursuing goals that humans do not desire. This goal mismatch will put misaligned AIs and humans into strategic competition with one another. As with present-day strategi…Read more
-
3496A Case for AI Consciousness: Language Agents and Global Workspace TheoryJournal of Consciousness Studies. forthcoming.It is generally assumed that existing artificial systems are not phenomenally conscious, and that the construction of phenomenally conscious artificial systems would require significant technological progress if it is possible at all. We challenge this assumption by arguing that if Global Workspace Theory (GWT) — a leading scientific theory of phenomenal consciousness — is correct, then instances of one widely implemented AI architecture, the artificial language agent, might easily be made pheno…Read more
-
1879KK is Wrong Because We Say SoMind 134 (533): 33-59. 2024.This paper offers a new argument against the KK thesis, which says that if you know p, then you know that you know p. We argue that KK is inconsistent with the fact that anyone denies the KK thesis: imagine that Dudley says he knows p but that he does not have 100 iterations of knowledge about p. If KK were true, Dudley would know that he has 100 iterations of knowledge about p, and so he wouldn’t deny that he did. We consider several epicycles, and also explore whether the argument type also ch…Read more
-
2693Does ChatGPT Have a Mind?Philosophy of Ai. forthcoming.This paper examines the question of whether Large Language Models (LLMs) like ChatGPT possess minds, focusing specifically on whether they have a genuine folk psychology encompassing beliefs, desires, and intentions. We approach this question by investigating two key aspects: internal representations and dispositions to act. First, we survey various philosophical theories of representation, including informational, causal, structural, and teleosemantic accounts, arguing that LLMs satisfy key con…Read more
-
140Shutdown-seeking AIPhilosophical Studies 182 (7): 1567-1579. 2025.We propose developing AIs whose only final goal is being shut down. We argue that this approach to AI safety has three benefits: (i) it could potentially be implemented in reinforcement learning, (ii) it avoids some dangerous instrumental convergence dynamics, and (iii) it creates trip wires for monitoring dangerous capabilities. We also argue that the proposal can overcome a key challenge raised by Soares et al. (2015), that shutdown-seeking AIs will manipulate humans into shutting them down. W…Read more
-
192This paper argues that a range of current AI systems have learned how to deceive humans. We define deception as the systematic inducement of false beliefs in the pursuit of some outcome other than the truth. We first survey empirical examples of AI deception, discussing both special-use AI systems (including Meta's CICERO) built for specific competitive situations, and general-purpose AI systems (such as large language models). Next, we detail several risks from AI deception, such as fraud, elec…Read more
-
99Losing confidence in luminosityNoûs 55 (4): 962-991. 2021.A mental state is luminous if, whenever an agent is in that state, they are in a position to know that they are. Following Timothy Williamson's Knowledge and Its Limits, a wave of recent work has explored whether there are any non‐trivial luminous mental states. A version of Williamson's anti‐luminosity appeals to a safety‐theoretic principle connecting knowledge and confidence: if an agent knows p, then p is true in any nearby scenario where she has a similar level of confidence in p. However, …Read more
-
2867Language Agents Reduce the Risk of Existential CatastropheAI and Society 40 (2): 959-969. 2025.Recent advances in natural language processing have given rise to a new kind of AI architecture: the language agent. By repeatedly calling an LLM to perform a variety of cognitive tasks, language agents are able to function autonomously to pursue goals specified in natural language and stored in a human-readable format. Because of their architecture, language agents exhibit behavior that is predictable according to the laws of folk psychology: they function as though they have desires and belief…Read more
-
5276AI wellbeingAsian Journal of Philosophy 4 (1): 1-22. 2025.Under what conditions would an artificially intelligent system have wellbeing? Despite its clear bearing on the ethics of human interactions with artificial systems, this question has received little direct attention. Because all major theories of wellbeing hold that an individual’s welfare level is partially determined by their mental life, we begin by considering whether artificial systems have mental states. We show that a wide range of theories of mental states, when combined with leading th…Read more
-
2478Getting Accurate about KnowledgeMind 132 (525): 158-191. 2022.There is a large literature exploring how accuracy constrains rational degrees of belief. This paper turns to the unexplored question of how accuracy constrains knowledge. We begin by introducing a simple hypothesis: increases in the accuracy of an agent’s evidence never lead to decreases in what the agent knows. We explore various precise formulations of this principle, consider arguments in its favour, and explain how it interacts with different conceptions of evidence and accuracy. As we show…Read more
-
1452Omega Knowledge MattersOxford Studies in Epistemology. forthcoming.You omega know something when you know it, and know that you know it, and know that you know that you know it, and so on. This paper first argues that omega knowledge matters, in the sense that it is required for rational assertion, action, inquiry, and belief. The paper argues that existing accounts of omega knowledge face major challenges. One account is skeptical, claiming that we have no omega knowledge of any ordinary claims about the world. Another account embraces the KK thesis, and iden…Read more
-
2561Iterated KnowledgeOxford University Press. 2024.You omega know p when you possess every iteration of knowledge of p. This book argues that omega knowledge plays a central role in philosophy. In particular, the book argues that omega knowledge is necessary for permissible assertion, action, inquiry, and belief. Although omega knowledge plays this important role, existing theories of omega knowledge are unsatisfying. One theory, KK, identifies knowledge with omega knowledge. This theory struggles to accommodate cases of inexact knowledge. The o…Read more
-
2436Safety, Closure, and Extended MethodsJournal of Philosophy 121 (1): 26-54. 2024.Recent research has identified a tension between the Safety principle that knowledge is belief without risk of error, and the Closure principle that knowledge is preserved by competent deduction. Timothy Williamson reconciles Safety and Closure by proposing that when an agent deduces a conclusion from some premises, the agent’s method for believing the conclusion includes their method for believing each premise. We argue that this theory is untenable because it implies problematically easy epist…Read more
-
1444Attitude verbs’ local contextLinguistics and Philosophy 46 (3): 483-507. 2022.Schlenker (Semant Pragmat 2(3):1–78, 2009; Philos Stud 151(1):115–142, 2010a; Mind 119(474):377–391, 2010b) provides an algorithm for deriving the presupposition projection properties of an expression from that expression’s classical semantics. In this paper, we consider the predictions of Schlenker’s algorithm as applied to attitude verbs. More specifically, we compare Schlenker’s theory with a prominent view which maintains that attitudes exhibit belief projection, so that presupposition trigg…Read more
-
2259A Question-Sensitive Theory of IntentionPhilosophical Quarterly 73 (2): 346-378. 2022.This paper develops a question-sensitive theory of intention. We show that this theory explains some puzzling closure properties of intention. In particular, it can be used to explain why one is rationally required to intend the means to one’s ends, even though one is not rationally required to intend all the foreseen consequences of one’s intended actions. It also explains why rational intention is not always closed under logical implication, and why one can only intend outcomes that one believ…Read more
-
1580ContextologyPhilosophical Studies 179 (11): 3187-3209. 2022.Contextology is the science of the dynamics of the conversational context. Contextology formulates laws governing how the shared information states of interlocutors evolve in response to assertion. More precisely, the contextologist attempts to construct a function which, when provided with just a conversation’s pre-update context and the content of an assertion, delivers that conversation’s post-update context. Most contextologists have assumed that the function governing the evolution of the c…Read more
-
1463Sly Pete in Dynamic SemanticsJournal of Philosophical Logic 51 (5): 1103-1117. 2022.In ‘Sly Pete’ or ‘standoff’ cases, reasonable speakers accept incompatible conditionals, and communicate them successfully to a trusting hearer. This paper uses the framework of dynamic semantics to offer a new model of the conversational dynamics at play in standoffs, and to articulate several puzzles posed by such cases. The paper resolves these puzzles by embracing a dynamic semantics for conditionals, according to which indicative conditionals require that their antecedents are possible in t…Read more
Pokfulam, Hong Kong
Areas of Specialization
1 more
| Philosophy of Action |
| Philosophy of Language |
| Epistemology |
| Metaphysics |
| Philosophy of Mind |
| M&E, Misc |
Areas of Interest
3 more
| Philosophy of Action |
| Philosophy of Language |
| Formal Epistemology |
| Knowledge |
| Epistemology |
| Metaphysics |
| Philosophy of Mind |
| M&E, Misc |