-
45Still no lie detector for language models: probing empirical and conceptual roadblocks: Still no lie detector for language models..Philosophical Studies 182 (7): 1539-1565. 2024.We consider the questions of whether or not large language models (LLMs) have beliefs, and, if they do, how we might measure them. First, we consider whether or not we should expect LLMs to have something like beliefs in the first place. We consider some recent arguments aiming to show that LLMs cannot have beliefs. We show that these arguments are misguided. We provide a more productive framing of questions surrounding the status of beliefs in LLMs, and highlight the empirical nature of the pro…Read more
-
96Tickles, iteration, and habitsTheory and Decision 100 (2): 531-559. 2026.At first pass, Evidential Decision Theory (EDT) recommends one-boxing in Newcomb’s Problem and Causal Decision Theory (CDT) recommends two-boxing. However, it has been acknowledged that concrete instances of the problem have messy features complicating their analyses. Recently, a third competitor, Functional Decision Theory (FDT) has emerged recommending one-boxing in some versions and two-boxing in others. This paper explores the verdicts of these competing theories in a few variations of the p…Read more
-
1554Standards for Belief Representations in LLMsMinds and Machines 35 (1): 1-25. 2024.As large language models (LLMs) continue to demonstrate remarkable abilities across various domains, computer scientists are developing methods to understand their cognitive processes, particularly concerning how (and if) LLMs internally represent their beliefs about the world. However, this field currently lacks a unified theoretical foundation to underpin the study of belief in LLMs. This article begins filling this gap by proposing adequacy conditions for a representation in an LLM to count a…Read more
-
931Bigger, Badder BugsMind 134 (533): 134-170. 2025.In this paper we motivate the ‘principles of trust’, chance-credence principles that are strictly stronger than the New Principle yet strictly weaker than the Principal Principle, and argue, by proving some limitative results, that the principles of trust conflict with Humean Supervenience.
-
2690Does ChatGPT Have a Mind?Philosophy of Ai. forthcoming.This paper examines the question of whether Large Language Models (LLMs) like ChatGPT possess minds, focusing specifically on whether they have a genuine folk psychology encompassing beliefs, desires, and intentions. We approach this question by investigating two key aspects: internal representations and dispositions to act. First, we survey various philosophical theories of representation, including informational, causal, structural, and teleosemantic accounts, arguing that LLMs satisfy key con…Read more
-
837Evidential Decision Theory and the OstrichPhilosophers' Imprint 24 (1). 2024.Evidential Decision Theory is flawed, but its flaws are not fully understood. David Lewis (1981) famously charged that EDT recommends an irrational policy of managing the news and “commends the ostrich as rational”. Lewis was right, but the case he appealed to—Newcomb’s Problem—does not demonstrate his conclusion. Indeed, decision theories other than EDT, such as Committal Decision Theory and Functional Decision Theory, agree with EDT's verdicts in Newcomb’s Problem, but their flaws, whatever th…Read more
-
288Still no lie detector for language models: probing empirical and conceptual roadblocksPhilosophical Studies 182 (7). 2025.We consider the questions of whether or not large language models (LLMs) have beliefs, and, if they do, how we might measure them. First, we consider whether or not we should expect LLMs to have something like beliefs in the first place. We consider some recent arguments aiming to show that LLMs cannot have beliefs. We show that these arguments are misguided. We provide a more productive framing of questions surrounding the status of beliefs in LLMs, and highlight the empirical nature of the pro…Read more
-
1534Probability and Informed ConsentTheoretical Medicine and Bioethics 44 (6): 545-566. 2023.In this paper, we illustrate some serious difficulties involved in conveying information about uncertain risks and securing informed consent for risky interventions in a clinical setting. We argue that in order to secure informed consent for a medical intervention, physicians often need to do more than report a bare, numerical probability value. When probabilities are given, securing informed consent generally requires communicating how probability expressions are to be interpreted and communica…Read more
-
230Decision Theory without LuminosityMind 133 (530): 346-376. 2023.Our decision-theoretic states are not luminous. We are imperfectly reliable at identifying our own credences, utilities and available acts, and thus can never be more than imperfectly reliable at identifying the prescriptions of decision theory. The lack of luminosity affords decision theory a remarkable opportunity — to issue guidance on the basis of epistemically inaccessible facts. We show how a decision theory can guarantee action in accordance with contingent truths about which an agent is …Read more
-
254Accuracy, Deference, and ChancePhilosophical Review 132 (1): 43-87. 2023.Chance both guides our credences and is an objective feature of the world. How and why we should conform our credences to chance depends on the underlying metaphysical account of what chance is. I use considerations of accuracy (how close your credences come to truth-values) to propose a new way of deferring to chance. The principle I endorse, called the Trust Principle, requires chance to be a good guide to the world, permits modest chances, tells us how to listen to chance even when the chance…Read more
-
2081Deference Done BetterPhilosophical Perspectives 35 (1): 99-150. 2021.There are many things—call them ‘experts’—that you should defer to in forming your opinions. The trouble is, many experts are modest: they’re less than certain that they are worthy of deference. When this happens, the standard theories of deference break down: the most popular (“Reflection”-style) principles collapse to inconsistency, while their most popular (“New-Reflection”-style) variants allow you to defer to someone while regarding them as an anti-expert. We propose a middle way: deferring…Read more
-
137Strict propriety is weakAnalysis 81 (1): 8-13. 2021.Considerations of accuracy – the epistemic good of having credences close to truth-values – have led to the justification of a host of epistemic norms. These arguments rely on specific ways of measuring accuracy. In particular, the accuracy measure should be strictly proper. However, the main argument for strict propriety supports only weak propriety. But strict propriety follows from weak propriety given strict truth directedness and additivity. So no further argument is necessary.
-
1773Act Consequentialism without Free RidesPhilosophical Perspectives 34 (1): 88-116. 2020.Consequentialist theories determine rightness solely based on real or expected consequences. Although such theories are popular, they often have difficulty with generalizing intuitions, which demand concern for questions like “What if everybody did that?” Rule consequentialism attempts to incorporate these intuitions by shifting the locus of evaluation from the consequences of acts to those of rules. However, detailed rule-consequentialist theories seem ad hoc or arbitrary compared to act conseq…Read more
-
250Cheating Death in DamascusJournal of Philosophy 117 (5): 237-266. 2020.Evidential Decision Theory and Causal Decision Theory are the leading contenders as theories of rational action, but both face counterexamples. We present some new counterexamples, including one in which the optimal action is causally dominated. We also present a novel decision theory, Functional Decision Theory, which simultaneously solves both sets of counterexamples. Instead of considering which physical action of theirs would give rise to the best outcomes, FDT agents consider which output o…Read more
-
221Imprecise Epistemic Values and Imprecise CredencesAustralasian Journal of Philosophy 97 (4): 741-760. 2019.A number of recent arguments purport to show that imprecise credences are incompatible with accuracy-first epistemology. If correct, this conclusion suggests a conflict between evidential a...
-
631The Foundations of Epistemic Decision TheoryMind 128 (509): 69-107. 2019.According to accuracy-first epistemology, accuracy is the fundamental epistemic good. Epistemic norms — Probabilism, Conditionalization, the Principal Principle, etc. — have their binding force in virtue of helping to secure this good. To make this idea precise, accuracy-firsters invoke Epistemic Decision Theory (EpDT) to determine which epistemic policies are the best means toward the end of accuracy. Hilary Greaves and others have recently challenged the tenability of this programme. Their arg…Read more
-
943An objection of varying importance to epistemic utility theoryPhilosophical Studies 176 (11): 2919-2931. 2019.Some propositions are more epistemically important than others. Further, how important a proposition is is often a contingent matter—some propositions count more in some worlds than in others. Epistemic Utility Theory cannot accommodate this fact, at least not in any standard way. For EUT to be successful, legitimate measures of epistemic utility must be proper, i.e., every probability function must assign itself maximum expected utility. Once we vary the importance of propositions across worlds…Read more
-
2337A Pragmatist’s Guide to Epistemic UtilityPhilosophy of Science 84 (4): 613-638. 2017.We use a theorem from M. J. Schervish to explore the relationship between accuracy and practical success. If an agent is pragmatically rational, she will quantify the expected loss of her credence with a strictly proper scoring rule. Which scoring rule is right for her will depend on the sorts of decisions she expects to face. We relate this pragmatic conception of inaccuracy to the purely epistemic one popular among epistemic utility theorists.
-
1107Accuracy Uncomposed: Against CalibrationismEpisteme 14 (1): 59-69. 2017.Pettigrew offers new axiomatic constraints on legitimate measures of inaccuracy. His axiom called ‘Decomposition’ stipulates that legitimate measures of inaccuracy evaluate a credence function in part based on its level of calibration at a world. I argue that if calibration is valuable, as Pettigrew claims, then this fact is an explanandum for accuracy-rst epistemologists, not an explanans, for three reasons. First, the intuitive case for the importance of calibration isn’t as strong as Pettigr…Read more
-
198Leitgeb and Pettigrew on Accuracy and UpdatingPhilosophy of Science 79 (3): 413-424. 2012.Leitgeb and Pettigrew argue that (1) agents should minimize the expected inaccuracy of their beliefs and (2) inaccuracy should be measured via the Brier score. They show that in certain diachronic cases, these claims require an alternative to Jeffrey Conditionalization. I claim that this alternative is an irrational updating procedure and that the Brier score, and quadratic scoring rules generally, should be rejected as legitimate measures of inaccuracy.
-
193With All Due Respect: The Macro-Epistemology of DisagreementPhilosophers' Imprint 15. 2015.In this paper, I develop a new kind of conciliatory answer to the problem of peer disagreement. Instead of trying to guide an agent’s updating behaviour in any particular disagreement, I establish constraints on an agent’s expected behaviour and argue that, in the long run, she should tend to be conciliatory toward her peers. I first claim that this macro-approach affords us new conceptual insight on the problem of peer disagreement and provides an important angle complementary to the standard m…Read more
-
1663Permissive Rationality and SensitivityPhilosophy and Phenomenological Research 94 (2): 342-370. 2017.Permissivism about rationality is the view that there is sometimes more than one rational response to a given body of evidence. In this paper I discuss the relationship between permissivism, deference to rationality, and peer disagreement. I begin by arguing that—contrary to popular opinion—permissivism supports at least a moderate version of conciliationism. I then formulate a worry for permissivism. I show that, given a plausible principle of rational deference, permissive rationality seems to…Read more
-
AnthropicOther
-
-
Urbana and Champaign, Illinois, United States of America
Areas of Specialization
| Metaphysics and Epistemology |
| The Nature of Artificial Intelligence |
| Value Theory |
| Science, Logic, and Mathematics |
Areas of Interest
| Metaphysics and Epistemology |
| Value Theory |
| Science, Logic, and Mathematics |