•  45
    We consider the questions of whether or not large language models (LLMs) have beliefs, and, if they do, how we might measure them. First, we consider whether or not we should expect LLMs to have something like beliefs in the first place. We consider some recent arguments aiming to show that LLMs cannot have beliefs. We show that these arguments are misguided. We provide a more productive framing of questions surrounding the status of beliefs in LLMs, and highlight the empirical nature of the pro…Read more
  •  54
    Review of Cameron J. Buckner’s From Deep Learning to Rational Machines: What the History of Philosophy Can Teach Us about the Future of Artificial Intelligence - Cameron J. Buckner, From Deep Learning to Rational Machines: What the History of Philosophy Can Teach Us about the Future of Artificial Intelligence. Oxford University Press. - Volume 92 Issue 4.
  •  77
    Deference and Decision
    Theory and Decision 1-27. forthcoming.
    Consider two principles of rational decision making. One says that you should never use the value of events that are inconsistent with your evidence to assess what to do. The other says that, if you know what a more informed version of yourself would do, then you should already do likewise with your current information. We show that no decision theory which agrees with evidential decision theory (EDT) and causal decision theory (CDT) whenever they agree can satisfy both principles. In particular…Read more
  •  94
    Invention and Evolution of Correlated Conventions
    British Journal for the Philosophy of Science 76 (1): 223-241. 2025.
    An important feature of many conventions is that the agents use an asymmetry to coordinate their behaviour. We call these ‘correlated conventions’. However, a puzzle arises: since any asymmetry works as well as any other, what are the relevant asymmetries on which a given population founds its correlated conventions? In order to gain traction on this question we need an account of both the invention and evolution of correlated conventions. Invention has remained largely unexplored in the literat…Read more
  •  1554
    Standards for Belief Representations in LLMs
    Minds and Machines 35 (1): 1-25. 2024.
    As large language models (LLMs) continue to demonstrate remarkable abilities across various domains, computer scientists are developing methods to understand their cognitive processes, particularly concerning how (and if) LLMs internally represent their beliefs about the world. However, this field currently lacks a unified theoretical foundation to underpin the study of belief in LLMs. This article begins filling this gap by proposing adequacy conditions for a representation in an LLM to count a…Read more
  •  288
    We consider the questions of whether or not large language models (LLMs) have beliefs, and, if they do, how we might measure them. First, we consider whether or not we should expect LLMs to have something like beliefs in the first place. We consider some recent arguments aiming to show that LLMs cannot have beliefs. We show that these arguments are misguided. We provide a more productive framing of questions surrounding the status of beliefs in LLMs, and highlight the empirical nature of the pro…Read more
  •  806
    Naturalizing Natural Salience
    British Journal for the Philosophy of Science. forthcoming.
    Grice, Lewis, and Skyrms proposed similar distinctions between kinds of meaning. The meaning of terms in human language, as Lewis and Skyrms had it, is ‘conventional’. Skyrms presented models showing how it is possible for conventional meaning to evolve in a population without reliance on pre-existing meaning. But one might think of conventionality as coming in degrees, based on whether the evolutionary process begins with ‘natural saliences’. We propose a theory of natural salience and several …Read more
  •  125
    We often need to have beliefs about things on which we are not experts. Luckily, we often have access to expert judgements on such topics. But how should we form our beliefs on the basis of expert opinion when experts conflict in their judgments? This is the core of the novice/2-expert problem in social epistemology. A closely related question is important in the context of policy making: how should a policy maker use expert judgments when making policy in domains in which she is not herself an …Read more
  •  95
    Sifting the Signal from the Noise
    British Journal for the Philosophy of Science 76 (3): 745-758. 2025.
    Signalling games are useful for understanding how language emerges. In the standard models, the dynamics in some sense already know what the signals are, even if they do not yet have meaning. In this article, we relax this assumption and develop a simple model we call an ‘attention game’, in which agents have to learn which feature of their environment is the signal. We demonstrate that simple reinforcement learning agents can still learn to coordinate in contexts where the agents do not already…Read more
  •  140
    PAC Learning and Occam’s Razor: Probably Approximately Incorrect
    Philosophy of Science 87 (4): 685-703. 2020.
    Computer scientists have provided a distinct justification of Occam’s Razor. Using the probably approximately correct framework, they provide a theorem that they claim demonstrates that we should favor simpler hypotheses. The argument relies on a philosophical interpretation of the theorem. I argue that the standard interpretation of the result in the literature is misguided and that a better reading does not, in fact, support Occam’s Razor at all. To this end, I state and prove a very similar t…Read more