Simon Goldstein (University of Hong Kong): Publications

76

AI Deception: A Survey of Examples, Risks, and Potential Solutions
with Peter Park, Aidan O'Gara, Michael Chen, and Dan Hendrycks

This paper argues that a range of current AI systems have learned how to deceive humans. We define deception as the systematic inducement of false beliefs in the pursuit of some outcome other than the truth. We first survey empirical examples of AI deception, discussing both special-use AI systems (including Meta's CICERO) built for specific competitive situations, and general-purpose AI systems (such as large language models). Next, we detail several risks from AI deception, such as fraud, elec…Read more
This paper argues that a range of current AI systems have learned how to deceive humans. We define deception as the systematic inducement of false beliefs in the pursuit of some outcome other than the truth. We first survey empirical examples of AI deception, discussing both special-use AI systems (including Meta's CICERO) built for specific competitive situations, and general-purpose AI systems (such as large language models). Next, we detail several risks from AI deception, such as fraud, election tampering, and losing control of AI systems. Finally, we outline several potential solutions to the problems posed by AI deception: first, regulatory frameworks should subject AI systems that are capable of deception to robust risk-assessment requirements; second, policymakers should implement bot-or-not laws; and finally, policymakers should prioritize the funding of relevant research, including tools to detect AI deception and to make AI systems less deceptive. Policymakers, researchers, and the broader public should work proactively to prevent AI deception from destabilizing the shared foundations of our society.

Artificial Intelligence Safety Ethics of Artificial Intelligence, Misc
11

Losing confidence in luminosity
with Daniel Waxman

Noûs 55 (4): 962-991. 2021.

A mental state is luminous if, whenever an agent is in that state, they are in a position to know that they are. Following Timothy Williamson's Knowledge and Its Limits, a wave of recent work has explored whether there are any non‐trivial luminous mental states. A version of Williamson's anti‐luminosity appeals to a safety‐theoretic principle connecting knowledge and confidence: if an agent knows p, then p is true in any nearby scenario where she has a similar level of confidence in p. However, …Read more
A mental state is luminous if, whenever an agent is in that state, they are in a position to know that they are. Following Timothy Williamson's Knowledge and Its Limits, a wave of recent work has explored whether there are any non‐trivial luminous mental states. A version of Williamson's anti‐luminosity appeals to a safety‐theoretic principle connecting knowledge and confidence: if an agent knows p, then p is true in any nearby scenario where she has a similar level of confidence in p. However, the relevant notion of confidence is relatively underexplored. This paper develops a precise theory of confidence: an agent's degree of confidence in p is the objective chance they will rely on p in practical reasoning. This theory of confidence is then used to critically evaluate the anti‐luminosity argument, leading to the surprising conclusion that although there are strong reasons for thinking that luminosity does not obtain, they are quite different from those the existing literature has considered. In particular, we show that once the notion of confidence is properly understood, the failure of luminosity follows from the assumption that knowledge requires high confidence, and does not require any kind of safety principle as a premise.

Luminosity
758

Language Agents Reduce the Risk of Existential Catastrophe
with Cameron Domenico Kirk-Giannini

AI and Society 1-11. forthcoming.

Recent advances in natural language processing have given rise to a new kind of AI architecture: the language agent. By repeatedly calling an LLM to perform a variety of cognitive tasks, language agents are able to function autonomously to pursue goals specified in natural language and stored in a human-readable format. Because of their architecture, language agents exhibit behavior that is predictable according to the laws of folk psychology: they function as though they have desires and belief…Read more
Recent advances in natural language processing have given rise to a new kind of AI architecture: the language agent. By repeatedly calling an LLM to perform a variety of cognitive tasks, language agents are able to function autonomously to pursue goals specified in natural language and stored in a human-readable format. Because of their architecture, language agents exhibit behavior that is predictable according to the laws of folk psychology: they function as though they have desires and beliefs, and then make and update plans to pursue their desires given their beliefs. We argue that the rise of language agents significantly reduces the probability of an existential catastrophe due to loss of control over an AGI. This is because the probability of such an existential catastrophe is proportional to the difficulty of aligning AGI systems, and language agents significantly reduce that difficulty. In particular, language agents help to resolve three important issues related to aligning AIs: reward misspecification, goal misgeneralization, and uninterpretability.

Natural Language Processing Machine Learning Artificial Intelligence Safety Existential Risk
1401

AI Wellbeing
with Cameron Domenico Kirk-Giannini

Under what conditions would an artificially intelligent system have wellbeing? Despite its obvious bearing on the ethics of human interactions with artificial systems, this question has received little attention. Because all major theories of wellbeing hold that an individual’s welfare level is partially determined by their mental life, we begin by considering whether artificial systems have mental states. We show that a wide range of theories of mental states, when combined with leading theorie…Read more
Under what conditions would an artificially intelligent system have wellbeing? Despite its obvious bearing on the ethics of human interactions with artificial systems, this question has received little attention. Because all major theories of wellbeing hold that an individual’s welfare level is partially determined by their mental life, we begin by considering whether artificial systems have mental states. We show that a wide range of theories of mental states, when combined with leading theories of wellbeing, predict that certain existing artificial systems have wellbeing. While we do not claim to demonstrate conclusively that AI systems have wellbeing, we argue that our metaphysical and moral uncertainty about AI wellbeing requires us dramatically to reassess our relationship with the intelligent systems we create.

Perfectionist Accounts of Well-Being Objective Accounts of Well-Being Desire Satisfaction Accounts of …Read more
Perfectionist Accounts of Well-Being Objective Accounts of Well-Being Desire Satisfaction Accounts of Well-Being Hedonist Accounts of Well-Being
911

Getting Accurate about Knowledge
with Sam Carter

Mind 132 (525): 158-191. 2022.

There is a large literature exploring how accuracy constrains rational degrees of belief. This paper turns to the unexplored question of how accuracy constrains knowledge. We begin by introducing a simple hypothesis: increases in the accuracy of an agent’s evidence never lead to decreases in what the agent knows. We explore various precise formulations of this principle, consider arguments in its favour, and explain how it interacts with different conceptions of evidence and accuracy. As we show…Read more
There is a large literature exploring how accuracy constrains rational degrees of belief. This paper turns to the unexplored question of how accuracy constrains knowledge. We begin by introducing a simple hypothesis: increases in the accuracy of an agent’s evidence never lead to decreases in what the agent knows. We explore various precise formulations of this principle, consider arguments in its favour, and explain how it interacts with different conceptions of evidence and accuracy. As we show, the principle has some noteworthy consequences for the wider theory of knowledge. First, it implies that an agent cannot be justified in believing a set of mutually inconsistent claims. Second, it implies the existence of a kind of epistemic blindspot: it is not possible to know that one’s evidence is misleading.

Formal Epistemology Evidence and Knowledge Epistemic Internalism and Externalism Justification, Misc Pri…Read more
Formal Epistemology Evidence and Knowledge Epistemic Internalism and Externalism Justification, Misc Principles of Knowledge, Misc
327

Omega Knowledge Matters
Oxford Studies in Epistemology. forthcoming.

You omega know something when you know it, and know that you know it, and know that you know that you know it, and so on. This paper first argues that omega knowledge matters, in the sense that it is required for rational assertion, action, inquiry, and belief. The paper argues that existing accounts of omega knowledge face major challenges. One account is skeptical, claiming that we have no omega knowledge of any ordinary claims about the world. Another account embraces the KK thesis, and iden…Read more
You omega know something when you know it, and know that you know it, and know that you know that you know it, and so on. This paper first argues that omega knowledge matters, in the sense that it is required for rational assertion, action, inquiry, and belief. The paper argues that existing accounts of omega knowledge face major challenges. One account is skeptical, claiming that we have no omega knowledge of any ordinary claims about the world. Another account embraces the KK thesis, and identifies knowledge with omega knowledge. This position faces counterexamples, and struggles to make sense of inexact knowledge. The paper then develops a new account of knowledge, by proposing the principle of Reflective Luminosity: if you know that you know something, then you omega know it. I argue that Reflective Luminosity allows for omega knowledge while avoiding the problems for KK.

Skepticism Theories of Knowledge Epistemic Norms
733

Iterated Knowledge
Oxford University Press. 2024.

You omega know p when you possess every iteration of knowledge of p. This book argues that omega knowledge plays a central role in philosophy. In particular, the book argues that omega knowledge is necessary for permissible assertion, action, inquiry, and belief. Although omega knowledge plays this important role, existing theories of omega knowledge are unsatisfying. One theory, KK, identifies knowledge with omega knowledge. This theory struggles to accommodate cases of inexact knowledge. The o…Read more
You omega know p when you possess every iteration of knowledge of p. This book argues that omega knowledge plays a central role in philosophy. In particular, the book argues that omega knowledge is necessary for permissible assertion, action, inquiry, and belief. Although omega knowledge plays this important role, existing theories of omega knowledge are unsatisfying. One theory, KK, identifies knowledge with omega knowledge. This theory struggles to accommodate cases of inexact knowledge. The other main theory is skeptical, claiming that we do not omega know any ordinary claims about the world. This book develops and critically compares three new theories of omega knowledge.

Epistemic Norms Epistemological Theories Formal Epistemology Knowledge Skepticism
665

Safety, Closure, and Extended Methods
with John Hawthorne

Journal of Philosophy 121 (1): 26-54. 2024.

Recent research has identified a tension between the Safety principle that knowledge is belief without risk of error, and the Closure principle that knowledge is preserved by competent deduction. Timothy Williamson reconciles Safety and Closure by proposing that when an agent deduces a conclusion from some premises, the agent’s method for believing the conclusion includes their method for believing each premise. We argue that this theory is untenable because it implies problematically easy epist…Read more
Recent research has identified a tension between the Safety principle that knowledge is belief without risk of error, and the Closure principle that knowledge is preserved by competent deduction. Timothy Williamson reconciles Safety and Closure by proposing that when an agent deduces a conclusion from some premises, the agent’s method for believing the conclusion includes their method for believing each premise. We argue that this theory is untenable because it implies problematically easy epistemic access to one’s methods. Several possible solutions are explored and rejected.

The Problem of Easy Knowledge Luminosity Safety and Sensitivity Theories of Knowledge, Misc Closure of K…Read more
The Problem of Easy Knowledge Luminosity Safety and Sensitivity Theories of Knowledge, Misc Closure of Knowledge The KK Principle
486

Attitude verbs’ local context
with Kyle Blumberg

Linguistics and Philosophy 46 (3): 483-507. 2022.

Schlenker (Semant Pragmat 2(3):1–78, 2009; Philos Stud 151(1):115–142, 2010a; Mind 119(474):377–391, 2010b) provides an algorithm for deriving the presupposition projection properties of an expression from that expression’s classical semantics. In this paper, we consider the predictions of Schlenker’s algorithm as applied to attitude verbs. More specifically, we compare Schlenker’s theory with a prominent view which maintains that attitudes exhibit belief projection, so that presupposition trigg…Read more
Schlenker (Semant Pragmat 2(3):1–78, 2009; Philos Stud 151(1):115–142, 2010a; Mind 119(474):377–391, 2010b) provides an algorithm for deriving the presupposition projection properties of an expression from that expression’s classical semantics. In this paper, we consider the predictions of Schlenker’s algorithm as applied to attitude verbs. More specifically, we compare Schlenker’s theory with a prominent view which maintains that attitudes exhibit belief projection, so that presupposition triggers in their scope imply that the attitude holder believes the presupposition (Karttunen in Theor Linguist 34(1):181, 1974; Heim in J Semant 9(3):183–221, 1992; Sudo in The art and craft of semantics: a festschrift for Irene Heim, MIT Press, 2014). We show that Schlenker’s theory does not predict belief projection, and discuss several consequences of this result.

Presupposition Attitude Ascriptions
837

Question-Sensitive Theory of Intention
with Bob Beddor

Philosophical Quarterly 73 (2): 346-378. 2022.

This paper develops a question-sensitive theory of intention. We show that this theory explains some puzzling closure properties of intention. In particular, it can be used to explain why one is rationally required to intend the means to one’s ends, even though one is not rationally required to intend all the foreseen consequences of one’s intended actions. It also explains why rational intention is not always closed under logical implication, and why one can only intend outcomes that one believ…Read more
This paper develops a question-sensitive theory of intention. We show that this theory explains some puzzling closure properties of intention. In particular, it can be used to explain why one is rationally required to intend the means to one’s ends, even though one is not rationally required to intend all the foreseen consequences of one’s intended actions. It also explains why rational intention is not always closed under logical implication, and why one can only intend outcomes that one believes to be under one’s control.

The Doctrine of Double Effect Intentions Questions
544

Contextology
with Cameron Domenico Kirk-Giannini

Philosophical Studies 179 (11): 3187-3209. 2022.

Contextology is the science of the dynamics of the conversational context. Contextology formulates laws governing how the shared information states of interlocutors evolve in response to assertion. More precisely, the contextologist attempts to construct a function which, when provided with just a conversation’s pre-update context and the content of an assertion, delivers that conversation’s post-update context. Most contextologists have assumed that the function governing the evolution of the c…Read more
Contextology is the science of the dynamics of the conversational context. Contextology formulates laws governing how the shared information states of interlocutors evolve in response to assertion. More precisely, the contextologist attempts to construct a function which, when provided with just a conversation’s pre-update context and the content of an assertion, delivers that conversation’s post-update context. Most contextologists have assumed that the function governing the evolution of the context is simple: the post-update context is just the pre-update context intersected with the content of the assertion. We argue that this assumption is wrong: not only is it false, it is also incoherent given standard contextological assumptions. Moreover, it is impossible in principle to revise it to correctly describe the dynamics of context. We conclude that there can be no science of Contextology. The laws governing the evolution of the context in response to assertion must make essential reference to the private information states of interlocutors.

Semantics-Pragmatics Distinction The Nature of Context Assertion Theories of Speech Acts
476

Sly Pete in Dynamic Semantics
Journal of Philosophical Logic 51 (5): 1103-1117. 2022.

In ‘Sly Pete’ or ‘standoff’ cases, reasonable speakers accept incompatible conditionals, and communicate them successfully to a trusting hearer. This paper uses the framework of dynamic semantics to offer a new model of the conversational dynamics at play in standoffs, and to articulate several puzzles posed by such cases. The paper resolves these puzzles by embracing a dynamic semantics for conditionals, according to which indicative conditionals require that their antecedents are possible in t…Read more
In ‘Sly Pete’ or ‘standoff’ cases, reasonable speakers accept incompatible conditionals, and communicate them successfully to a trusting hearer. This paper uses the framework of dynamic semantics to offer a new model of the conversational dynamics at play in standoffs, and to articulate several puzzles posed by such cases. The paper resolves these puzzles by embracing a dynamic semantics for conditionals, according to which indicative conditionals require that their antecedents are possible in their local context, and update this body of information by eliminating the possibilities where the antecedent is true and the consequent is false. In this way, the dynamic analysis draws on insights from the material conditional and contextualist analyses, while explaining how standoffs are genuine disagreements.

Conditionals Semantics-Pragmatics Distinction Dynamic Semantics
684

Knowledge from multiple experiences
with John Hawthorne

Philosophical Studies 179 (4): 1341-1372. 2021.

This paper models knowledge in cases where an agent has multiple experiences over time. Using this model, we introduce a series of observations that undermine the pretheoretic idea that the evidential significance of experience depends on the extent to which that experience matches the world. On the basis of these observations, we model knowledge in terms of what is likely given the agent’s experience. An agent knows p when p is implied by her epistemic possibilities. A world is epistemically po…Read more
This paper models knowledge in cases where an agent has multiple experiences over time. Using this model, we introduce a series of observations that undermine the pretheoretic idea that the evidential significance of experience depends on the extent to which that experience matches the world. On the basis of these observations, we model knowledge in terms of what is likely given the agent’s experience. An agent knows p when p is implied by her epistemic possibilities. A world is epistemically possible when its probability given the agent’s experiences is not significantly lower than the probability of the actual world given that experience.

Theories of Knowledge Defining Knowledge, Misc Perceptual Justification Perception and Knowledge, Misc P…Read more
Theories of Knowledge Defining Knowledge, Misc Perceptual Justification Perception and Knowledge, Misc Probabilistic Reasoning
969

Fragile Knowledge
Mind 131 (522): 487-515. 2022.

This paper explores the principle that knowledge is fragile, in that whenever S knows that S doesn’t know that S knows that p, S thereby fails to know p. Fragility is motivated by the infelicity of dubious assertions, utterances which assert p while acknowledging higher-order ignorance whether p. Fragility is interestingly weaker than KK, the principle that if S knows p, then S knows that S knows p. Existing theories of knowledge which deny KK by accepting a Margin for Error principle can be con…Read more
This paper explores the principle that knowledge is fragile, in that whenever S knows that S doesn’t know that S knows that p, S thereby fails to know p. Fragility is motivated by the infelicity of dubious assertions, utterances which assert p while acknowledging higher-order ignorance whether p. Fragility is interestingly weaker than KK, the principle that if S knows p, then S knows that S knows p. Existing theories of knowledge which deny KK by accepting a Margin for Error principle can be conservatively extended with Fragility.

The KK Principle Safety and Sensitivity Epistemic Logic Assertion Luminosity Theories of Knowledge
922

Probability for Epistemic Modalities
with Paolo Santorio

Philosophers' Imprint 21 (33). 2021.

This paper develops an information-sensitive theory of the semantics and probability of conditionals and statements involving epistemic modals. The theory validates a number of principles linking probability and modality, including the principle that the probability of a conditional If A, then C equals the probability of C, updated with A. The theory avoids so-called triviality results, which are standardly taken to show that principles of this sort cannot be validated. To achieve this, we deny …Read more
This paper develops an information-sensitive theory of the semantics and probability of conditionals and statements involving epistemic modals. The theory validates a number of principles linking probability and modality, including the principle that the probability of a conditional If A, then C equals the probability of C, updated with A. The theory avoids so-called triviality results, which are standardly taken to show that principles of this sort cannot be validated. To achieve this, we deny that rational agents update their credences via conditionalization. We offer a new rule of update, Hyperconditionalization, which agrees with Conditionalization whenever nonmodal statements are at stake but differs for modal and conditional sentences.

Modal Expressions Indicative Conditionals and Conditional Probabilities Dynamic Semantics Possible Worl…Read more
Modal Expressions Indicative Conditionals and Conditional Probabilities Dynamic Semantics Possible World Semantics Conditionalization Updating Principles
812

Counterfactual Contamination
with John Hawthorne

Australasian Journal of Philosophy 100 (2): 262-278. 2022.

Many defend the thesis that when someone knows p, they couldn’t easily have been wrong about p. But the notion of easy possibility in play is relatively undertheorized. One structural idea in the literature, the principle of Counterfactual Closure (CC), connects easy possibility with counterfactuals: if it easily could have happened that p, and if p were the case, then q would be the case, it follows that it easily could have happened that q. We first argue that while CC is false, there is a tru…Read more
Many defend the thesis that when someone knows p, they couldn’t easily have been wrong about p. But the notion of easy possibility in play is relatively undertheorized. One structural idea in the literature, the principle of Counterfactual Closure (CC), connects easy possibility with counterfactuals: if it easily could have happened that p, and if p were the case, then q would be the case, it follows that it easily could have happened that q. We first argue that while CC is false, there is a true restriction of it to cases involving counterfactual dependence on a coin flip. The failure of CC falsifies a model where the easy possibilities are counterfactually similar to actuality. Next, we show that extant normality models, where the easy possibilities are the sufficiently normal ones, are incompatible with the restricted CC thesis involving coin flips. Next, we develop a new kind of normality theory that can accommodate the restricted version of CC. This new theory introduces a principle of Counterfactual Contamination, which says roughly that any world is fairly abnormal if at that world very abnormal events counterfactually depend on a coin flip. Finally, we explain why coin flips and other related events have a special status. A central take home lesson is that the correct principle in the vicinity of Safety is importantly normality-theoretic rather than (as it is usually conceived) similarity-theoretic.

Possible-World Theories of Counterfactuals Epistemological Theories, Misc Theories of Knowledge, Misc R…Read more
Possible-World Theories of Counterfactuals Epistemological Theories, Misc Theories of Knowledge, Misc Reliabilism about Knowledge
744

The normality of error
with Sam Carter

Philosophical Studies 178 (8): 2509-2533. 2021.

Formal models of appearance and reality have proved fruitful for investigating structural properties of perceptual knowledge. This paper applies the same approach to epistemic justification. Our central goal is to give a simple account of The Preface, in which justified belief fails to agglomerate. Following recent work by a number of authors, we understand knowledge in terms of normality. An agent knows p iff p is true throughout all relevant normal worlds. To model The Preface, we appeal to th…Read more
Formal models of appearance and reality have proved fruitful for investigating structural properties of perceptual knowledge. This paper applies the same approach to epistemic justification. Our central goal is to give a simple account of The Preface, in which justified belief fails to agglomerate. Following recent work by a number of authors, we understand knowledge in terms of normality. An agent knows p iff p is true throughout all relevant normal worlds. To model The Preface, we appeal to the normality of error. Sometimes, it is more normal for reality and appearance to diverge than to match. We show that this simple idea has dramatic consequences for the theory of knowledge and justification. Among other things, we argue that a proper treatment of The Preface requires a departure from the internalist idea that epistemic justification supervenes on the appearances and the widespread idea that one knows most when free from error.

Evidentialism Knowledge Skepticism Epistemic Internalism and Externalism
800

Mighty Knowledge
with Bob Beddor

Journal of Philosophy 118 (5): 229-269. 2021.

We often claim to know what might be—or probably is—the case. Modal knowledge along these lines creates a puzzle for information-sensitive semantics for epistemic modals. This paper develops a solution. We start with the idea that knowledge requires safe belief: a belief amounts to knowledge only if it could not easily have been held falsely. We then develop an interpretation of the modal operator in safety that allows it to non-trivially embed information-sensitive contents. The resulting theor…Read more
We often claim to know what might be—or probably is—the case. Modal knowledge along these lines creates a puzzle for information-sensitive semantics for epistemic modals. This paper develops a solution. We start with the idea that knowledge requires safe belief: a belief amounts to knowledge only if it could not easily have been held falsely. We then develop an interpretation of the modal operator in safety that allows it to non-trivially embed information-sensitive contents. The resulting theory avoids various paradoxes that arise from other accounts of modal knowledge. It also delivers plausible predictions about modal Gettier cases.

Epistemic Possibility Safety and Sensitivity Theories of Knowledge, Misc Epistemic Modals
853

Losing Confidence in Luminosity
with Daniel Waxman

Noûs (4): 1-30. 2020.

A mental state is luminous if, whenever an agent is in that state, they are in a position to know that they are. Following Timothy Williamson’s Knowledge and Its Limits, a wave of recent work has explored whether there are any non-trivial luminous mental states. A version of Williamson’s anti-luminosity appeals to a safety- theoretic principle connecting knowledge and confidence: if an agent knows p, then p is true in any nearby scenario where she has a similar level of confidence in p. However,…Read more
A mental state is luminous if, whenever an agent is in that state, they are in a position to know that they are. Following Timothy Williamson’s Knowledge and Its Limits, a wave of recent work has explored whether there are any non-trivial luminous mental states. A version of Williamson’s anti-luminosity appeals to a safety- theoretic principle connecting knowledge and confidence: if an agent knows p, then p is true in any nearby scenario where she has a similar level of confidence in p. However, the relevant notion of confidence is relatively underexplored. This paper develops a precise theory of confidence: an agent’s degree of confidence in p is the objective chance they will rely on p in practical reasoning. This theory of confidence is then used to critically evaluate the anti-luminosity argument, leading to the surprising conclusion that although there are strong reasons for thinking that luminosity does not obtain, they are quite different from those the existing literature has considered. In particular, we show that once the notion of confidence is properly understood, the failure of luminosity follows from the assumption that knowledge requires high confidence, and does not require any kind of safety principle as a premise

Safety and Sensitivity The KK Principle Luminosity
887

Epistemic Modal Credence
Philosophers' Imprint 21 (26). 2021.

Triviality results threaten plausible principles governing our credence in epistemic modal claims. This paper develops a new account of modal credence which avoids triviality. On the resulting theory, probabilities are assigned not to sets of worlds, but rather to sets of information state-world pairs. The theory avoids triviality by giving up the principle that rational credence is closed under conditionalization. A rational agent can become irrational by conditionalizing on new evidence. In pl…Read more
Triviality results threaten plausible principles governing our credence in epistemic modal claims. This paper develops a new account of modal credence which avoids triviality. On the resulting theory, probabilities are assigned not to sets of worlds, but rather to sets of information state-world pairs. The theory avoids triviality by giving up the principle that rational credence is closed under conditionalization. A rational agent can become irrational by conditionalizing on new evidence. In place of conditionalization, the paper develops a new account of updating: conditionalization with normalization.

Epistemic Modals Conditionals, Misc Formal Epistemology, Misc Dynamic Semantics Indicative Conditionals …Read more
Epistemic Modals Conditionals, Misc Formal Epistemology, Misc Dynamic Semantics Indicative Conditionals and Conditional Probabilities Conditionalization Updating Principles
351

Free choice and homogeneity
Semantics and Pragmatics 12 1-48. 2019.

This paper develops a semantic solution to the puzzle of Free Choice permission. The paper begins with a battery of impossibility results showing that Free Choice is in tension with a variety of classical principles, including Disjunction Introduction and the Law of Excluded Middle. Most interestingly, Free Choice appears incompatible with a principle concerning the behavior of Free Choice under negation, Double Prohibition, which says that Mary can’t have soup or salad implies Mary can’t have s…Read more
This paper develops a semantic solution to the puzzle of Free Choice permission. The paper begins with a battery of impossibility results showing that Free Choice is in tension with a variety of classical principles, including Disjunction Introduction and the Law of Excluded Middle. Most interestingly, Free Choice appears incompatible with a principle concerning the behavior of Free Choice under negation, Double Prohibition, which says that Mary can’t have soup or salad implies Mary can’t have soup and Mary can’t have salad. Alonso-Ovalle 2006 and others have appealed to Double Prohibition to motivate pragmatic accounts of Free Choice. Aher 2012, Aloni 2018, and others have developed semantic accounts of Free Choice that also explain Double Prohibition. This paper offers a new semantic analysis of Free Choice designed to handle the full range of impossibility results involved in Free Choice. The paper develops the hypothesis that Free Choice is a homogeneity effect. The claim possibly A or B is defined only when A and B are homogenous with respect to their modal status, either both possible or both impossible. Paired with a notion of entailment that is sensitive to definedness conditions, this theory validates Free Choice while retaining a wide variety of classical principles except for the transitivity of entailment. The homogeneity hypothesis is implemented in two different ways, homogeneous alternative semantics and homogeneous dynamic semantics, with interestingly different consequences.

Modal Expressions Connectives Dynamic Semantics
75

The counterfactual direct argument
Linguistics and Philosophy 43 (2): 193-232. 2020.

Many have accepted that ordinary counterfactuals and might counterfactuals are duals. In this paper, I show that this thesis leads to paradoxical results when combined with a few different unorthodox yet increasingly popular theses, including the thesis that counterfactuals are strict conditionals. Given Duality and several other theses, we can quickly infer the validity of another paradoxical principle, ‘The Counterfactual Direct Argument’, which says that ‘A> ’ entails ‘A> ’. First, I provide …Read more
Many have accepted that ordinary counterfactuals and might counterfactuals are duals. In this paper, I show that this thesis leads to paradoxical results when combined with a few different unorthodox yet increasingly popular theses, including the thesis that counterfactuals are strict conditionals. Given Duality and several other theses, we can quickly infer the validity of another paradoxical principle, ‘The Counterfactual Direct Argument’, which says that ‘A> ’ entails ‘A> ’. First, I provide a collapse theorem for the ‘counterfactual direct argument’. The counterfactual direct argument entails the logical equivalence of the subjunctive and material conditional, given a variety of assumptions. Second, I provide a semantics that validates the counterfactual direct argument without collapse. This theory further develops extant dynamic accounts of conditionals. I give a new semantics for disjunction, on which A or B is only true in a context when A and B are both unsettled. The resulting framework validates CDA while invalidating other commonly accepted principles concerning the conditional and disjunction.

Philosophy of Language Meaning, Misc Logic of Conditionals Dynamic Semantics Subjunctive Conditionals, M…Read more
Philosophy of Language Meaning, Misc Logic of Conditionals Dynamic Semantics Subjunctive Conditionals, Misc
61

Free Choice Impossibility Results
Journal of Philosophical Logic 49 (2): 249-282. 2020.

Free Choice is the principle that possibly p or q implies and is implied by possibly p and possibly q. A variety of recent attempts to validate Free Choice rely on a nonclassical semantics for disjunction, where the meaning of p or q is not a set of possible worlds. This paper begins with a battery of impossibility results, showing that some kind of nonclassical semantics for disjunction is required in order to validate Free Choice. The paper then provides a positive account of Free Choice, by i…Read more
Free Choice is the principle that possibly p or q implies and is implied by possibly p and possibly q. A variety of recent attempts to validate Free Choice rely on a nonclassical semantics for disjunction, where the meaning of p or q is not a set of possible worlds. This paper begins with a battery of impossibility results, showing that some kind of nonclassical semantics for disjunction is required in order to validate Free Choice. The paper then provides a positive account of Free Choice, by identifying a family of dynamic semantics for disjunction that can validate the inference. On all such theories, the meaning of p or q has two parts. First, p or q requires that our information is consistent with each of p and q. Second, p or q narrows down our information by eliminating some worlds. It turns out that this second component of or is well behaved: there is a strongest such meaning that p or q can express, consistent with validating Free Choice. The strongest such meaning is the classical one, on which p or q eliminates any world where both p and q are false. In this way, the classical meaning of disjunction turns out to be intimately related to the validity of Free Choice.

Disjunction Modal Expressions, Misc Dynamic Semantics Meaning, Misc Logical Semantics and Logical Truth L…Read more
Disjunction Modal Expressions, Misc Dynamic Semantics Meaning, Misc Logical Semantics and Logical Truth Logics
497

A Theory of Conditional Assertion
Journal of Philosophy 116 (6): 293-318. 2019.

According to one tradition, uttering an indicative conditional involves performing a special sort of speech act: a conditional assertion. We introduce a formal framework that models this speech act. Using this framework, we show that any theory of conditional assertion validates several inferences in the logic of conditionals, including the False Antecedent inference. Next, we determine the space of truth-conditional semantics for conditionals consistent with conditional assertion. The truth val…Read more
According to one tradition, uttering an indicative conditional involves performing a special sort of speech act: a conditional assertion. We introduce a formal framework that models this speech act. Using this framework, we show that any theory of conditional assertion validates several inferences in the logic of conditionals, including the False Antecedent inference. Next, we determine the space of truth-conditional semantics for conditionals consistent with conditional assertion. The truth value of any such conditional is settled whenever the antecedent is false, and whenever the antecedent is true and the consequent is false. Then, we consider the space of dynamic meanings consistent with the theory of conditional assertion. We develop a new family of dynamic conditional-assertion operators that combine a traditional test operator with an update operation.

Logic of Conditionals
129

Generalized Update Semantics
Mind 128 (511): 795-835. 2019.

This paper explores the relationship between dynamic and truth conditional semantics for epistemic modals. It provides a generalization of a standard dynamic update semantics for modals. This new semantics derives a Kripke semantics for modals and a standard dynamic semantics for modals as special cases. The semantics allows for new characterizations of a variety of principles in modal logic, including the inconsistency of ‘p and might not p’. Finally, the semantics provides a construction proce…Read more
This paper explores the relationship between dynamic and truth conditional semantics for epistemic modals. It provides a generalization of a standard dynamic update semantics for modals. This new semantics derives a Kripke semantics for modals and a standard dynamic semantics for modals as special cases. The semantics allows for new characterizations of a variety of principles in modal logic, including the inconsistency of ‘p and might not p’. Finally, the semantics provides a construction procedure for transforming any truth conditional semantics for modals into a dynamic semantics for modals with similar properties.

Epistemic Modals Meaning, Misc Dynamic Semantics
1135

Believing epistemic contradictions
with Beddor Bob

Review of Symbolic Logic (1): 87-114. 2018.

What is it to believe something might be the case? We develop a puzzle that creates difficulties for standard answers to this question. We go on to propose our own solution, which integrates a Bayesian approach to belief with a dynamic semantics for epistemic modals. After showing how our account solves the puzzle, we explore a surprising consequence: virtually all of our beliefs about what might be the case provide counterexamples to the view that rational belief is closed under logical implica…Read more
What is it to believe something might be the case? We develop a puzzle that creates difficulties for standard answers to this question. We go on to propose our own solution, which integrates a Bayesian approach to belief with a dynamic semantics for epistemic modals. After showing how our account solves the puzzle, we explore a surprising consequence: virtually all of our beliefs about what might be the case provide counterexamples to the view that rational belief is closed under logical implication.

Epistemic Modals Deductive Reasoning Dynamic Semantics
114

A Preface Paradox for Intention
Philosophers' Imprint 16. 2016.

In this paper I argue that there is a preface paradox for intention. The preface paradox for intention shows that intentions do not obey an agglomeration norm, requiring one to intend conjunctions of whatever else one intends. But what norms do intentions obey? I will argue that intentions come in degrees. These partial intentions are governed by the norms of the probability calculus. First, I will give a dispositional theory of partial intention, on which degrees of intention are the degrees to…Read more
In this paper I argue that there is a preface paradox for intention. The preface paradox for intention shows that intentions do not obey an agglomeration norm, requiring one to intend conjunctions of whatever else one intends. But what norms do intentions obey? I will argue that intentions come in degrees. These partial intentions are governed by the norms of the probability calculus. First, I will give a dispositional theory of partial intention, on which degrees of intention are the degrees to which one possesses the dispositions characteristic of full intention. I will use this dispositional theory to defend probabilism about intention. Next, I will offer a more general argument for probabilism about intention. To do so, I will generalize recent decision theoretic arguments for probabilism from the case of belief to the case of intention.

Intentions, Misc
509

Triviality Results For Probabilistic Modals
Philosophy and Phenomenological Research 99 (1): 188-222. 2017.

In recent years, a number of theorists have claimed that beliefs about probability are transparent. To believe probably p is simply to have a high credence that p. In this paper, I prove a variety of triviality results for theses like the above. I show that such claims are inconsistent with the thesis that probabilistic modal sentences have propositions or sets of worlds as their meaning. Then I consider the extent to which a dynamic semantics for probabilistic modals can capture theses connecti…Read more
In recent years, a number of theorists have claimed that beliefs about probability are transparent. To believe probably p is simply to have a high credence that p. In this paper, I prove a variety of triviality results for theses like the above. I show that such claims are inconsistent with the thesis that probabilistic modal sentences have propositions or sets of worlds as their meaning. Then I consider the extent to which a dynamic semantics for probabilistic modals can capture theses connecting belief, certainty, credence, and probability. I show that although a dynamic semantics for probabilistic modals does allow one to validate such theses, it can only do so at a cost. I prove that such theses can only be valid if probabilistic modals do not satisfy the axioms of the probability calculus.
152

A Stronger Doctrine of Double Effect
with Ben Bronner

Australasian Journal of Philosophy 96 (4): 793-805. 2018.

Many believe that intended harms are more difficult to justify than are harms that result as a foreseen side effect of one's conduct. We describe cases of harming in which the harm is not intended, yet the harmful act nevertheless runs afoul of the intuitive moral constraint that governs intended harms. We note that these cases provide new and improved counterexamples to the so-called Simple View, according to which intentionally phi-ing requires intending to phi. We then give a new theory of th…Read more
Many believe that intended harms are more difficult to justify than are harms that result as a foreseen side effect of one's conduct. We describe cases of harming in which the harm is not intended, yet the harmful act nevertheless runs afoul of the intuitive moral constraint that governs intended harms. We note that these cases provide new and improved counterexamples to the so-called Simple View, according to which intentionally phi-ing requires intending to phi. We then give a new theory of the moral relevance of intention. This theory yields the traditional constraint on intending harm as a special case, along with several stronger demands.

Intentional Action The Doctrine of Double Effect The Simple View of Intention
2009

Conditional Heresies
with Fabrizio Cariani

Philosophy and Phenomenological Research (2): 251-282. 2018.

Philosophy and Phenomenological Research, EarlyView.

Conditionals, Misc Semantic Theories, Misc Logical Consequence and Entailment Presupposition Logic of Co…Read more
Conditionals, Misc Semantic Theories, Misc Logical Consequence and Entailment Presupposition Logic of Conditionals

Simon Goldstein

AI Deception: A Survey of Examples, Risks, and Potential Solutions
with Peter Park, Aidan O'Gara, Michael Chen, and Dan Hendrycks

Losing confidence in luminosity
with Daniel Waxman

Noûs 55 (4): 962-991. 2021.

Language Agents Reduce the Risk of Existential Catastrophe
with Cameron Domenico Kirk-Giannini

AI and Society 1-11. forthcoming.

AI Wellbeing
with Cameron Domenico Kirk-Giannini

Getting Accurate about Knowledge
with Sam Carter

Mind 132 (525): 158-191. 2022.

Omega Knowledge Matters
Oxford Studies in Epistemology. forthcoming.

Iterated Knowledge
Oxford University Press. 2024.

Safety, Closure, and Extended Methods
with John Hawthorne

Journal of Philosophy 121 (1): 26-54. 2024.

Attitude verbs’ local context
with Kyle Blumberg

Linguistics and Philosophy 46 (3): 483-507. 2022.

Question-Sensitive Theory of Intention
with Bob Beddor

Philosophical Quarterly 73 (2): 346-378. 2022.

Contextology
with Cameron Domenico Kirk-Giannini

Philosophical Studies 179 (11): 3187-3209. 2022.

Sly Pete in Dynamic Semantics
Journal of Philosophical Logic 51 (5): 1103-1117. 2022.

Knowledge from multiple experiences
with John Hawthorne

Philosophical Studies 179 (4): 1341-1372. 2021.

Fragile Knowledge
Mind 131 (522): 487-515. 2022.

Probability for Epistemic Modalities
with Paolo Santorio

Philosophers' Imprint 21 (33). 2021.

Counterfactual Contamination
with John Hawthorne

Australasian Journal of Philosophy 100 (2): 262-278. 2022.

The normality of error
with Sam Carter

Philosophical Studies 178 (8): 2509-2533. 2021.

Mighty Knowledge
with Bob Beddor

Journal of Philosophy 118 (5): 229-269. 2021.

Losing Confidence in Luminosity
with Daniel Waxman

Noûs (4): 1-30. 2020.

Epistemic Modal Credence
Philosophers' Imprint 21 (26). 2021.

Free choice and homogeneity
Semantics and Pragmatics 12 1-48. 2019.

The counterfactual direct argument
Linguistics and Philosophy 43 (2): 193-232. 2020.

Free Choice Impossibility Results
Journal of Philosophical Logic 49 (2): 249-282. 2020.

A Theory of Conditional Assertion
Journal of Philosophy 116 (6): 293-318. 2019.

Generalized Update Semantics
Mind 128 (511): 795-835. 2019.

Believing epistemic contradictions
with Beddor Bob

Review of Symbolic Logic (1): 87-114. 2018.

A Preface Paradox for Intention
Philosophers' Imprint 16. 2016.

Triviality Results For Probabilistic Modals
Philosophy and Phenomenological Research 99 (1): 188-222. 2017.

A Stronger Doctrine of Double Effect
with Ben Bronner

Australasian Journal of Philosophy 96 (4): 793-805. 2018.

Conditional Heresies
with Fabrizio Cariani

Philosophy and Phenomenological Research (2): 251-282. 2018.

Simon Goldstein

AI Deception: A Survey of Examples, Risks, and Potential Solutions with Peter Park, Aidan O'Gara, Michael Chen, and Dan Hendrycks

Losing confidence in luminosity with Daniel Waxman Noûs 55 (4): 962-991. 2021.

Language Agents Reduce the Risk of Existential Catastrophe with Cameron Domenico Kirk-Giannini AI and Society 1-11. forthcoming.

AI Wellbeing with Cameron Domenico Kirk-Giannini

Getting Accurate about Knowledge with Sam Carter Mind 132 (525): 158-191. 2022.

Omega Knowledge Matters Oxford Studies in Epistemology. forthcoming.

Iterated Knowledge Oxford University Press. 2024.

Safety, Closure, and Extended Methods with John Hawthorne Journal of Philosophy 121 (1): 26-54. 2024.

Attitude verbs’ local context with Kyle Blumberg Linguistics and Philosophy 46 (3): 483-507. 2022.

Question-Sensitive Theory of Intention with Bob Beddor Philosophical Quarterly 73 (2): 346-378. 2022.

Contextology with Cameron Domenico Kirk-Giannini Philosophical Studies 179 (11): 3187-3209. 2022.

Sly Pete in Dynamic Semantics Journal of Philosophical Logic 51 (5): 1103-1117. 2022.

Knowledge from multiple experiences with John Hawthorne Philosophical Studies 179 (4): 1341-1372. 2021.

Fragile Knowledge Mind 131 (522): 487-515. 2022.

Probability for Epistemic Modalities with Paolo Santorio Philosophers' Imprint 21 (33). 2021.

Counterfactual Contamination with John Hawthorne Australasian Journal of Philosophy 100 (2): 262-278. 2022.

The normality of error with Sam Carter Philosophical Studies 178 (8): 2509-2533. 2021.

Mighty Knowledge with Bob Beddor Journal of Philosophy 118 (5): 229-269. 2021.

Losing Confidence in Luminosity with Daniel Waxman Noûs (4): 1-30. 2020.

Epistemic Modal Credence Philosophers' Imprint 21 (26). 2021.

Free choice and homogeneity Semantics and Pragmatics 12 1-48. 2019.

The counterfactual direct argument Linguistics and Philosophy 43 (2): 193-232. 2020.

Free Choice Impossibility Results Journal of Philosophical Logic 49 (2): 249-282. 2020.

A Theory of Conditional Assertion Journal of Philosophy 116 (6): 293-318. 2019.

Generalized Update Semantics Mind 128 (511): 795-835. 2019.

Believing epistemic contradictions with Beddor Bob Review of Symbolic Logic (1): 87-114. 2018.

A Preface Paradox for Intention Philosophers' Imprint 16. 2016.

Triviality Results For Probabilistic Modals Philosophy and Phenomenological Research 99 (1): 188-222. 2017.

A Stronger Doctrine of Double Effect with Ben Bronner Australasian Journal of Philosophy 96 (4): 793-805. 2018.

Conditional Heresies with Fabrizio Cariani Philosophy and Phenomenological Research (2): 251-282. 2018.

AI Deception: A Survey of Examples, Risks, and Potential Solutions
with Peter Park, Aidan O'Gara, Michael Chen, and Dan Hendrycks

Losing confidence in luminosity
with Daniel Waxman

Noûs 55 (4): 962-991. 2021.

Language Agents Reduce the Risk of Existential Catastrophe
with Cameron Domenico Kirk-Giannini

AI and Society 1-11. forthcoming.

AI Wellbeing
with Cameron Domenico Kirk-Giannini

Getting Accurate about Knowledge
with Sam Carter

Mind 132 (525): 158-191. 2022.

Omega Knowledge Matters
Oxford Studies in Epistemology. forthcoming.

Iterated Knowledge
Oxford University Press. 2024.

Safety, Closure, and Extended Methods
with John Hawthorne

Journal of Philosophy 121 (1): 26-54. 2024.

Attitude verbs’ local context
with Kyle Blumberg

Linguistics and Philosophy 46 (3): 483-507. 2022.

Question-Sensitive Theory of Intention
with Bob Beddor

Philosophical Quarterly 73 (2): 346-378. 2022.

Contextology
with Cameron Domenico Kirk-Giannini

Philosophical Studies 179 (11): 3187-3209. 2022.

Sly Pete in Dynamic Semantics
Journal of Philosophical Logic 51 (5): 1103-1117. 2022.

Knowledge from multiple experiences
with John Hawthorne

Philosophical Studies 179 (4): 1341-1372. 2021.

Fragile Knowledge
Mind 131 (522): 487-515. 2022.

Probability for Epistemic Modalities
with Paolo Santorio

Philosophers' Imprint 21 (33). 2021.

Counterfactual Contamination
with John Hawthorne

Australasian Journal of Philosophy 100 (2): 262-278. 2022.

The normality of error
with Sam Carter

Philosophical Studies 178 (8): 2509-2533. 2021.

Mighty Knowledge
with Bob Beddor

Journal of Philosophy 118 (5): 229-269. 2021.

Losing Confidence in Luminosity
with Daniel Waxman

Noûs (4): 1-30. 2020.

Epistemic Modal Credence
Philosophers' Imprint 21 (26). 2021.

Free choice and homogeneity
Semantics and Pragmatics 12 1-48. 2019.

The counterfactual direct argument
Linguistics and Philosophy 43 (2): 193-232. 2020.

Free Choice Impossibility Results
Journal of Philosophical Logic 49 (2): 249-282. 2020.

A Theory of Conditional Assertion
Journal of Philosophy 116 (6): 293-318. 2019.

Generalized Update Semantics
Mind 128 (511): 795-835. 2019.

Believing epistemic contradictions
with Beddor Bob

Review of Symbolic Logic (1): 87-114. 2018.

A Preface Paradox for Intention
Philosophers' Imprint 16. 2016.

Triviality Results For Probabilistic Modals
Philosophy and Phenomenological Research 99 (1): 188-222. 2017.

A Stronger Doctrine of Double Effect
with Ben Bronner

Australasian Journal of Philosophy 96 (4): 793-805. 2018.

Conditional Heresies
with Fabrizio Cariani

Philosophy and Phenomenological Research (2): 251-282. 2018.