Amir Konigsberg (Hebrew University of Jerusalem): Publications

More details

Hebrew University of Jerusalem
Department of Philosophy

Post-doctoral fellow
Princeton University
Department of Philosophy

Post-doctoral fellow

Hebrew University of Jerusalem

Department of Philosophy

PhD, 2011

Homepage

Jerusalem, Israel

Areas of Specialization

Epistemology

Philosophy of Cognitive Science

Areas of Interest

Epistemology

Philosophy of Cognitive Science

Philosophy of Social Science

156

Beyond Behavior: Why AI Evaluation Needs a Cognitive Revolution

In 1950, Alan Turing proposed replacing the question "Can machines think?" with a behavioral test: if a machine's outputs are indistinguishable from those of a thinking being, the question of whether it truly thinks can be set aside. This paper argues that Turing's move was not only a pragmatic simplification but also an epistemological commitment, a decision about what kind of evidence counts as relevant to intelligence attribution, and that this commitment has quietly constrained AI research f…Read more
In 1950, Alan Turing proposed replacing the question "Can machines think?" with a behavioral test: if a machine's outputs are indistinguishable from those of a thinking being, the question of whether it truly thinks can be set aside. This paper argues that Turing's move was not only a pragmatic simplification but also an epistemological commitment, a decision about what kind of evidence counts as relevant to intelligence attribution, and that this commitment has quietly constrained AI research for seven decades. We trace how Turing's behavioral epistemology became embedded in the field's evaluative infrastructure, rendering the field structurally unable to pose a class of questions about process, mechanism, and internal organization that cognitive psychology, neuroscience, and related disciplines learned to ask. We draw a parallel to the behaviorist-to-cognitivist transition in psychology: just as psychology's commitment to studying only observable behavior prevented it from asking productive questions about internal mental processes until that commitment was abandoned, AI's commitment to behavioral evaluation prevents it from distinguishing between systems that achieve identical outputs through fundamentally different computational processes, a distinction on which intelligence attribution depends. We argue that the field requires an epistemological transition comparable to the cognitive revolution: not an abandonment of behavioral evidence, but a recognition that behavioral evidence alone is insufficient for the construct claims the field wishes to make. We articulate what a post-behaviorist epistemology for AI would involve and identify the specific questions it would make askable that the field currently has no way to ask.

Computationalism AI without Representation?Computation and Physical Systems Interpretability in Artifi…Read more
Computationalism AI without Representation?Computation and Physical Systems Interpretability in Artificial Intelligence Subsymbolic Computation Psychological Behaviorism Artificial Intelligence Safety Explainability in Artificial Intelligence Large Language Models
193

Beyond Behavior: Why AI Evaluation Needs a Cognitive Revolution

In 1950, Alan Turing proposed replacing the question "Can machines think?" with a behavioral test: if a machine's outputs are indistinguishable from those of a thinking being, the question of whether it truly thinks can be set aside. This paper argues that Turing's move was not only a pragmatic simplification but also an epistemological commitment, a decision about what kind of evidence counts as relevant to intelligence attribution, and that this commitment has constrained AI research for seven…Read more
In 1950, Alan Turing proposed replacing the question "Can machines think?" with a behavioral test: if a machine's outputs are indistinguishable from those of a thinking being, the question of whether it truly thinks can be set aside. This paper argues that Turing's move was not only a pragmatic simplification but also an epistemological commitment, a decision about what kind of evidence counts as relevant to intelligence attribution, and that this commitment has constrained AI research for seven decades. We trace how Turing's behavioral epistemology became embedded in the field's evaluative infrastructure, rendering the field structurally unable to pose a class of questions about process, mechanism, and internal organization that cognitive psychology, neuroscience, and related disciplines learned to ask. We draw a parallel to the behaviorist-to-cognitivist transition in psychology: just as psychology's commitment to studying only observable behavior prevented it from asking productive questions about internal mental processes until that commitment was abandoned, AI's commitment to behavioral evaluation prevents it from distinguishing between systems that achieve identical outputs through fundamentally different computational processes, a distinction on which intelligence attribution depends. We argue that the field requires an epistemological transition comparable to the cognitive revolution: not an abandonment of behavioral evidence, but a recognition that behavioral evidence alone is insufficient for the construct claims the field wishes to make. We articulate what a post-behaviorist epistemology for AI would involve and identify the specific questions it would make askable that the field currently has no way to ask.

Large Language Models Behaviorism Cognitivism in Psychology Artificial Intelligence in Science Computati…Read more
Large Language Models Behaviorism Cognitivism in Psychology Artificial Intelligence in Science Computationalism AI without Representation?Computation and Physical Systems Interpretability in Artificial Intelligence Subsymbolic Computation Psychological Behaviorism Artificial Intelligence Safety Explainability in Artificial Intelligence
454

Cognitive Sovereignty: The Authorship Problem in AI-Assisted Thought

The rapid integration of large language models into everyday cognitive tasks has created a need for conceptual frameworks adequate to the cognitive consequences of delegating thinking to AI systems. Existing constructs in psychology and also in epistemology, including critical thinking, metacognition, intellectual autonomy, and epistemic agency, each address related phenomena but none adequately captures the specific capacity threatened by habitual AI-assisted cognition, which I define as the ab…Read more
The rapid integration of large language models into everyday cognitive tasks has created a need for conceptual frameworks adequate to the cognitive consequences of delegating thinking to AI systems. Existing constructs in psychology and also in epistemology, including critical thinking, metacognition, intellectual autonomy, and epistemic agency, each address related phenomena but none adequately captures the specific capacity threatened by habitual AI-assisted cognition, which I define as the ability to remain the genuine author of one's own understanding. This paper introduces cognitive sovereignty as a distinct construct, defined as the capacity to (a) notice when one's thinking is being displaced, (b) maintain a meaningful connection to how one's beliefs and judgments are formed, and (c) distinguish between genuine reasoning and the subjective impression of having reasoned. I trace the concept's philosophical lineage, engage with the extended mind objection, differentiate cognitive sovereignty from adjacent constructs through systematic comparison, and present a growing body of empirical evidence that motivates the construct. The paper argues that cognitive sovereignty names a phenomenon that existing constructs individually fail to capture and that its articulation is a prerequisite for empirical research on AI's impact on human thinking.

Large Language Models Thought and Artificial Intelligence Mental States in Artificial Intelligence, Mi…Read more
Large Language Models Thought and Artificial Intelligence Mental States in Artificial Intelligence, Misc Agency and Artificial Intelligence Understanding and Artificial Intelligence
283

Can AI Agents Agree to Disagree? Aumann's Theorem and the Epistemic Status of Machine Outputs

Aumann's Agreement Theorem (1976) establishes that two Bayesian rational agents with common priors and common knowledge of each other's posterior beliefs cannot agree to disagree. Their posteriors must coincide. This paper applies Aumann's framework to AI agents built on large language models (LLMs), a domain in which the theorem's conditions appear, at first glance, to be unusually well satisfied. LLMs trained on overlapping data are often assumed to share something like common priors, and in m…Read more
Aumann's Agreement Theorem (1976) establishes that two Bayesian rational agents with common priors and common knowledge of each other's posterior beliefs cannot agree to disagree. Their posteriors must coincide. This paper applies Aumann's framework to AI agents built on large language models (LLMs), a domain in which the theorem's conditions appear, at first glance, to be unusually well satisfied. LLMs trained on overlapping data are often assumed to share something like common priors, and in multi-agent protocols their outputs are shared between participants. Yet LLM-based agents routinely produce divergent outputs on identical inputs, and multi-agent systems built from them are increasingly deployed in debate, deliberation, and consensus protocols that implicitly treat this divergence as epistemically meaningful. We argue that Aumann's theorem fails to apply to these agents not because the prior or rationality conditions are violated in the familiar ways they are violated for humans, but for a more fundamental reason: LLMs do not possess beliefs in the sense the theorem requires. Their outputs are samples from conditional probability distributions over token sequences, not reports of posterior probabilities conditioned on private information. We formalize the distinction between genuine disagreement, which carries epistemic content because it signals the existence of unshared evidence, and what we term pseudo-disagreement, which has the surface form of disagreement but arises from stochastic variation in generation processes that lack epistemic states. We show formally that pseudo-disagreement does not satisfy the informational conditions that make genuine disagreement epistemically valuable, and we trace the implications for multi-agent debate protocols, consensus methods, LLM-as-judge paradigms, and the broader practice of treating AI outputs as bearing on questions of truth. Our analysis applies specifically to autoregressive language models and the multi-agent systems built from them; AI systems with fundamentally different architectures, such as those maintaining explicit world models or calibrated Bayesian uncertainty estimates, may require separate treatment.

Common Knowledge Computer Science Mathematics Mathematical Logic Epistemology of Disagreement Belief Revi…Read more
Common Knowledge Computer Science Mathematics Mathematical Logic Epistemology of Disagreement Belief Revision, Misc
129

Aesthetic Educators, Aesthetic Experts, and Deferential Belief Formation
Journal of Aesthetic Education 50 (1): 34-45. 2016.

Rational aesthetic deference becomes apparent when one person’s aesthetic belief gives another person a reason to move his own aesthetic belief in the direction of the other person. It occurs when one person’s aesthetic belief gives another person a normative reason to move your belief in the direction of mine, on epistemic grounds. In such a case, what the first person believes also provides a justification for the second person’s aesthetic belief. This kind of justification is an indirect just…Read more
Rational aesthetic deference becomes apparent when one person’s aesthetic belief gives another person a reason to move his own aesthetic belief in the direction of the other person. It occurs when one person’s aesthetic belief gives another person a normative reason to move your belief in the direction of mine, on epistemic grounds. In such a case, what the first person believes also provides a justification for the second person’s aesthetic belief. This kind of justification is an indirect justification because it is based on reasons that merit deferring to someone else’s judgment, rather than on reasons that support..

Aesthetic Cognition
296

The Acquaintance Principle, Aesthetic Autonomy, and Aesthetic Appreciation
British Journal of Aesthetics 52 (2): 153-168. 2012.

The acquaintance principle (AP) and the view it expresses have recently been tied to a debate surrounding the possibility of aesthetic testimony, which, plainly put, deals with the question whether aesthetic knowledge can be acquired through testimony—typically aesthetic and non-aesthetic descriptions communicated from person to person. In this context a number of suggestions have been put forward opting for a restricted acceptance of AP. This paper is an attempt to restrict AP even more

Testimony, Misc Aesthetic Judgment Aesthetic Knowledge
309

The Problem with Uniform Solutions to Peer Disagreement
Theoria 79 (2): 96-126. 2013.

Contributors to the recent disagreement debate have sought to provide a uniform response to cases in which epistemic peers disagree about the epistemic import of a shared body of evidence, no matter what kind of evidence they are disagreeing about. The varied cases addressed in the literature have included examples of disagreement about restaurant bills, court verdicts, weather forecasting, chess, morality, religious beliefs, and even disagreements about philosophical disagreements. The equal tr…Read more
Contributors to the recent disagreement debate have sought to provide a uniform response to cases in which epistemic peers disagree about the epistemic import of a shared body of evidence, no matter what kind of evidence they are disagreeing about. The varied cases addressed in the literature have included examples of disagreement about restaurant bills, court verdicts, weather forecasting, chess, morality, religious beliefs, and even disagreements about philosophical disagreements. The equal treatment of these varied cases has motivated the search for a uniform response to peer disagreement wherever it is encountered. In this article I challenge this prevalent approach in the literature. I grant the notion of epistemic peer and accept that being a peer may amount to the same thing in different domains; nonetheless I contend that different domains appear to call for different responses to disagreement. I argue that the appropriate response to finding out about a disagreement with a peer is different in different domains.

Disagreement in Philosophy Social Epistemology, Miscellaneous Epistemology of Disagreement
193

Epistemic Value and Epistemic Compromise, A Reply to Moss
Episteme 10 (1): 87-97. 2013.

In this paper I present a criticism of Sarah Moss‘ recent proposal to use scoring rules as a means of reaching epistemic compromise in disagreements between epistemic peers that have encountered conflict. The problem I have with Moss‘ proposal is twofold. Firstly, it appears to involve a double counting of epistemic value. Secondly, it isn‘t clear whether the notion of epistemic value that Moss appeals to actually involves the type of value that would be acceptable and unproblematic to regard as…Read more
In this paper I present a criticism of Sarah Moss‘ recent proposal to use scoring rules as a means of reaching epistemic compromise in disagreements between epistemic peers that have encountered conflict. The problem I have with Moss‘ proposal is twofold. Firstly, it appears to involve a double counting of epistemic value. Secondly, it isn‘t clear whether the notion of epistemic value that Moss appeals to actually involves the type of value that would be acceptable and unproblematic to regard as epistemic.

Formal Epistemology, Misc Scoring Rules Formal Social Epistemology Epistemology of Disagreement Peer Dis…Read more
Formal Epistemology, Misc Scoring Rules Formal Social Epistemology Epistemology of Disagreement Peer Disagreement

Amir Konigsberg

Beyond Behavior: Why AI Evaluation Needs a Cognitive Revolution

Beyond Behavior: Why AI Evaluation Needs a Cognitive Revolution

Cognitive Sovereignty: The Authorship Problem in AI-Assisted Thought

Can AI Agents Agree to Disagree? Aumann's Theorem and the Epistemic Status of Machine Outputs

Aesthetic Educators, Aesthetic Experts, and Deferential Belief Formation
Journal of Aesthetic Education 50 (1): 34-45. 2016.

The Acquaintance Principle, Aesthetic Autonomy, and Aesthetic Appreciation
British Journal of Aesthetics 52 (2): 153-168. 2012.

The Problem with Uniform Solutions to Peer Disagreement
Theoria 79 (2): 96-126. 2013.

Epistemic Value and Epistemic Compromise, A Reply to Moss
Episteme 10 (1): 87-97. 2013.

Amir Konigsberg

Beyond Behavior: Why AI Evaluation Needs a Cognitive Revolution

Beyond Behavior: Why AI Evaluation Needs a Cognitive Revolution

Cognitive Sovereignty: The Authorship Problem in AI-Assisted Thought

Can AI Agents Agree to Disagree? Aumann's Theorem and the Epistemic Status of Machine Outputs

Aesthetic Educators, Aesthetic Experts, and Deferential Belief Formation Journal of Aesthetic Education 50 (1): 34-45. 2016.

The Acquaintance Principle, Aesthetic Autonomy, and Aesthetic Appreciation British Journal of Aesthetics 52 (2): 153-168. 2012.

The Problem with Uniform Solutions to Peer Disagreement Theoria 79 (2): 96-126. 2013.

Epistemic Value and Epistemic Compromise, A Reply to Moss Episteme 10 (1): 87-97. 2013.

Aesthetic Educators, Aesthetic Experts, and Deferential Belief Formation
Journal of Aesthetic Education 50 (1): 34-45. 2016.

The Acquaintance Principle, Aesthetic Autonomy, and Aesthetic Appreciation
British Journal of Aesthetics 52 (2): 153-168. 2012.

The Problem with Uniform Solutions to Peer Disagreement
Theoria 79 (2): 96-126. 2013.

Epistemic Value and Epistemic Compromise, A Reply to Moss
Episteme 10 (1): 87-97. 2013.