-
18Language Models’ Hall of Mirrors Problem: Why AI Alignment Requires Peircean SemiosisPhilosophy and Technology 39 (1): 9. 2026.This paper examines some limitations of large language models (LLMs) through the framework of Peircean semiotics. We argue that basic LLMs exist within a “hall of mirrors,” reflecting only the linguistic surface of training data without indexical grounding in a shared external world, and manipulating symbols without participation in socially-mediated epistemology. We then argue that newer developments, including extended context windows, persistent memory, and mediated interactions with reality,…Read more
-
1287Language Models’ Hall of Mirrors Problem: Why AI Alignment Requires Peircean Semiosis (2nd ed.)Philosophy and Technology. forthcoming.This paper examines some limitations of large language models (LLMs) through the framework of Peircean semiotics. We argue that basic LLMs exist within a "hall of mirrors," manipulating symbols without indexical grounding or participation in socially-mediated epistemology. We then argue that newer developments, including extended context windows, persistent memory, and mediated interactions with reality, are moving towards making newer Artificial Intelligence (AI) systems into genuine Peircean i…Read more
-
68The necessity of AI audit standards boardsAI and Society 40 (8): 6609-6624. 2025.Auditing of AI systems is a promising way to understand and manage ethical problems and societal risks associated with contemporary AI systems, as well as some anticipated future risks. Efforts to develop standards for auditing artificial intelligence (AI) systems have therefore understandably gained momentum. However, current approaches are not just insufficient, but can be actively harmful. Transparency alone does not address concerns about risk. Internal auditing is insufficient, and easily b…Read more
-
146Value Alignment for Advanced Artificial Judicial IntelligenceAmerican Philosophical Quarterly 60 (2): 187-203. 2023.This paper considers challenges resulting from the use of advanced artificial judicial intelligence (AAJI). We argue that these challenges should be considered through the lens of value alignment. Instead of discussing why specific goals and values, such as fairness and nondiscrimination, ought to be implemented, we consider the question of how AAJI can be aligned with goals and values more generally, in order to be reliably integrated into legal and judicial systems. This value alignment framin…Read more
-
8956How much value can our decisions create? We argue that unless our current understanding of physics is wrong in fairly fundamental ways, there exists an upper limit of value relevant to our decisions. First, due to the speed of light and the definition and conception of economic growth, the limit to economic growth is a restrictive one. Additionally, a related far larger but still finite limit exists for value in a much broader sense due to the physics of information and the ability of physical b…Read more
-
2701The Fragile World Hypothesis: Complexity, Fragility, and Systemic Existential RiskFutures. forthcoming.The possibility of social and technological collapse has been the focus of science fiction tropes for decades, but more recent focus has been on specific sources of existential and global catastrophic risk. Because these scenarios are simple to understand and envision, they receive more attention than risks due to complex interplay of failures, or risks that cannot be clearly specified. In this paper, we discuss the possibility that complexity of a certain type leads to fragility which can fun…Read more
David Manheim
Association for Long Term Existence and Resilience
Technion, Israel Institute of Technology
-
Association for Long Term Existence and ResilienceOther
-
Technion, Israel Institute of TechnologyDepartment of Humanities and ArtsVisiting Researcher (Part-time)
Pardee RAND
Alumnus, 2017
Areas of Specialization
| Effective Altruism |
| Decision Theory |
| Health Sciences |
| Game Theory |
| Topics in Decision Theory, Misc |