University of Texas at Austin
Computer Science
PhD, 2016
CV
Los Angeles, California, United States of America
Areas of Specialization
Philosophy of Artificial Intelligence
  •  201
    Despite significant progress, the AI alignment literature has largely overlooked a philosophical dilemma: \textit{humans and machines represent and communicate uncertainty in fundamentally incompatible ways.} To formalize the arguments around this dilemma, we introduce a four-quadrant taxonomy of uncertainty that partitions uncertainty according to two dimensions: whether uncertainty arises from a human or machine agent, and whether it remains internal or is communicated externally. The taxonomy…Read more
  •  173
    How do abstract singular terms like "the Pythagoras Theorem" refer? While theories of reference have been extensively developed for concrete singular terms (proper names, definite descriptions), the question of reference to abstract objects remains under-explored. This essay argues that abstract singular terms do refer, obeying similar principles as concrete singular terms while presenting distinct philosophical challenges. We examine three prominent theories of reference—Millianism, the Frege-R…Read more
  •  236
    In "Is Power-Seeking AI an Existential Risk?" Joe Carlsmith provides an intriguing analysis of the potential pathways from advanced artificial intelligence to existential catastrophe. This essay contends that while the report's framework offers a fundamental contribution to AI safety, its application of longtermist principles is incomplete. By focusing almost exclusively on catastrophic failure modes, it overlooks the symmetrical potential for "uber-beneficence" i.e., outcomes of profound and du…Read more