•  17
    Explainable and Human-Grounded AI for Decision Support Systems: The Theory of Epistemic Quasi-Partnerships
    with Maximilian Moll
    In Vincent C. Müller, Leonard Dung, Guido Löhr & Aliya Rumana (eds.), Philosophy of Artificial Intelligence: The State of the Art, Springernature. pp. 83-103. 2026.
    In the context of AI decision support systems (AI-DSS), we argue that meeting the demands of ethical and explainable AI (XAI) is about developing AI-DSS to provide human decision-makers with three types of human-grounded explanations: reasons, counterfactuals, and confidence, an approach we refer to as the RCC approach. We begin by reviewing current empirical XAI literature that investigates the relationship between various methods for generating model explanations (e.g., LIME, SHAP, Anchors), t…Read more
  •  21
    “Know thyself.” Once carved into the stone at Delphi and popularized by Socrates as the guiding maxim of philosophy, this command captures the enduring human struggle to examine and justify our own beliefs. This book asks how such a capacity for self-knowledge could have arisen—our ability to reflect on and regulate what we believe in light of epistemic norms like justification and truth. To answer that question, it weaves together insights from comparative and developmental psychology, cognitiv…Read more
  •  20
    This study explores whether labeling AI as either “trustworthy” or “reliable” influences user perceptions and acceptance of automotive AI technologies. Utilizing a one-way between-subjects design, the research presented online participants (N = 478) with a text presenting guidelines for either trustworthy or reliable AI, before asking them to evaluate 3 vignette scenarios and fill in a modified version of the Technology Acceptance Model which covers different variables, such as perceived ease of…Read more
  •  201
    This paper presents a systematic review of explainable reinforcement learning methodologies with an emphasis on human-centered evaluation frameworks. Drawing from literature between 2017 and 2025, we apply and extend the Reasons, Confidence, and Counterfactuals (RCC) framework—originally designed for supervised learning—to reinforcement learning contexts. Our analysis reveals two predominant explanatory strategies: constructive, where explicit explanations are generated, and supportive, where us…Read more
  •  887
    How did humans come to know themselves—not just to have thoughts, but to reflect on those thoughts and ask whether they’re true or justified? And as our world slowly integrates a new kind of intelligence—non-human, non-animal, and increasingly entangled with our thinking—what might this mean for the future of self-knowledge? What is the relationship between artificial intelligence and our capacity to justify what we believe? Are AI-curated information environments enhancing this capacity—or hija…Read more
  •  678
    In this Comment, we critique the growing “AI welfare” movement and propose a novel guideline, the Precarity Guideline, to determine care entitlement. In contrast to approaches that emphasize potential for suffering, the Precarity Guideline is grounded in empirically identifiable features. The severity of ongoing humanitarian crises, biodiversity loss, and climate change provides additional reasons to prioritize the needs of living beings over machine learning algorithms as candidates for care.
  •  687
    This chapter examines possible ramifications of mindshaping a social robot. It explores how such an agent might learn to represent psychological states, align its behavior with evolving societal norms, and develop capacities for self-directed mindreading and normative self-knowledge. Integrating perspectives from cultural evolution and naturalized intentionality, this approach suggests that social robots could achieve a level of norm-based self-regulation typically reserved for humans, fulfillin…Read more
  •  71
    Epistemic and Moral Agency in a Crisis of Trust?
    International Journal of Philosophical Studies 32 (4): 484-493. 2024.
    1. In The Tinkering Mind, Tillmann Vierkant embarks on a profound exploration of epistemic and moral agency, delving into the nuanced interplay between intentional control, evaluative processes, an...
  •  664
    The first premise of Schellenberg’s particularity argument reads, “If a subject S perceives a particular α, then S discriminates and singles out α” (2018: 25). But this is false if seeing a ganzfeld is possible (i.e., a homogeneous field without any particulars to discriminate). In response, Schellenberg argues that seeing a ganzfeld is impossible by appealing to the ganzfeld effect (viz. hallucinatory experiences caused by ganzfeld exposure) exclusively as a ‘sense of blindness’. I present two …Read more
  •  1059
    Many policies and ethical guidelines recommend developing “trustworthy AI”. We argue that developing morally trustworthy AI is not only unethical, as it promotes trust in an entity that cannot be trustworthy, but it is also unnecessary for optimal calibration. Instead, we show that reliability, exclusive of moral trust, entails the appropriate normative constraints that enable optimal calibration and mitigate the vulnerability that arises in high-stakes hybrid decision-making environments, witho…Read more
  •  1347
    Are noetic feelings embodied? The case for embodied metacognition
    Philosophical Psychology 1 (2): 1-23. 2023.
    One routinely undergoes a noetic feeling (also called “metacognitive feeling” or “epistemic feeling”), the so-called “feeling of knowing”, whenever trying to recall a person’s name. One feels the name is known despite being unable to recall it. Other experiences also fall under this category, e.g., the tip-of-the-tongue experience, the feeling of confidence. A distinguishing characteristic of noetic feelings is how they are crucially related to the facts we know, so much so that the activation o…Read more
  •  897
    Hijacking Epistemic Agency - How Emerging Technologies Threaten our Wellbeing as Knowers
    Proceedings of the 2022 Aaai/Acm Conference on Ai, Ethics, and Society 1. 2022.
    The aim of this project to expose the reasons behind the pandemic of misinformation (henceforth, PofM) by examining the enabling conditions of epistemic agency and the emerging technologies that threaten it. I plan to research the emotional origin of epistemic agency, i.e. on the origin of our capacity to acquire justification for belief, as well as on the significance this emotional origin has for our lives as epistemic agents in our so-called Misinformation Age. This project has three objectiv…Read more
  •  84
    Embodied metacognition: how we feel our hearts to know our minds
    Dissertation, University of Edinburgh. 2022.
    The aim of the present work is to make a plausible case for the phylogenetic origin of self-knowledge, one which is compatible with a prevalent view about its ontogenetic origin, the social-scaffolding view. Essentially, the phylogenetic origin is generally argued to be evaluative metacognition, i.e. a system of cognitive control mechanisms, while the ontogenetic origin is generally argued to be mindreading, i.e. cognitive capacities supporting mental state attribution. So put simply, the presen…Read more
  •  132
    On the basis of Descartes’s account of the passions of the soul, we argue that current interoception-based theories of emotions cannot account for the hallmark of a passion of the soul, i.e., that its effects are felt as being in the soul itself. We also pay attention to the epistemic functions of the passions and to Descartes’s category of emotions that are caused and occur in the soul alone. Certain passions of the soul and certain internal emotions are similar to what are today called ‘episte…Read more
  •  1017
    Upon first hearing sinewaves, all that can be discerned are beeps and whistles. But after hearing the original speech, the beeps and whistles sound like speech. The difference between these two episodes undoubtedly involves an alteration in phenomenal character. O’Callaghan (2011) argues that this alteration is non-sensory, but he leaves open the possibility of attributing it to some other source, e.g. cognition. I discuss whether the alteration in phenomenal character involved in sinewave speec…Read more
  •  991
    Irreducible Cognitive Phenomenology and the AHA! Experience
    Phenomenology and Mind 10 108-121. 2016.
    Elijah Chudnoff’s case for irreducible cognitive phenomenology hinges on seeming to see the truth of a mathematical proposition (Chudnoff 2015). In the following, I develop an augmented version of Chudnoff’s case, not based on seeming to see, or intuition, but based on being in a state with presentational phenomenology of high-level content. In contrast to other cases for cognitive phenomenology, those based on Strawson’s case (Strawson 2011), I argue that the case presented here is able to with…Read more