•  159
    The ubiquitous integration of AI-powered systems in morally consequential decision-making procedures raises a thorny question: when such systems generate harm, who should be held responsible? A prominent regulatory response proposes that suitably designed control architectures can ensure that blame is appropriately allocated. To live up to this promise, the proposed architectures must be both normatively adequate and regarded as such, since otherwise they are at best practically useless, and at …Read more
  •  194
    Philosophers disagree about what makes news “fake”: some believe it is falsity, whereas others emphasize intentional deception about the content or the source. We report two preregistered studies (total N=1200) testing whether falsity and source deception predict fake-news classification. Participants evaluated scenarios in which a claim was either true or false, and the article appeared on either the official New York Times website or that of a near-identical impersonator. In both studies, fals…Read more
  •  252
    Debiasing Legal Judgement: Outcome Effects in Open-source LLMs
    Journal of Institutional and Theoretical Economics 182 (1). 2026.
    Legal judgment is threatened by a plethora of biases. Among those, the effect of outcome bias – the tendency to let outcome severity distort assessments of mental states and culpability – is particularly well documented and its impact is hypothesized to be pervasive. AI advisory systems could in principle help alleviate human bias, but Almeida et al. (2024) found substantial outcome effects in several commercial LLMs, suggesting AI judgment may be comparably biased. We reassess these findings us…Read more
  •  425
    Two topics at the center of Ethics of AI and HRI regard trust in AI agents as well as the adjudication of moral responsibility in situations where AI causes harm. In this paper we aim to advance the state of the art concerning these topics in several regards: First, we propose and evaluate a new empirical paradigm for measuring appropriate or calibrated trust in AI, that is, attitudes which are neither too trusting nor too cautious. The best way to measure calibrated trust, we argue, is by contr…Read more
  •  50
    Partial Aggregation: What the People Think
    Australasian Journal of Philosophy 104 (1): 260-281. 2026.
    This article reports three empirical studies regarding partially aggregative moral theories in distributive ethics (total N = 417). We begin by documenting the widespread occurrence of the intuitions that motivate partial aggregation views. Thereafter, we advance the literature along two dimensions: First, we extend experimental work by ascertaining which amongst existing versions of partial aggregation (localised vs. global) chimes more fully with moral common sense. Specifically, we document h…Read more
  •  1516
    The Hard Problem of AI Alignment: Value Forks in Moral Judgment
    Proceedings of the 2025 Acm Conference on Fairness, Accountability, and Transparency. 2025.
    Complex moral trade-offs are a basic feature of human life: for example, confronted with scarce medical resources, doctors must frequently choose who amongst equally deserving candidates receives medical treatment. But choosing what to do in moral trade-offs is no longer a ‘humans-only’ task, but often falls to AI agents. In this article, we report findings from a series of experiments (N=1029) intended to establish whether agent-type (Human vs. AI) matters for what should be done in moral trade…Read more
  •  44
    The folk concept of art
    Synthese 205 (1): 1-27. 2024.
    What is the folk concept of art? Does it track any of the major definitions of art philosophers have proposed? In two preregistered experiments (N = 888) focusing on two types of artworks (paintings and musical works), we manipulate three potential features of artworks: intentional creation, the possession of aesthetic value, and institutional recognition. This allows us to investigate whether the folk concept of art fits an essentialist definition drawing on one or more of the manipulated facto…Read more
  •  62
    Responsibility Gaps and Retributive Dispositions: Evidence from the US, Japan and Germany
    with Markus Christen
    Science and Engineering Ethics 30 (6): 1-19. 2024.
    Danaher (2016) has argued that increasing robotization can lead to retribution gaps: Situations in which the normative fact that nobody can be justly held responsible for a harmful outcome stands in conflict with our retributivist moral dispositions. In this paper, we report a cross-cultural empirical study based on Sparrow’s (2007) famous example of an autonomous weapon system committing a war crime, which was conducted with participants from the US, Japan and Germany. We find that (1) people m…Read more
  •  102
    The folk concept of the good life: neither happiness nor well-being
    Philosophical Studies 181 (10): 2525-2538. 2024.
    The concept of a good life is usually assumed by philosophers to be equivalent to that of well-being, or perhaps of a morally good life, and hence has received little attention as a potentially distinct subject matter. In a series of experiments participants were presented with vignettes involving socially sanctioned wrongdoing toward outgroup members. Findings indicated that, for a large majority, judgments of bad character strongly reduce ascriptions of the good life, while having no impact at…Read more
  •  38
    In two experiments (total N=693) we explored whether people are willing to consider paintings made by AI-driven robots as art, and robots as artists. Across the two experiments, we manipulated three factors: (i) agent type (AI-driven robot v. human agent), (ii) behavior type (intentional creation of a painting v. accidental creation), and (iii) object type (abstract v. representational painting). We found that people judge robot paintings and human paintings as art to roughly the same extent. Ho…Read more
  •  80
    Responding to recent concerns about the reliability of the published literature in psychology and other disciplines, we formed the X-Phi Replicability Project (XRP) to estimate the reproducibility of experimental philosophy (osf.io/dvkpr). Drawing on a representative sample of 40 x-phi studies published between 2003 and 2015, we enlisted 20 research teams across 8 countries to conduct a high-quality replication of each study in order to compare the results to the original published findings. We …Read more
  •  43
    According to Anscombe, acting intentionally entails knowledge in action. This thesis has been near-universally rejected due to a well-known counterexample by Davidson: a man intending to make ten legible carbon copies might not believe with confidence, and hence not know, that he will succeed. If he does, however, his action surely counts as intentional. Damaging as it seems, an even more powerful objection can be levelled against Anscombe: while acting, there is as yet no fact of the matter as …Read more
  •  144
    Despite a voluminous literature on happiness and well‐being, debates have been stunted by persistent dissensus on what exactly the subject matter is. Commentators frequently appeal to intuitions about the nature of happiness or well‐being, raising the question of how representative those intuitions are. In a series of studies, we examined lay intuitions involving happiness‐ and well‐being‐related terms to assess their sensitivity to internal (psychological) versus external conditions. We found t…Read more
  •  1164
    This chapter reviews empirical research on the rules governing assertion and retraction, with a focus on the normative role of truth. It examines whether truth is required for an assertion to be considered permissible, and whether there is an expectation that speakers retract statements that turn out to be false. Contrary to factive norms (such as the influential “knowledge norm”), empirical data suggests that there is no expectation that speakers only make true assertions. Additionally, contrar…Read more
  •  985
    Causation, Norms, and Cognitive Bias
    Cognition 259 (C): 106105. 2025.
    Extant research has shown that ordinary causal judgments are sensitive to normative factors. For instance, agents who violate a norm are standardly deemed more causal than norm-conforming agents in identical situations. In this paper, we explore two competing explanations for the Norm Effect: the Responsibility View and the Bias View. According to the former, the Norm Effect arises because ordinary causal judgment is intimately intertwined with moral responsibility. According to the alternative …Read more
  •  2403
    What is Art? The Role of Intention, Beauty, and Institutional Recognition
    Proceedings of the Annual Meeting of the Cognitive Science Society 45 3039-3047. 2023.
    In two experiments (N=888), we explore to what extent the folk concept of art is compatible with the leading philosophical definitions of art, and whether it is an essentialist or a non-essentialist concept. We manipulate three factors: whether an object is created intentionally, whether it has aesthetic value, and whether it is institutionally recognized. In addition, we also manipulate the artistic domain (visual art or music). The results suggest that none of the three properties is seen by t…Read more
  •  125
    Coordination and expertise foster legal textualism
    with Ivar R. Hannikainen, Kevin P. Tobia, Guilherme da F. C. F. de Almeida, N. Struchiner, P. Bystranowski, V. Dranseika, N. Strohmaier, S. Bensinger, K. Dolinina, B. Janik, Egle Lauraityte, M. Laakasuo, A. Liefgreen, I. Neiders, M. Prochnicki, A. Rosas, J. Sundvall, and Tomasz Zuradzki
    Proceedings of the National Academy of Sciences of the United States of America 119 (44). 2022.
    A cross-cultural survey experiment revealed a dominant tendency to rely on a rule’s letter over its spirit when deciding which behaviors violate the rule. This tendency varied markedly across (k = 15) countries, owing to variation in the impact of moral appraisals on judgments of rule violation. Compared with laypeople, legal experts were more inclined to disregard their moral evaluations of the acts altogether and consequently exhibited stronger textualist tendencies. Finally, we evaluated a pl…Read more
  •  81
    In an important and widely discussed series of studies, Jonathan Phillips and colleagues have suggested that the ordinary concept of happiness has a substantial moral component. For in- stance, two persons who enjoy the same extent of positive emotions and are equally satisfied with their lives are judged as happy to different degrees if one is less moral than the other. Considering that the relation between morality and happiness or self-interest has been one of the central questions of moral p…Read more
  •  74
    Coordination Favors Legal Textualism by Suppressing Moral Valuation
    with Ivar R. Hannikainen, Kevin P. Tobia, Guilherme da F. C. F. Almeida, Noel Struchiner, Piotr Bystranowski, Vilius Dranseika, Niek Strohmaier, Samantha Bensinger, Kristina Dolinina, Bartosz Janik, Egle Lauraityte, Michael Laakasuo, Alice Liefgreen, Ivars Neiders, Maciej Próchnicki, Alejandro Rosas Martinez, Jukka Sundvall, and Tomasz Żuradzki
  •  59
    Human control redressed: comparing AI and human predictability in a real-effort task
    with Serhiy Https://Orcidorg Kandul, Vincent Micheli, Juliane Beck, Thomas Burri, François Https://Orcidorg Fleuret, and Markus Christen
    . 2023.
    Predictability is a prerequisite for effective human control of artificial intelligence (AI). The inability to predict malfunctioning of AI, for example, impedes timely human intervention. In this paper, we empirically investigate how AI’s predictability compares to the predictability of humans in a real-effort task. We show that humans are worse at predicting AI performance than at predicting human performance. Importantly, participants are not aware of the differences in relative predictabilit…Read more
  •  45
    The reasonable person standard is of great importance to US criminal and tort law. According to the law, whether or not an agent acted reasonably does not depend on features of the outcome which are not under her control. Mock juror attributions of reasonableness, however, are shown to be outcome-dependent. A series of experiments reveals that this outcome-dependence does not constitute a bias, since the very folk concept of reasonableness is outcome-sensitive. Consequently, the law makes a mist…Read more
  •  1391
    While philosophers hold that it is patently absurd to blame robots or hold them morally responsible [1], a series of recent empirical studies suggest that people do ascribe blame to AI systems and robots in certain contexts [2]. This is disconcerting: Blame might be shifted from the owners, users or designers of AI systems to the systems themselves, leading to the diminished accountability of the responsible human agents [3]. In this paper, we explore one of the potential underlying reasons for …Read more
  •  1632
    The reasonable person standard is key to both Criminal Law and Torts. What does and does not count as reasonable behavior and decision-making is frequently deter- mined by lay jurors. Hence, laypeople’s understanding of the term must be considered, especially whether they use it predominately in an evaluative fashion. In this corpus study based on supervised machine learning models, we investigate whether laypeople use the expression ‘reasonable’ mainly as a descriptive, an evaluative, or merely…Read more
  •  918
    Causation, Foreseeability, and Norms
    Proceedings of the Annual Meeting of the Cognitive Science Society 45. 2023.
    A growing body of literature has revealed ordinary causal judgement to be sensitive to normative factors, such that a norm-violating agent is regarded more causal than their non-norm-violating counterpart. In this paper, we explore two competing explanations for this phenomenon: the Responsibility View and the Bias View. The Bias View, but not the Responsibility View, predicts features peripheral to the agent’s responsibility to impact causal attributions. In a series of three preregistered expe…Read more
  •  1180
    Partial Aggregation: What the People Think
    Australasian Journal of Philosophy 1. 2025.
    This article reports three empirical studies regarding partially aggregative moral theories in distributive ethics (total N = 417). We begin by documenting the widespread occurrence of the intuitions that motivate partial aggregation views. Thereafter, we advance the literature along two dimensions: First, we extend experimental work by ascertaining which amongst existing versions of partial aggregation (localised vs. global) chimes more fully with moral common sense. Specifically, we document h…Read more
  •  1596
    The concept of a good life is usually assumed by philosophers to be equivalent to that of well-being, or perhaps of a morally good life, and hence has received little attention as a potentially distinct subject matter. In a series of experiments participants were presented with vignettes involving socially sanctioned wrongdoing toward outgroup members. Findings indicated that, for a large majority, judgments of bad character strongly reduce ascriptions of the good life, while having no impact at…Read more
  •  137
    Outcome effects, moral luck and the hindsight bias
    Cognition 232 (C): 105258. 2022.
    In a series of ten preregistered experiments (N=2043), we investigate the effect of outcome valence on judgments of probability, negligence, and culpability – a phenomenon sometimes labelled moral (and legal) luck. We found that harmful outcomes, when contrasted with neutral outcomes, lead to increased perceived probability of harm ex post, and consequently to increased attribution of negligence and culpability. Rather than simply postulating a hindsight bias (as is common), we employ a variety …Read more