-
393Bridging the Gap in Responsible AI DividesUnder Review. forthcoming.Tensions between AI Safety (AIS) and AI Ethics (AIE) have increasingly surfaced in AI governance and public debates about AI, leading to what we term the “responsible AI divides.” We introduce a model that categorizes four modes of engagement with the tensions: radical confrontation, disengagement, compartmentalized coexistence, and critical bridging. We then investigate how critical bridging, with a particular focus on bridging problems, offers one of the most viable constructive paths for adva…Read more
-
422Characterizing AI Agents for Alignment and GovernanceNature. forthcoming.The creation of effective governance mechanisms for AI agents requires a deeper understanding of their core properties and how these properties relate to questions surrounding the deployment and operation of agents in the world. This paper provides a characterization of AI agents that focuses on four dimensions: autonomy, efficacy, goal complexity, and generality. We propose different gradations for each dimension, and argue that each dimension raises unique questions about the design, operation…Read more
-
2Contemporary Debates in the Ethics of Artificial Intelligence (edited book)Wiley-Blackwell. 2026._A cutting-edge selection of current issues and explorations of the ethics of artificial intelligence_ As artificial intelligence continues to influence virtually every facet of modern life, _Contemporary Debates in the Ethics of Artificial Intelligence_ offers a timely and rigorous examination of the field's most pressing questions. Equally useful in the classroom or as a reference for interdisciplinary research, this volume fosters informed and critical engagement with the ethical dimensions o…Read more
-
70Beyond model interpretability: socio-structural explanations in machine learningAI and Society 40 (4): 2045-2053. 2025.What is it to interpret the outputs of an opaque machine learning model? One approach is to develop interpretable machine learning techniques. These techniques aim to show how machine learning models function by providing either model-centric local or global explanations, which can be based on mechanistic interpretations (revealing the inner working mechanisms of models) or non-mechanistic approximations (showing input feature–output data relationships). In this paper, we draw on social philosop…Read more
-
3249Two Types of AI Existential Risk: Decisive and AccumulativePhilosophical Studies 182 (7): 1975-2003. 2025.The conventional discourse on existential risks (x-risks) from AI typically focuses on abrupt, dire events caused by advanced AI systems, particularly those that might achieve or surpass human-level intelligence. These events have severe consequences that either lead to human extinction or irreversibly cripple human civilization to a point beyond recovery. This decisive view, however, often neglects the serious possibility of AI x-risk manifesting gradually through an incremental series of small…Read more
-
67Beyond model interpretability: socio-structural explanations in machine learningAI and Society 1-9. forthcoming.What is it to interpret the outputs of an opaque machine learning model? One approach is to develop interpretable machine learning techniques. These techniques aim to show how machine learning models function by providing either model-centric local or global explanations, which can be based on mechanistic interpretations (revealing the inner working mechanisms of models) or non-mechanistic approximations (showing input feature–output data relationships). In this paper, we draw on social philosop…Read more
-
1391Explanation Hacking: The perils of algorithmic recourseIn Juan Manuel Durán & Giorgia Pozzi (eds.), Philosophy of science for machine learning: Core issues and new perspectives, Springer. forthcoming.We argue that the trend toward providing users with feasible and actionable explanations of AI decisions—known as recourse explanations—comes with ethical downsides. Specifically, we argue that recourse explanations face several conceptual pitfalls and can lead to problematic explanation hacking, which undermines their ethical status. As an alternative, we advocate that explanations of AI decisions should aim at understanding.
-
1317Intelligent capacities in artificial systemsIn William A. Bauer & Anna Marmodoro (eds.), Artificial Dispositions: Investigating Ethical and Metaphysical Issues, Bloomsbury Academic. 2023.This paper investigates the nature of dispositional properties in the context of artificial intelligence systems. We start by examining the distinctive features of natural dispositions according to criteria introduced by McGeer (2018) for distinguishing between object-centered dispositions (i.e., properties like ‘fragility’) and agent-based abilities, including both ‘habits’ and ‘skills’ (a.k.a. ‘intelligent capacities’, Ryle 1949). We then explore to what extent the distinction applies to artif…Read more
-
928Algorithmic Fairness and Structural Injustice: Insights from Feminist Political PhilosophyAies '22: Proceedings of the 2022 Aaai/Acm Conference on Ai, Ethics, and Society. 2022.Data-driven predictive algorithms are widely used to automate and guide high-stake decision making such as bail and parole recommendation, medical resource distribution, and mortgage allocation. Nevertheless, harmful outcomes biased against vulnerable groups have been reported. The growing research field known as 'algorithmic fairness' aims to mitigate these harmful biases. Its primary methodology consists in proposing mathematical metrics to address the social harms resulting from an algorithm'…Read more
-
2401In Conversation with Artificial Intelligence: Aligning language Models with Human ValuesPhilosophy and Technology 36 (2): 1-24. 2023.Large-scale language technologies are increasingly used in various forms of communication with humans across different contexts. One particular use case for these technologies is conversational agents, which output natural language text in response to prompts and queries. This mode of engagement raises a number of social and ethical questions. For example, what does it mean to align conversational agents with human norms or values? Which norms or values should they be aligned with? And how can t…Read more
-
880The Use and Misuse of Counterfactuals in Ethical Machine LearningIn Atoosa Kasirzadeh & Andrew Smart (eds.), ACM Conference on Fairness, Accountability, and Transparency (FAccT 21), . 2021.The use of counterfactuals for considerations of algorithmic fairness and explainability is gaining prominence within the machine learning community and industry. This paper argues for more caution with the use of counterfactuals when the facts to be considered are social categories such as race or gender. We review a broad body of papers from philosophy and social sciences on social ontology and the semantics of counterfactuals, and we conclude that the counterfactual approach in machine learni…Read more
-
33Boyer-Kassem et al.'s Scientific Collaboration and Collective Knowledge (review)BJPS Review of Books. 2018.
-
1460Counter Countermathematical ExplanationsErkenntnis 88 (6): 2537-2560. 2021.Recently, there have been several attempts to generalize the counterfactual theory of causal explanations to mathematical explanations. The central idea of these attempts is to use conditionals whose antecedents express a mathematical impossibility. Such countermathematical conditionals are plugged into the explanatory scheme of the counterfactual theory and—so is the hope—capture mathematical explanations. Here, I dash the hope that countermathematical explanations simply parallel counterfactua…Read more
-
1704The Ethical Gravity Thesis: Marrian Levels and the Persistence of Bias in Automated Decision-making SystemsProceedings of the 2021 AAAI/ACM Conference on AI, Ethics, and Society (AIES '21). 2021.Computers are used to make decisions in an increasing number of domains. There is widespread agreement that some of these uses are ethically problematic. Far less clear is where ethical problems arise, and what might be done about them. This paper expands and defends the Ethical Gravity Thesis: ethical problems that arise at higher levels of analysis of an automated decision-making system are inherited by lower levels of analysis. Particular instantiations of systems can add new problems, but no…Read more
-
1019Algorithmic and human decision making: for a double standard of transparencyAI and Society 37 (1): 375-381. 2022.Should decision-making algorithms be held to higher standards of transparency than human beings? The way we answer this question directly impacts what we demand from explainable algorithms, how we govern them via regulatory proposals, and how explainable algorithms may help resolve the social problems associated with decision making supported by artificial intelligence. Some argue that algorithms and humans should be held to the same standards of transparency and that a double standard of transp…Read more
-
1319A New Role for Mathematics in Empirical SciencesPhilosophy of Science 88 (4): 686-706. 2021.Mathematics is often taken to play one of two roles in the empirical sciences: either it represents empirical phenomena or it explains these phenomena by imposing constraints on them. This article identifies a third and distinct role that has not been fully appreciated in the literature on applicability of mathematics and may be pervasive in scientific practice. I call this the “bridging” role of mathematics, according to which mathematics acts as a connecting scheme in our explanatory reasoning…Read more
-
124Otavio Bueno and Steven French. Applying Mathematics: Immersion, Inference, InterpretationPhilosophy of Science 87 (1): 207-211. 2020.
University of Toronto, St. George Campus
PhD, 2021
Pittsburgh, Pennsylvania, United States of America