The Kantian Filter 2.1:
A Formal Deontic Framework for AI Decision Evaluation, Responsibility Attribution and Regulatory Compliance
As AI systems increasingly make high-stakes decisions, existing governance frameworks (including the EU AI Act) remain procedurally underdetermined: they classify risk and impose transparency requirements, but lack a structured, computable mechanism to evaluate whether an individual algorithmic decision rule is morally permissible.
This preprint introduces the Kant…
Read moreThe Kantian Filter 2.1:
A Formal Deontic Framework for AI Decision Evaluation, Responsibility Attribution and Regulatory Compliance
As AI systems increasingly make high-stakes decisions, existing governance frameworks (including the EU AI Act) remain procedurally underdetermined: they classify risk and impose transparency requirements, but lack a structured, computable mechanism to evaluate whether an individual algorithmic decision rule is morally permissible.
This preprint introduces the Kantian Filter 2.0 (KF) — a constraint-based audit system grounded in Kantian ethics and formalised through deontic logic. The framework operationalises three non-aggregative constraints:
• Universalisability (U) — dual test combining logical contradiction check and multi-agent simulation;
• Human Dignity (D) — via Transparency, Non-Manipulation and Fairness Bound, supported by the Counterfactual Disclosure Test (CDT) and its relational variant for non-individualist communities;
• Non-Delegable Responsibility (R)* — using Shapley-value attribution over the sociotechnical responsibility graph.
The KF produces a binary deontic verdict (Permission/Prohibition) alongside a severity score (KF_score) that distinguishes categorical prohibition from redesign requirements. It operates as accountability infrastructure complementary to existing regulation: governance sets what systems may learn; the Filter evaluates what happens inside those boundaries.
The paper includes a strengthened COMPAS case analysis (n=7,214) with sensitivity analysis across fairness metrics and ε values, normative robustness across Kantian, Rawlsian, prioritarian, Nozickian and relational ethics frameworks, and concrete integration workflows for EU AI Act conformity assessment .