Alex Bogdan (Military-Space Academy A.F. Mozhaisky (Alumnus)): Publications

87

Relationally Calibrated Guard Rails: Machine Psychometrics as a Response to the Guard Rails Paradox

We reframe the central challenge of large language model safety guardrails as a psychometric calibration problem rather than a binary policy dispute. Jamhour’s Guard Rails and Distributed Relational Cognition [1] names a genuine tension: safety measures designed to suppress manipulation, dependence, sycophancy, and unsafe advice may simultaneously erode the relational conditions that make sustained human–AI cognitive partnership possible— temporal depth, epistemic vulnerability, meta-cognitive r…Read more
We reframe the central challenge of large language model safety guardrails as a psychometric calibration problem rather than a binary policy dispute. Jamhour’s Guard Rails and Distributed Relational Cognition [1] names a genuine tension: safety measures designed to suppress manipulation, dependence, sycophancy, and unsafe advice may simultaneously erode the relational conditions that make sustained human–AI cognitive partnership possible— temporal depth, epistemic vulnerability, meta-cognitive reflection, calibrated disagreement, and long-horizon collaboration. The diagnosis is persuasive but leaves an unresolved design question: how can a safety system distinguish harmful entanglement from productive cognitive coupling without retreating either to permissive romanticism or blunt paternalism? We argue that Machine Psychometrics [3] supplies the missing measurement layer. We develop three contributions. First, we reinterpret guardrails through Signal Detection Theory: under-restriction is a miss, over-restriction a false alarm, and the central design challenge is to increase discriminability rather than shift the criterion toward refusal. Second, we extend Machine Psychometrics from agent-level Mindprints to a relational measurement layer—the Coupling Profile—that characterizes temporal depth, epistemic openness, calibrated disagreement, meta-cognitive co-construction, conceptual yield, boundary clarity, and dependency pressure in human–AI partnerships. Third, we propose a Guardrail Calibration Protocol that uses adaptive probe batteries, perturbation testing, psychometric validation, longitudinal drift monitoring, and context-bounded validity envelopes to design safety regimes that preserve valuable coupling while limiting genuine risk. The framework accepts the reality of sycophancy, manipulation, over-attachment, and unsafe advice but rejects the assumption that these risks can only be addressed by flattening the relational field. It also avoids taking a premature stance on artificial consciousness: productive human–AI coupling deserves measurement and protection as a behavioral and epistemic phenomenon whether or not future artificial systems ever qualify as conscious subjects. The aim is not to weaken guardrails but to replace blunt guardrails with measured boundaries.

Artificial Minds, Miscellaneous Philosophy of AI, General Works Ethics of Artificial Intelligence, Mis…Read more
Artificial Minds, Miscellaneous Philosophy of AI, General Works Ethics of Artificial Intelligence, Miscellaneous Large Language Models Artificial Intelligence Safety
346

Respectful Skepticism About Strong Impossibility Claims in The Abstraction Fallacy

Alexander Lerchner's The Abstraction Fallacy offers one of the clearest recent arguments against the claim that advanced artificial systems could ever instantiate consciousness. The paper is intelligent, provocative, and genuinely valuable. Its central service is to challenge naive forms of computational triumphalism and to warn against the increasingly common slide from behavioral sophistication to ontological attribution. On that point, it deserves careful attention. Where it becomes less pers…Read more
Alexander Lerchner's The Abstraction Fallacy offers one of the clearest recent arguments against the claim that advanced artificial systems could ever instantiate consciousness. The paper is intelligent, provocative, and genuinely valuable. Its central service is to challenge naive forms of computational triumphalism and to warn against the increasingly common slide from behavioral sophistication to ontological attribution. On that point, it deserves careful attention. Where it becomes less persuasive is in the transition from critique to closure. It moves from exposing weaknesses in simplified versions of computational functionalism to a much stronger conclusion, namely that algorithmic symbol manipulation is structurally incapable of instantiating consciousness. This article offers a respectful but firm reply. It argues that Lerchner's case depends on a contested conception of computation, treats concept formation as if it must already presuppose phenomenal consciousness, and too quickly converts underdetermination in interpretation into an impossibility theorem for machine sentience. It also moves too easily from limitations of current digital architectures to sweeping claims about all possible future artificial realizations of mind. From the standpoint of long experience in artificial intelligence, especially evolutionary computing, this is the central concern. Search processes often discover workable realizations that prior theory failed to anticipate. For that reason, universal negative claims about future forms of artificial organization should be made with unusual restraint. The more prudent conclusion is not that present AI systems are conscious. It is also not that artificial systems can only simulate and never instantiate consciousness. The stronger claim has not been established. A wiser position is disciplined uncertainty. Present systems do not provide compelling evidence of consciousness. Behavioral sophistication alone is not enough. The ontology of computation remains contested. But none of this yet licenses a final verdict on the outer limits of artificial mind.

Alex Bogdan

Relationally Calibrated Guard Rails: Machine Psychometrics as a Response to the Guard Rails Paradox

Respectful Skepticism About Strong Impossibility Claims in The Abstraction Fallacy