Robust AI Security and Alignment: A Sisyphean Endeavor?

10 December 2025

Apostol Vassilev

AAML

ArXiv (abs)PDF HTML

Main:13 Pages

2 Figures

Bibliography:2 Pages

Appendix:1 Pages

Abstract

This manuscript establishes information-theoretic limitations for robustness of AI security and alignment by extending Gödel's incompleteness theorem to AI. Knowing these limitations and preparing for the challenges they bring is critically important for the responsible adoption of the AI technology. Practical approaches to dealing with these challenges are provided as well. Broader implications for cognitive reasoning limitations of AI systems are also proven.

View on arXiv

Comments on this paper