81

Robust AI Security and Alignment: A Sisyphean Endeavor?

Apostol Vassilev
Main:13 Pages
2 Figures
Bibliography:2 Pages
Appendix:1 Pages
Abstract

This manuscript establishes information-theoretic limitations for robustness of AI security and alignment by extending Gödel's incompleteness theorem to AI. Knowing these limitations and preparing for the challenges they bring is critically important for the responsible adoption of the AI technology. Practical approaches to dealing with these challenges are provided as well. Broader implications for cognitive reasoning limitations of AI systems are also proven.

View on arXiv
Comments on this paper