Imagining and building wise machines: The centrality of AI metacognition

Although AI has become increasingly smart, its wisdom has not kept pace. In this article, we examine what is known about human wisdom and sketch a vision of its AI counterpart. We analyze human wisdom as a set of strategies for solving intractable problems-those outside the scope of analytic techniques-including both object-level strategies like heuristics [for managing problems] and metacognitive strategies like intellectual humility, perspective-taking, or context-adaptability [for managing object-level strategies]. We argue that AI systems particularly struggle with metacognition; improved metacognition would lead to AI more robust to novel environments, explainable to users, cooperative with others, and safer in risking fewer misaligned goals with human users. We discuss how wise AI might be benchmarked, trained, and implemented.
View on arXiv@article{johnson2025_2411.02478, title={ Imagining and building wise machines: The centrality of AI metacognition }, author={ Samuel G. B. Johnson and Amir-Hossein Karimi and Yoshua Bengio and Nick Chater and Tobias Gerstenberg and Kate Larson and Sydney Levine and Melanie Mitchell and Iyad Rahwan and Bernhard Schölkopf and Igor Grossmann }, journal={arXiv preprint arXiv:2411.02478}, year={ 2025 } }