104

Blind Inverse Game Theory: Jointly Decoding Rewards and Rationality in Entropy-Regularized Competitive Games

Main:8 Pages
7 Figures
Bibliography:2 Pages
2 Tables
Appendix:13 Pages
Abstract

Inverse Game Theory (IGT) methods based on the entropy-regularized Quantal Response Equilibrium (QRE) offer a tractable approach for competitive settings, but critically assume the agents' rationality parameter (temperature τ\tau) is known a priori. When τ\tau is unknown, a fundamental scale ambiguity emerges that couples τ\tau with the reward parameters (θ\theta), making them statistically unidentifiable. We introduce Blind-IGT, the first statistical framework to jointly recover both θ\theta and τ\tau from observed behavior. We analyze this bilinear inverse problem and establish necessary and sufficient conditions for unique identification by introducing a normalization constraint that resolves the scale ambiguity. We propose an efficient Normalized Least Squares (NLS) estimator and prove it achieves the optimal O(N1/2)\mathcal{O}(N^{-1/2}) convergence rate for joint parameter recovery. When strong identifiability conditions fail, we provide partial identification guarantees through confidence set construction. We extend our framework to Markov games and demonstrate optimal convergence rates with strong empirical performance even when transition dynamics are unknown.

View on arXiv
Comments on this paper