ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1608.07310
  4. Cited By
Learning in games with continuous action sets and unknown payoff
  functions
v1v2 (latest)

Learning in games with continuous action sets and unknown payoff functions

Mathematical programming (Math. Program.), 2016
25 August 2016
P. Mertikopoulos
Zhengyuan Zhou
ArXiv (abs)PDFHTML

Papers citing "Learning in games with continuous action sets and unknown payoff functions"

50 / 113 papers shown
A Modular Algorithm for Non-Stationary Online Convex-Concave Optimization
A Modular Algorithm for Non-Stationary Online Convex-Concave Optimization
Qing-xin Meng
Xia Lei
Jian-wei Liu
104
1
0
09 Sep 2025
Finding a Multiple Follower Stackelberg Equilibrium: A Fully First-Order Method
Finding a Multiple Follower Stackelberg Equilibrium: A Fully First-Order Method
April Niu
Kai Wang
Juba Ziani
207
0
0
09 Sep 2025
The impact of uncertainty on regularized learning in games
The impact of uncertainty on regularized learning in games
Pierre-Louis Cauvin
Davide Legacci
P. Mertikopoulos
180
3
0
16 Jun 2025
Hamiltonian of polymatrix zero-sum games
Hamiltonian of polymatrix zero-sum games
Toshihiro Ota
Yuma Fujimoto
AI4CE
497
0
0
19 May 2025
Algorithmic Pricing and Algorithmic Collusion
Algorithmic Pricing and Algorithmic Collusion
Martin Bichler
Julius Durmann
Matthias Oberlechner
283
2
0
23 Apr 2025
Characterizing the Convergence of Game Dynamics via Potentialness
Characterizing the Convergence of Game Dynamics via Potentialness
Martin Bichler
Davide Legacci
P. Mertikopoulos
Matthias Oberlechner
Bary S. R. Pradelski
271
0
0
20 Mar 2025
Convergence and Connectivity: Dynamics of Multi-Agent Q-Learning in Random Networks
Convergence and Connectivity: Dynamics of Multi-Agent Q-Learning in Random Networks
A. Hussain
D. Leonte
Francesco Belardinelli
Raphael Huser
Dario Paccagnan
227
1
0
13 Mar 2025
Expected Variational Inequalities
Expected Variational Inequalities
B. Zhang
Ioannis Anagnostides
Emanuel Tewolde
Ratip Emin Berker
Gabriele Farina
Vincent Conitzer
Tuomas Sandholm
992
5
0
25 Feb 2025
Networked Digital Public Goods Games with Heterogeneous Players and Convex Costs
Networked Digital Public Goods Games with Heterogeneous Players and Convex CostsThe Web Conference (WWW), 2025
Yukun Cheng
Xiaotie Deng
Yunxuan Ma
150
1
0
03 Feb 2025
Solving Infinite-Player Games with Player-to-Strategy Networks
Solving Infinite-Player Games with Player-to-Strategy Networks
Carlos Martin
Tuomas Sandholm
220
0
0
17 Jan 2025
No-regret learning in harmonic games: Extrapolation in the face of conflicting interests
No-regret learning in harmonic games: Extrapolation in the face of conflicting interestsNeural Information Processing Systems (NeurIPS), 2024
Davide Legacci
P. Mertikopoulos
Christos H. Papadimitriou
Georgios Piliouras
Bary S. R. Pradelski
340
6
0
31 Dec 2024
Accelerated regularized learning in finite N-person games
Accelerated regularized learning in finite N-person gamesNeural Information Processing Systems (NeurIPS), 2024
Kyriakos Lotidis
Angeliki Giannou
P. Mertikopoulos
Nicholas Bambos
285
2
0
31 Dec 2024
Online Optimization Algorithms in Repeated Price Competition: Equilibrium Learning and Algorithmic Collusion
Online Optimization Algorithms in Repeated Price Competition: Equilibrium Learning and Algorithmic Collusion
Martin Bichler
Julius Durmann
Matthias Oberlechner
212
6
0
20 Dec 2024
Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Model Alignment
Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Model AlignmentInternational Conference on Learning Representations (ICLR), 2024
Mingzhi Wang
Chengdong Ma
Qizhi Chen
Linjian Meng
Yang Han
Jiancong Xiao
Zhaowei Zhang
Jing Huo
Weijie Su
Wenbo Ding
686
20
0
22 Oct 2024
Eco-driving Incentive Mechanisms for Mitigating Emissions in Urban Transportation
Eco-driving Incentive Mechanisms for Mitigating Emissions in Urban TransportationIEEE Transactions on Control of Network Systems (TCNS), 2024
M. Umar B. Niazi
Jung-Hoon Cho
Munther A. Dahleh
Roy Dong
Cathy Wu
107
0
0
10 Oct 2024
Joint-perturbation simultaneous pseudo-gradient
Joint-perturbation simultaneous pseudo-gradientInternational Joint Conference on Artificial Intelligence (IJCAI), 2024
Carlos Martin
Tuomas Sandholm
301
2
0
17 Aug 2024
Decentralized and Uncoordinated Learning of Stable Matchings: A
  Game-Theoretic Approach
Decentralized and Uncoordinated Learning of Stable Matchings: A Game-Theoretic Approach
S. Rasoul Etesami
R. Srikant
193
3
0
31 Jul 2024
Nested replicator dynamics, nested logit choice, and similarity-based
  learning
Nested replicator dynamics, nested logit choice, and similarity-based learning
P. Mertikopoulos
William H. Sandholm
211
3
0
25 Jul 2024
Proximal Point Method for Online Saddle Point Problem
Proximal Point Method for Online Saddle Point Problem
Qing-xin Meng
Jian-wei Liu
350
3
0
05 Jul 2024
Learning to Control Unknown Strongly Monotone Games
Learning to Control Unknown Strongly Monotone Games
Siddharth Chandak
Ilai Bistritz
Nicholas Bambos
345
4
0
30 Jun 2024
Adaptive Incentive Design with Learning Agents
Adaptive Incentive Design with Learning Agents
C. Maheshwari
Kshitij Kulkarni
Manxi Wu
S. Shankar Sastry
346
4
0
26 May 2024
Is Thompson Sampling Susceptible to Algorithmic Collusion?
Is Thompson Sampling Susceptible to Algorithmic Collusion?
Yi Xiong
Ningyuan Chen
Yi Xiong
278
0
0
23 May 2024
A geometric decomposition of finite games: Convergence vs. recurrence
  under exponential weights
A geometric decomposition of finite games: Convergence vs. recurrence under exponential weights
Davide Legacci
P. Mertikopoulos
Bary S. R. Pradelski
336
10
0
12 May 2024
Understanding Model Selection For Learning In Strategic Environments
Understanding Model Selection For Learning In Strategic EnvironmentsNeural Information Processing Systems (NeurIPS), 2024
Tinashe Handina
Eric Mazumdar
243
2
0
12 Feb 2024
Exploiting hidden structures in non-convex games for convergence to Nash
  equilibrium
Exploiting hidden structures in non-convex games for convergence to Nash equilibrium
Iosif Sakos
Emmanouil-Vasileios Vlatakis-Gkaragkounis
P. Mertikopoulos
Georgios Piliouras
190
6
0
27 Dec 2023
Scalable and Independent Learning of Nash Equilibrium Policies in
  $n$-Player Stochastic Games with Unknown Independent Chains
Scalable and Independent Learning of Nash Equilibrium Policies in nnn-Player Stochastic Games with Unknown Independent Chains
Tiancheng Qin
S. Rasoul Etesami
323
2
0
04 Dec 2023
Payoff-based learning with matrix multiplicative weights in quantum
  games
Payoff-based learning with matrix multiplicative weights in quantum gamesNeural Information Processing Systems (NeurIPS), 2023
Kyriakos Lotidis
P. Mertikopoulos
Nicholas Bambos
Jose Blanchet
185
2
0
04 Nov 2023
The equivalence of dynamic and strategic stability under regularized
  learning in games
The equivalence of dynamic and strategic stability under regularized learning in gamesNeural Information Processing Systems (NeurIPS), 2023
Victor Boone
P. Mertikopoulos
288
8
0
04 Nov 2023
Adaptive, Doubly Optimal No-Regret Learning in Strongly Monotone and
  Exp-Concave Games with Gradient Feedback
Adaptive, Doubly Optimal No-Regret Learning in Strongly Monotone and Exp-Concave Games with Gradient FeedbackOperational Research (OR), 2023
Michael I. Jordan
Tianyi Lin
Zhengyuan Zhou
563
12
0
21 Oct 2023
Convergence Analysis of the Best Response Algorithm for Time-Varying
  Games
Convergence Analysis of the Best Response Algorithm for Time-Varying GamesIEEE Conference on Decision and Control (CDC), 2023
Zifan Wang
Yi Shen
Michael M. Zavlanos
Karl H. Johansson
141
1
0
01 Sep 2023
Stability of Multi-Agent Learning: Convergence in Network Games with
  Many Players
Stability of Multi-Agent Learning: Convergence in Network Games with Many Players
A. Hussain
D. Leonte
Francesco Belardinelli
Georgios Piliouras
MLT
231
1
0
26 Jul 2023
Stochastic Methods in Variational Inequalities: Ergodicity, Bias and
  Refinements
Stochastic Methods in Variational Inequalities: Ergodicity, Bias and RefinementsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Emmanouil-Vasileios Vlatakis-Gkaragkounis
Angeliki Giannou
Yudong Chen
Qiaomin Xie
244
7
0
28 Jun 2023
Partially Personalized Federated Learning: Breaking the Curse of Data
  Heterogeneity
Partially Personalized Federated Learning: Breaking the Curse of Data Heterogeneity
Konstantin Mishchenko
Rustem Islamov
Eduard A. Gorbunov
Samuel Horváth
FedML
291
13
0
29 May 2023
Adaptively Perturbed Mirror Descent for Learning in Games
Adaptively Perturbed Mirror Descent for Learning in GamesInternational Conference on Machine Learning (ICML), 2023
Kenshi Abe
Kaito Ariu
Mitsuki Sakamoto
Atsushi Iwasaki
600
9
0
26 May 2023
Mastering Strategy Card Game (Hearthstone) with Improved Techniques
Mastering Strategy Card Game (Hearthstone) with Improved Techniques
Changnan Xiao
Yongxin Zhang
Xuefeng Huang
Qinhan Huang
Jie Chen
Peng Sun
206
13
0
09 Mar 2023
Mastering Strategy Card Game (Legends of Code and Magic) via End-to-End
  Policy and Optimistic Smooth Fictitious Play
Mastering Strategy Card Game (Legends of Code and Magic) via End-to-End Policy and Optimistic Smooth Fictitious Play
Wei Xi
Yongxin Zhang
Changnan Xiao
Xuefeng Huang
Shihong Deng
Haowei Liang
Jie Chen
Peng Sun
OffRL
180
10
0
07 Mar 2023
Single-Call Stochastic Extragradient Methods for Structured Non-monotone
  Variational Inequalities: Improved Analysis under Weaker Conditions
Single-Call Stochastic Extragradient Methods for Structured Non-monotone Variational Inequalities: Improved Analysis under Weaker ConditionsNeural Information Processing Systems (NeurIPS), 2023
S. Choudhury
Eduard A. Gorbunov
Nicolas Loizou
333
17
0
27 Feb 2023
Achieving Hierarchy-Free Approximation for Bilevel Programs With
  Equilibrium Constraints
Achieving Hierarchy-Free Approximation for Bilevel Programs With Equilibrium ConstraintsInternational Conference on Machine Learning (ICML), 2023
Jiayang Li
Jiahao Yu
Boyi Liu
Zhaoran Wang
Y. Nie
297
7
0
20 Feb 2023
Learning in quantum games
Learning in quantum games
Kyriakos Lotidis
P. Mertikopoulos
Nicholas Bambos
196
9
0
05 Feb 2023
Doubly Optimal No-Regret Learning in Monotone Games
Doubly Optimal No-Regret Learning in Monotone GamesInternational Conference on Machine Learning (ICML), 2023
Yang Cai
Weiqiang Zheng
283
22
0
30 Jan 2023
Asymptotic Convergence and Performance of Multi-Agent Q-Learning
  Dynamics
Asymptotic Convergence and Performance of Multi-Agent Q-Learning DynamicsAdaptive Agents and Multi-Agent Systems (AAMAS), 2023
A. Hussain
Francesco Belardinelli
Georgios Piliouras
251
16
0
23 Jan 2023
ApproxED: Approximate exploitability descent via learned best responses
ApproxED: Approximate exploitability descent via learned best responsesAdaptive Agents and Multi-Agent Systems (AAMAS), 2023
Carlos Martin
Tuomas Sandholm
463
2
0
20 Jan 2023
Global Nash Equilibrium in Non-convex Multi-player Game: Theory and
  Algorithms
Global Nash Equilibrium in Non-convex Multi-player Game: Theory and Algorithms
Guanpu Chen
Gehui Xu
Fengxiang He
Yiguang Hong
Leszek Rutkowski
Dacheng Tao
266
5
0
19 Jan 2023
Offline Reinforcement Learning for Human-Guided Human-Machine
  Interaction with Private Information
Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private InformationManagement Sciences (MS), 2022
Zuyue Fu
Zhengling Qi
Zhuoran Yang
Zhaoran Wang
Lan Wang
OffRL
207
1
0
23 Dec 2022
Finding mixed-strategy equilibria of continuous-action games without
  gradients using randomized policy networks
Finding mixed-strategy equilibria of continuous-action games without gradients using randomized policy networksInternational Joint Conference on Artificial Intelligence (IJCAI), 2022
Carlos Martin
Tuomas Sandholm
216
11
0
29 Nov 2022
The rate of convergence of Bregman proximal methods: Local geometry vs.
  regularity vs. sharpness
The rate of convergence of Bregman proximal methods: Local geometry vs. regularity vs. sharpnessSIAM Journal on Optimization (SIAM J. Optim.), 2022
Waïss Azizian
F. Iutzeler
J. Malick
P. Mertikopoulos
319
1
0
15 Nov 2022
On the convergence of policy gradient methods to Nash equilibria in
  general stochastic games
On the convergence of policy gradient methods to Nash equilibria in general stochastic gamesNeural Information Processing Systems (NeurIPS), 2022
Angeliki Giannou
Kyriakos Lotidis
P. Mertikopoulos
Emmanouil-Vasileios Vlatakis-Gkaragkounis
330
25
0
17 Oct 2022
Differentiable Bilevel Programming for Stackelberg Congestion Games
Differentiable Bilevel Programming for Stackelberg Congestion Games
Jiayang Li
Jiahao Yu
Qianni Wang
Boyi Liu
Zhaoran Wang
Y. Nie
396
18
0
15 Sep 2022
Last-Iterate Convergence with Full and Noisy Feedback in Two-Player
  Zero-Sum Games
Last-Iterate Convergence with Full and Noisy Feedback in Two-Player Zero-Sum GamesInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Kenshi Abe
Kaito Ariu
Mitsuki Sakamoto
Kenta Toyoshima
Atsushi Iwasaki
309
15
0
21 Aug 2022
Near-Optimal No-Regret Learning Dynamics for General Convex Games
Near-Optimal No-Regret Learning Dynamics for General Convex GamesNeural Information Processing Systems (NeurIPS), 2022
Gabriele Farina
Ioannis Anagnostides
Haipeng Luo
Chung‐Wei Lee
Christian Kroer
Tuomas Sandholm
305
34
0
17 Jun 2022
123
Next
Page 1 of 3