ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1401.3460
  4. Cited By
Policy Iteration for Decentralized Control of Markov Decision Processes

Policy Iteration for Decentralized Control of Markov Decision Processes

Journal of Artificial Intelligence Research (JAIR), 2009
15 January 2014
D. Bernstein
Chris Amato
E. Hansen
S. Zilberstein
ArXiv (abs)PDFHTML

Papers citing "Policy Iteration for Decentralized Control of Markov Decision Processes"

24 / 24 papers shown
Scalable Solution Methods for Dec-POMDPs with Deterministic Dynamics
Scalable Solution Methods for Dec-POMDPs with Deterministic Dynamics
Yang You
Alex Schutz
Ruoyao Xiao
Bruno Lacerda
R. Skilton
Nick Hawes
149
0
0
29 Aug 2025
Shared Control with Black Box Agents using Oracle Queries
Shared Control with Black Box Agents using Oracle QueriesInternational Conference on Auditory Display (ICAD), 2024
Inbal Avraham
Reuth Mirsky
316
3
0
24 Feb 2025
$\widetilde{O}(T^{-1})$ Convergence to (Coarse) Correlated Equilibria in
  Full-Information General-Sum Markov Games
O~(T−1)\widetilde{O}(T^{-1})O(T−1) Convergence to (Coarse) Correlated Equilibria in Full-Information General-Sum Markov Games
Weichao Mao
Haoran Qiu
Chen Wang
Hubertus Franke
Zbigniew T. Kalbarczyk
Tamer Basar
276
0
0
02 Feb 2024
Computing Universal Plans for Partially Observable Multi-Agent Routing Using Answer Set Programming
Computing Universal Plans for Partially Observable Multi-Agent Routing Using Answer Set Programming
Fengming Zhu
Fang-Chang Lin
273
0
0
25 May 2023
CLAS: Coordinating Multi-Robot Manipulation with Central Latent Action
  Spaces
CLAS: Coordinating Multi-Robot Manipulation with Central Latent Action SpacesConference on Learning for Dynamics & Control (L4DC), 2022
Elie Aljalbout
Maximilian Karl
Patrick van der Smagt
195
6
0
28 Nov 2022
Mean-Field Control Approach to Decentralized Stochastic Control with
  Finite-Dimensional Memories
Mean-Field Control Approach to Decentralized Stochastic Control with Finite-Dimensional Memories
Takehiro Tottori
Tetsuya J. Kobayashi
158
6
0
12 Sep 2022
Learning Cooperation and Online Planning Through Simulation and Graph
  Convolutional Network
Learning Cooperation and Online Planning Through Simulation and Graph Convolutional Network
Rafid Ameer Mahmud
Fahim Faisal
Saaduddin Mahmud
Md.Mosaddek Khan
125
1
0
16 Oct 2021
On Improving Model-Free Algorithms for Decentralized Multi-Agent
  Reinforcement Learning
On Improving Model-Free Algorithms for Decentralized Multi-Agent Reinforcement LearningInternational Conference on Machine Learning (ICML), 2021
Weichao Mao
Lin F. Yang
Jianchao Tan
Tamer Bacsar
372
63
0
12 Oct 2021
Solving infinite-horizon Dec-POMDPs using Finite State Controllers
  within JESP
Solving infinite-horizon Dec-POMDPs using Finite State Controllers within JESP
Yang You
Vincent Thomas
F. Colas
Olivier Buffet
168
8
0
17 Sep 2021
Forward and Backward Bellman equations improve the efficiency of EM
  algorithm for DEC-POMDP
Forward and Backward Bellman equations improve the efficiency of EM algorithm for DEC-POMDPEntropy (Entropy), 2021
Takehiro Tottori
Tetsuya J. Kobayashi
212
4
0
19 Mar 2021
Knowledge-Based Strategies for Multi-Agent Teams Playing Against Nature
Knowledge-Based Strategies for Multi-Agent Teams Playing Against NatureArtificial Intelligence (AI), 2020
D. Gurov
V. Goranko
Edvin Lundberg
LLMAG
216
11
0
29 Dec 2020
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and
  Algorithms
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms
Jianchao Tan
Zhuoran Yang
Tamer Basar
775
1,552
0
24 Nov 2019
Online Planning for Decentralized Stochastic Control with Partial
  History Sharing
Online Planning for Decentralized Stochastic Control with Partial History SharingAmerican Control Conference (ACC), 2019
Jianchao Tan
Erik Miehling
Tamer Basar
316
14
0
06 Aug 2019
Distributed Policy Iteration for Scalable Approximation of Cooperative
  Multi-Agent Policies
Distributed Policy Iteration for Scalable Approximation of Cooperative Multi-Agent Policies
Thomy Phan
Kyrill Schmid
Lenz Belzner
Thomas Gabor
Sebastian Feld
Claudia Linnhoff-Popien
279
5
0
25 Jan 2019
Q-CP: Learning Action Values for Cooperative Planning
Q-CP: Learning Action Values for Cooperative PlanningIEEE International Conference on Robotics and Automation (ICRA), 2018
Francesco Riccio
Roberto Capobianco
Daniele Nardi
108
7
0
01 Mar 2018
Scalable Accelerated Decentralized Multi-Robot Policy Search in
  Continuous Observation Spaces
Scalable Accelerated Decentralized Multi-Robot Policy Search in Continuous Observation Spaces
Shayegan Omidshafiei
Chris Amato
Miao Liu
Michael Everett
Jonathan P. How
J. Vian
204
4
0
16 Mar 2017
Semantic-level Decentralized Multi-Robot Decision-Making using
  Probabilistic Macro-Observations
Semantic-level Decentralized Multi-Robot Decision-Making using Probabilistic Macro-Observations
Shayegan Omidshafiei
Shih‐Yuan Liu
Michael Everett
B. Lopez
Chris Amato
Miao Liu
Jonathan P. How
J. Vian
280
6
0
16 Mar 2017
Real-time Rescheduling in Distributed Railway Network: An Agent-Based
  Approach
Real-time Rescheduling in Distributed Railway Network: An Agent-Based Approach
Poulami Dalapati
Piyush Agarwal
Animesh Dutta
S. Bhattacharya
77
2
0
12 Jul 2016
Stick-Breaking Policy Learning in Dec-POMDPs
Stick-Breaking Policy Learning in Dec-POMDPs
Miao Liu
Chris Amato
X. Liao
Lawrence Carin
Jonathan P. How
202
30
0
01 May 2015
Decentralized Control of Partially Observable Markov Decision Processes
  using Belief Space Macro-actions
Decentralized Control of Partially Observable Markov Decision Processes using Belief Space Macro-actions
Chris Amato
Ali-akbar Agha-mohammadi
A. Geramifard
N. K. Üre
271
98
0
20 Feb 2015
Incremental Clustering and Expansion for Faster Optimal Planning in
  Dec-POMDPs
Incremental Clustering and Expansion for Faster Optimal Planning in Dec-POMDPsJournal of Artificial Intelligence Research (JAIR), 2013
F. Oliehoek
M. Spaan
Chris Amato
Shimon Whiteson
287
45
0
04 Feb 2014
A Hybrid LP-RPG Heuristic for Modelling Numeric Resource Flows in
  Planning
A Hybrid LP-RPG Heuristic for Modelling Numeric Resource Flows in PlanningJournal of Artificial Intelligence Research (JAIR), 2014
A. Coles
A. Coles
M. Fox
D. Long
401
57
0
04 Feb 2014
Scaling Up Decentralized MDPs Through Heuristic Search
Scaling Up Decentralized MDPs Through Heuristic SearchConference on Uncertainty in Artificial Intelligence (UAI), 2012
J. Dibangoye
Chris Amato
Arnaud Doniec
241
31
0
16 Oct 2012
Anytime Planning for Decentralized POMDPs using Expectation Maximization
Anytime Planning for Decentralized POMDPs using Expectation MaximizationConference on Uncertainty in Artificial Intelligence (UAI), 2010
Akshat Kumar
S. Zilberstein
318
41
0
15 Mar 2012
1
Page 1 of 1