ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2210.05492
  4. Cited By
Mastering the Game of No-Press Diplomacy via Human-Regularized
  Reinforcement Learning and Planning

Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning

11 October 2022
A. Bakhtin
David J. Wu
Adam Lerer
Jonathan Gray
Athul Paul Jacob
Gabriele Farina
Alexander H. Miller
Noam Brown
ArXivPDFHTML

Papers citing "Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning"

10 / 10 papers shown
Title
Among Us: A Sandbox for Measuring and Detecting Agentic Deception
Among Us: A Sandbox for Measuring and Detecting Agentic Deception
Satvik Golechha
Adrià Garriga-Alonso
LLMAG
52
2
0
05 Apr 2025
Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL
Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL
Wichayaporn Wongkamjan
Yanze Wang
Feng Gu
Denis Peskoff
Jonathan K. Kummerfeld
Jonathan May
Jordan Lee Boyd-Graber
55
0
0
18 Feb 2025
Harnessing Language for Coordination: A Framework and Benchmark for LLM-Driven Multi-Agent Control
Harnessing Language for Coordination: A Framework and Benchmark for LLM-Driven Multi-Agent Control
Timothée Anne
Noah Syrkis
Meriem Elhosni
Florian Turati
Franck Legendre
Alain Jaquier
Sebastian Risi
LLMAG
95
2
0
16 Dec 2024
More Victories, Less Cooperation: Assessing Cicero's Diplomacy Play
More Victories, Less Cooperation: Assessing Cicero's Diplomacy Play
Wichayaporn Wongkamjan
Feng Gu
Yanze Wang
Ulf Hermjakob
Jonathan May
Brandon M. Stewart
Jonathan K. Kummerfeld
Denis Peskoff
Jordan L. Boyd-Graber
53
3
0
07 Jun 2024
Prospect Personalized Recommendation on Large Language Model-based Agent
  Platform
Prospect Personalized Recommendation on Large Language Model-based Agent Platform
Jizhi Zhang
Keqin Bao
Wenjie Wang
Yang Zhang
Wentao Shi
Wanhong Xu
Fuli Feng
Tat-Seng Chua
LLMAG
48
16
0
28 Feb 2024
Regularized Conventions: Equilibrium Computation as a Model of Pragmatic
  Reasoning
Regularized Conventions: Equilibrium Computation as a Model of Pragmatic Reasoning
Athul Paul Jacob
Gabriele Farina
Jacob Andreas
20
3
0
16 Nov 2023
Towards automating Codenames spymasters with deep reinforcement learning
Towards automating Codenames spymasters with deep reinforcement learning
Sherman Siu
17
2
0
28 Dec 2022
Collaborating with Humans without Human Data
Collaborating with Humans without Human Data
D. Strouse
Kevin R. McKee
M. Botvinick
Edward Hughes
Richard Everett
124
161
0
15 Oct 2021
No-Press Diplomacy from Scratch
No-Press Diplomacy from Scratch
A. Bakhtin
David J. Wu
Adam Lerer
Noam Brown
98
42
0
06 Oct 2021
"Other-Play" for Zero-Shot Coordination
"Other-Play" for Zero-Shot Coordination
Hengyuan Hu
Adam Lerer
A. Peysakhovich
Jakob N. Foerster
VLM
OffRL
136
218
0
06 Mar 2020
1