ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2407.13943
  4. Cited By
Werewolf Arena: A Case Study in LLM Evaluation via Social Deduction

Werewolf Arena: A Case Study in LLM Evaluation via Social Deduction

18 July 2024
Suma Bailis
Jane Friedhoff
Feiyang Chen
ArXivPDFHTML

Papers citing "Werewolf Arena: A Case Study in LLM Evaluation via Social Deduction"

2 / 2 papers shown
Title
DSGBench: A Diverse Strategic Game Benchmark for Evaluating LLM-based Agents in Complex Decision-Making Environments
Wenjie Tang
Yuan Zhou
Erqiang Xu
Keyan Cheng
Minne Li
Liquan Xiao
ELM
47
1
0
08 Mar 2025
Simulating Human Strategic Behavior: Comparing Single and Multi-agent
  LLMs
Simulating Human Strategic Behavior: Comparing Single and Multi-agent LLMs
Karthik Sreedhar
Lydia B. Chilton
LLMAG
46
12
0
13 Feb 2024
1