MATEval: A Multi-Agent Discussion Framework for Advancing Open-Ended Text Evaluation

arXiv:2403.19305
28 March 2024
Yu Li, Shenyu Zhang, Rui Wu, Xiutian Huang, Yongrui Chen, Wenhao Xu, Guilin Qi, Dehai Min
LLMAG

Papers citing "MATEval: A Multi-Agent Discussion Framework for Advancing Open-Ended Text Evaluation"

10 papers

Multi-Agent-as-Judge: Aligning LLM-Agent-Based Automated Evaluation with Multi-Dimensional Human Evaluation
Jiaju Chen, Yuxuan Lu, Xiaojie Wang, Huimin Zeng, Jing Huang, Jiri Gesi, Ying Xu, Bingsheng Yao, Dakuo Wang
LLMAG, ELM
28 Jul 2025

HiMATE: A Hierarchical Multi-Agent Framework for Machine Translation Evaluation
Shijie Zhang, Renhao Li, Songsheng Wang, Philipp Koehn, Min Yang, Derek F. Wong
22 May 2025

SeriesBench: A Benchmark for Narrative-Driven Drama Series Understanding
Computer Vision and Pattern Recognition (CVPR), 2025
Yiming Lei, Chenkai Zhang, Ziqiang Liu, Haitao Leng, Shaoguo Liu, Tingting Gao, Qingjie Liu, Yunhong Wang
AI4TS
30 Apr 2025

Toward Generalizable Evaluation in the LLM Era: A Survey Beyond Benchmarks
Yixin Cao, Shibo Hong, Xuzhao Li, Jiahao Ying, Yubo Ma, ..., Juanzi Li, Aixin Sun, Qi Zhang, Tat-Seng Chua, Tianwei Zhang
ALM, ELM
26 Apr 2025

Leveraging LLMs as Meta-Judges: A Multi-Agent Framework for Evaluating LLM Judgments
Jian Wang, Jama Hussein Mohamud, Chongren Sun, Di Wu, Benoit Boulet
LLMAG, ELM
23 Apr 2025

Reading between the Lines: Can LLMs Identify Cross-Cultural Communication Gaps?
Sougata Saha, Saurabh Kumar Pandey, Harshit Gupta, Monojit Choudhury
21 Feb 2025

Is my Meeting Summary Good? Estimating Quality with a Multi-LLM Evaluator
International Conference on Computational Linguistics (COLING), 2024
Frederic Kirstein, Terry Ruas, Bela Gipp
27 Nov 2024

From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge
Dawei Li, Bohan Jiang, Liangjie Huang, Alimohammad Beigi, Chengshuai Zhao, ..., Canyu Chen, Tianhao Wu, Kai Shu, Lu Cheng, Huan Liu
ELM, AILaw
25 Nov 2024

SAGEval: The frontiers of Satisfactory Agent based NLG Evaluation for reference-free open-ended text
Reshmi Ghosh, Tianyi Yao, Lizzy Chen, Sadid Hasan, Tianwei Chen, Dario Bernal, Huitian Jiao, H M Sajjad Hossain
ELM
25 Nov 2024

What Makes a Good Story and How Can We Measure It? A Comprehensive Survey of Story Evaluation
Dingyi Yang, Qin Jin
26 Aug 2024