SWE-MERA: A Dynamic Benchmark for Agenticly Evaluating Large Language Models on Software Engineering Tasks

Pavel Adamenko
Mikhail Ivanov
Aidar Valeev
Rodion Levichev
Pavel Zadorozhny
Ivan Lopatin
Dmitry Babayev
Alena Fenogenova
Valentin Malykh
