
SWE-MERA: A Dynamic Benchmark for Agenticly Evaluating Large Language Models on Software Engineering Tasks
Pavel Adamenko
Mikhail Ivanov
Aidar Valeev
Rodion Levichev
Pavel Zadorozhny
Ivan Lopatin
Dmitry Babayev
Alena Fenogenova
Valentin Malykh
Papers citing "SWE-MERA: A Dynamic Benchmark for Agenticly Evaluating Large Language Models on Software Engineering Tasks"
Title | |||
---|---|---|---|
No papers |