117

Towards Community-Driven Agents for Machine Learning Engineering

Sijie Li
Weiwei Sun
Shanda Li
Ameet Talwalkar
Yiming Yang
Main:9 Pages
12 Figures
Bibliography:3 Pages
7 Tables
Appendix:37 Pages
Abstract

Large language model-based machine learning (ML) agents have shown great promise in automating ML research. However, existing agents typically operate in isolation on a given research problem, without engaging with the broader research community, where human researchers often gain insights and contribute by sharing knowledge. To bridge this gap, we introduce MLE-Live, a live evaluation framework designed to assess an agent's ability to communicate with and leverage collective knowledge from a simulated Kaggle research community. Building on this framework, we propose CoMind, a novel agent that excels at exchanging insights and developing novel solutions within a community context. CoMind achieves state-of-the-art performance on MLE-Live and outperforms 79.2% human competitors on average across four ongoing Kaggle competitions. Our code is released at this https URL.

View on arXiv
Comments on this paper