2406.14497
CodeRAG-Bench: Can Retrieval Augment Code Generation?
20 June 2024
Zora Zhiruo Wang
Akari Asai
Xinyan Velocity Yu
Frank F. Xu
Yiqing Xie
Graham Neubig
Daniel Fried
RALM
Papers citing "CodeRAG-Bench: Can Retrieval Augment Code Generation?"
23 / 23 papers shown
Toward Generalizable Evaluation in the LLM Era: A Survey Beyond Benchmarks
Yixin Cao
Shibo Hong
X. Li
Jiahao Ying
Yubo Ma
...
Juanzi Li
Aixin Sun
Xuanjing Huang
Tat-Seng Chua
Yu Jiang
ALM
ELM
84
0
0
26 Apr 2025
RAG-RL: Advancing Retrieval-Augmented Generation via RL and Curriculum Learning
Jerry Huang
Siddarth Madala
Risham Sidhu
Cheng Niu
Julia Hockenmaier
Tong Zhang
RALM
LRM
83
1
0
17 Mar 2025
LocAgent: Graph-Guided LLM Agents for Code Localization
Zhaoling Chen
Xiangru Tang
Gangda Deng
Fang Wu
Jialong Wu
Zhiwei Jiang
Viktor Prasanna
Arman Cohan
Xingyao Wang
LLMAG
89
2
0
12 Mar 2025
OpenRAG: Optimizing RAG End-to-End via In-Context Retrieval Learning
Jiawei Zhou
Lei Chen
3DV
VLM
73
0
0
11 Mar 2025
RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing
Yiqing Xie
Alex Xie
Divyanshu Sheth
Pengfei Liu
Daniel Fried
Carolyn Rose
LRM
62
0
0
10 Mar 2025
Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Enhancement Protocol
Roham Koohestani
Philippe de Bekker
M. Izadi
VLM
45
0
0
07 Mar 2025
A Survey of Model Architectures in Information Retrieval
Zhichao Xu
Fengran Mo
Zhiqi Huang
Crystina Zhang
Puxuan Yu
Bei Wang
Jimmy J. Lin
Vivek Srikumar
KELM
3DV
46
2
0
21 Feb 2025
LLM Program Optimization via Retrieval Augmented Search
Sagnik Anupam
Alexander Shypula
Osbert Bastani
120
1
0
31 Jan 2025
How Should We Build A Benchmark? Revisiting 274 Code-Related Benchmarks For LLMs
Jialun Cao
Yuk-Kit Chan
Zixuan Ling
Wenxuan Wang
Shuqing Li
...
Pinjia He
Shuai Wang
Zibin Zheng
Michael R. Lyu
S. Cheung
ALM
69
1
0
18 Jan 2025
CoRNStack: High-Quality Contrastive Data for Better Code Retrieval and Reranking
Tarun Suresh
R. Reddy
Yifei Xu
Zach Nussbaum
Andriy Mulyar
Brandon Duderstadt
Heng Ji
83
3
0
01 Dec 2024
CodeXEmbed: A Generalist Embedding Model Family for Multiligual and Multi-task Code Retrieval
Y. Liu
Rui Meng
Shafiq R. Joty
Silvio Savarese
Caiming Xiong
Yingbo Zhou
Semih Yavuz
87
3
0
19 Nov 2024
Can Large Language Models Replace Data Scientists in Biomedical Research?
Z. Wang
Benjamin P. Danek
Ziwei Yang
Zheng Chen
J. Sun
ELM
LM&MA
31
0
0
28 Oct 2024
An LLM Agent for Automatic Geospatial Data Analysis
Yuxing Chen
Weijie Wang
Sylvain Lobry
Camille Kurtz
LLMAG
30
3
0
24 Oct 2024
Self-adaptive Multimodal Retrieval-Augmented Generation
Wenjia Zhai
VLM
19
0
0
15 Oct 2024
Context-Augmented Code Generation Using Programming Knowledge Graphs
Iman Saberi
Fatemeh H. Fard
16
0
0
09 Oct 2024
Retrieval-Augmented Test Generation: How Far Are We?
Jiho Shin
Reem Aleithan
Hadi Hemmati
Song Wang
3DV
21
2
0
19 Sep 2024
Retrieval-Enhanced Machine Learning: Synthesis and Opportunities
To Eun Kim
Alireza Salemi
Andrew Drozdov
Fernando Diaz
Hamed Zamani
48
5
0
17 Jul 2024
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
Terry Yue Zhuo
Minh Chien Vu
Jenny Chim
Han Hu
Wenhao Yu
...
David Lo
Daniel Fried
Xiaoning Du
H. D. Vries
Leandro von Werra
65
125
0
22 Jun 2024
MHPP: Exploring the Capabilities and Limitations of Language Models Beyond Basic Code Generation
Jianbo Dai
Jianqiao Lu
Yunlong Feng
Rongju Ruan
Ming Cheng
Haochen Tan
Zhijiang Guo
ELM
LRM
36
11
0
19 May 2024
SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering
John Yang
Carlos E. Jimenez
Alexander Wettig
K. Lieret
Shunyu Yao
Karthik Narasimhan
Ofir Press
LLMAG
96
36
0
06 May 2024
Is Your Code Generated by ChatGPT Really Correct? Rigorous Evaluation of Large Language Models for Code Generation
Jiawei Liu
Chun Xia
Yuyao Wang
Lingming Zhang
ELM
ALM
166
388
0
02 May 2023
BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models
Nandan Thakur
Nils Reimers
Andreas Rucklé
Abhishek Srivastava
Iryna Gurevych
VLM
229
720
0
17 Apr 2021
CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation
Shuai Lu
Daya Guo
Shuo Ren
Junjie Huang
Alexey Svyatkovskiy
...
Nan Duan
Neel Sundaresan
Shao Kun Deng
Shengyu Fu
Shujie Liu
ELM
186
1,098
0
09 Feb 2021