1909.09436
CodeSearchNet Challenge: Evaluating the State of Semantic Code Search
20 September 2019
Hamel Husain
Hongqiu Wu
Tiferet Gazit
Miltiadis Allamanis
Marc Brockschmidt
ELM
Papers citing
"CodeSearchNet Challenge: Evaluating the State of Semantic Code Search"
50 / 472 papers shown
Title
RepoQA: Evaluating Long Context Code Understanding
Jiawei Liu
Jia Le Tian
Vijay Daita
Yuxiang Wei
Yifeng Ding
Yuhan Katherine Wang
Jun Yang
Lingming Zhang
LLMAG
31
17
0
10 Jun 2024
Enhancing Repository-Level Code Generation with Integrated Contextual Information
Zhiyuan Pan
Xing Hu
Xin Xia
Xiaohu Yang
31
3
0
05 Jun 2024
R2C2-Coder: Enhancing and Benchmarking Real-world Repository-level Code Completion Abilities of Code Large Language Models
Ken Deng
Jiaheng Liu
He Zhu
Congnan Liu
Jingxin Li
...
Yuanxing Zhang
Wenbo Su
Bangyu Xiang
Tiezheng Ge
Bo Zheng
47
2
0
03 Jun 2024
A Survey of Generative Information Retrieval
Tzu-Lin Kuo
Tzu-Wei Chiu
Tzung-Sheng Lin
Sheng-Yang Wu
Chao-Wei Huang
Yun-Nung Chen
SyDa
77
2
0
03 Jun 2024
A Survey on Large Language Models for Code Generation
Juyong Jiang
Fan Wang
Jiasi Shen
Sungju Kim
Sunghun Kim
50
161
0
01 Jun 2024
Divide-and-Conquer Meets Consensus: Unleashing the Power of Functions in Code Generation
Jingchang Chen
Hongxuan Tang
Zheng Chu
Qianglong Chen
Zekun Wang
Ming Liu
Bing Qin
52
4
0
30 May 2024
Dataflow-Guided Retrieval Augmentation for Repository-Level Code Completion
Wei Cheng
Yuhan Wu
Wei Hu
38
11
0
30 May 2024
Large Language Models for Code Summarization
Balázs Szalontai
Gergő Szalay
Tamás Márton
Anna Sike
Balázs Pintér
Tibor Gregorics
ELM
28
1
0
29 May 2024
Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass
Ethan Shen
Alan Fan
Sarah M Pratt
Jae Sung Park
Matthew Wallingford
Sham Kakade
Ari Holtzman
Ranjay Krishna
Ali Farhadi
Aditya Kusupati
39
2
0
28 May 2024
Aligning LLMs through Multi-perspective User Preference Ranking-based Feedback for Programming Question Answering
Hongyu Yang
Liyang He
Min Hou
Shuanghong Shen
Rui Li
Jiahui Hou
Jianhui Ma
Junda Zhao
27
4
0
27 May 2024
Uncovering LLM-Generated Code: A Zero-Shot Synthetic Code Detector via Code Rewriting
Tong Ye
Yangkai Du
Tengfei Ma
Lingfei Wu
Xuhong Zhang
Shouling Ji
Wenhai Wang
DeLMO
46
6
0
25 May 2024
CoMERA: Computing- and Memory-Efficient Training via Rank-Adaptive Tensor Optimization
Zi Yang
Samridhi Choudhary
Xinfeng Xie
Cao Gao
Siegfried Kunzmann
Zheng-Wei Zhang
VLM
38
6
0
23 May 2024
Automating Code Adaptation for MLOps -- A Benchmarking Study on LLMs
Harsh Patel
Buvaneswari A. Ramanan
Manzoor A. Khan
Thomas Williams
Brian D. Friedman
Lawrence Drabeck
ELM
16
0
0
10 May 2024
Refining Joint Text and Source Code Embeddings for Retrieval Task with Parameter-Efficient Fine-Tuning
Karim Galliamov
Leila Khaertdinova
Karina Denisova
38
1
0
07 May 2024
On Training a Neural Network to Explain Binaries
Alex Interrante-Grant
Andy Davis
Heather Preslier
Tim Leek
24
0
0
30 Apr 2024
Does Your Neural Code Completion Model Use My Code? A Membership Inference Approach
Yao Wan
Guanghua Wan
Shijie Zhang
Hongyu Zhang
Yulei Sui
Pan Zhou
Hai Jin
Lichao Sun
27
2
0
22 Apr 2024
LLMs in Web Development: Evaluating LLM-Generated PHP Code Unveiling Vulnerabilities and Limitations
Rebeka Tóth
Tamas Bisztray
László Erdődi
SILM
25
16
0
21 Apr 2024
CodeCloak: A Method for Evaluating and Mitigating Code Leakage by LLM Code Assistants
Amit Finkman
Eden Bar-Kochva
Avishag Shapira
D. Mimran
Yuval Elovici
A. Shabtai
ELM
36
3
0
13 Apr 2024
Structure-aware Fine-tuning for Code Pre-trained Models
Jiayi Wu
Renyu Zhu
Nuo Chen
Qiushi Sun
Xiang Li
Ming Gao
37
2
0
11 Apr 2024
Analyzing the Performance of Large Language Models on Code Summarization
Rajarshi Haldar
J. Hockenmaier
40
17
0
10 Apr 2024
Sample-Efficient Human Evaluation of Large Language Models via Maximum Discrepancy Competition
Kehua Feng
Keyan Ding
Kede Ma
Zhihua Wang
Qiang Zhang
Huajun Chen
34
10
0
10 Apr 2024
RAR-b: Reasoning as Retrieval Benchmark
Chenghao Xiao
G. Thomas Hudson
Noura Al Moubayed
LRM
RALM
31
8
0
09 Apr 2024
Open-Source AI-based SE Tools: Opportunities and Challenges of Collaborative Software Learning
Zhihao Lin
Wei Ma
Tao Lin
Yaowen Zheng
Jingquan Ge
Jun Wang
Jacques Klein
Tegawende F. Bissyande
Yang Liu
Li Li
VLM
35
4
0
09 Apr 2024
Multi-modal Learning for WebAssembly Reverse Engineering
Hanxian Huang
Jishen Zhao
31
2
0
04 Apr 2024
CodeBenchGen: Creating Scalable Execution-based Code Generation Benchmarks
Yiqing Xie
Alex Xie
Divyanshu Sheth
Pengfei Liu
Daniel Fried
Carolyn Rose
43
8
0
31 Mar 2024
FoC: Figure out the Cryptographic Functions in Stripped Binaries with LLMs
Guoqiang Chen
Xiuwei Shang
Shaoyin Cheng
Yanming Zhang
Weiming Zhang
Neng H. Yu
N. Yu
94
2
0
27 Mar 2024
ProCQA: A Large-scale Community-based Programming Question Answering Dataset for Code Search
Zehan Li
Jianfei Zhang
Chuantao Yin
Y. Ouyang
Wenge Rong
34
3
0
25 Mar 2024
Investigating the Performance of Language Models for Completing Code in Functional Programming Languages: a Haskell Case Study
Tim van Dam
Frank van der Heijden
Philippe de Bekker
Berend Nieuwschepen
Marc Otten
M. Izadi
41
5
0
22 Mar 2024
Genetic Auto-prompt Learning for Pre-trained Code Intelligence Language Models
Chengzhe Feng
Yanan Sun
Ke Li
Pan Zhou
Jiancheng Lv
Aojun Lu
51
1
0
20 Mar 2024
Bridging Expert Knowledge with Deep Learning Techniques for Just-In-Time Defect Prediction
Xin Zhou
Donggyun Han
David Lo
VLM
21
2
0
17 Mar 2024
CodeUltraFeedback: An LLM-as-a-Judge Dataset for Aligning Large Language Models to Coding Preferences
M. Weyssow
Aton Kamanda
H. Sahraoui
ALM
59
30
0
14 Mar 2024
CommitBench: A Benchmark for Commit Message Generation
Maximilian Schall
Tamara Czinczoll
Gerard de Melo
22
3
0
08 Mar 2024
Semi-Instruct: Bridging Natural-Instruct and Self-Instruct for Code Large Language Models
Xianzhen Luo
Qingfu Zhu
Zhiming Zhang
Xu Wang
Qing Yang
Dongliang Xu
Wanxiang Che
ALM
32
2
0
01 Mar 2024
CLLMs: Consistency Large Language Models
Siqi Kou
Lanxiang Hu
Zhe He
Zhijie Deng
Hao Zhang
39
27
0
28 Feb 2024
Unsupervised Information Refinement Training of Large Language Models for Retrieval-Augmented Generation
Shicheng Xu
Liang Pang
Mo Yu
Fandong Meng
Huawei Shen
Xueqi Cheng
Jie Zhou
RALM
33
10
0
28 Feb 2024
Language Models for Code Completion: A Practical Evaluation
M. Izadi
J. Katzy
Tim van Dam
Marc Otten
R. Popescu
A. van Deursen
ALM
ELM
39
22
0
25 Feb 2024
RoCode: A Dataset for Measuring Code Intelligence from Problem Definitions in Romanian
Adrian Cosma
Ioan-Bogdan Iordache
Paolo Rosso
OffRL
41
2
0
20 Feb 2024
On the Effectiveness of Machine Learning-based Call Graph Pruning: An Empirical Study
A. Mir
Mehdi Keshani
Sebastian Proksch
15
1
0
11 Feb 2024
Do Large Code Models Understand Programming Concepts? Counterfactual Analysis for Code Predicates
Ashish Hooda
Mihai Christodorescu
Miltos Allamanis
Aaron Wilson
Kassem Fawaz
Somesh Jha
ELM
27
7
0
08 Feb 2024
Studying Vulnerable Code Entities in R
Zixiao Zhao
Millon Madhur Das
Fatemeh H. Fard
AAML
54
0
0
06 Feb 2024
UniTSyn: A Large-Scale Dataset Capable of Enhancing the Prowess of Large Language Models for Program Testing
Yifeng He
Jiabo Huang
Yuyang Rong
Yiwen Guo
Ethan Wang
Hao Chen
26
4
0
04 Feb 2024
Solution-oriented Agent-based Models Generation with Verifier-assisted Iterative In-context Learning
Tong Niu
Weihao Zhang
Rong Zhao
LLMAG
27
2
0
04 Feb 2024
GliDe with a CaPE: A Low-Hassle Method to Accelerate Speculative Decoding
Cunxiao Du
Jing Jiang
Yuanchen Xu
Jiawei Wu
Sicheng Yu
...
Shenggui Li
Kai Xu
Liqiang Nie
Zhaopeng Tu
Yang You
26
29
0
03 Feb 2024
Calibration and Correctness of Language Models for Code
Claudio Spiess
David Gros
Kunal Suresh Pai
Michael Pradel
Md Rafiqul Islam Rabin
Amin Alipour
Susmit Jha
Prem Devanbu
Toufique Ahmed
60
19
0
03 Feb 2024
Code Representation Learning At Scale
Dejiao Zhang
W. Ahmad
Ming Tan
Hantian Ding
Ramesh Nallapati
Dan Roth
Xiaofei Ma
Bing Xiang
OffRL
21
8
0
02 Feb 2024
COMET: Generating Commit Messages using Delta Graph Context Representation
Abhinav Reddy Mandli
Saurabhsingh Rajput
Tushar Sharma
31
1
0
02 Feb 2024
Nomic Embed: Training a Reproducible Long Context Text Embedder
Zach Nussbaum
John X. Morris
Brandon Duderstadt
Andriy Mulyar
19
95
0
02 Feb 2024
Embedding-based search in JetBrains IDEs
Evgeny Abramov
Nikolai Palchikov
VLM
17
1
0
26 Jan 2024
A Systematic Literature Review on Explainability for Machine/Deep Learning-based Software Engineering Research
Sicong Cao
Xiaobing Sun
Ratnadira Widyasari
David Lo
Xiaoxue Wu
...
Jiale Zhang
Bin Li
Wei Liu
Di Wu
Yixin Chen
31
6
0
26 Jan 2024
Structured Code Representations Enable Data-Efficient Adaptation of Code Language Models
Mayank Agarwal
Yikang Shen
Bailin Wang
Yoon Kim
Jie Chen
37
5
0
19 Jan 2024