2009.08366
Cited By
GraphCodeBERT: Pre-training Code Representations with Data Flow
17 September 2020
Daya Guo
Shuo Ren
Shuai Lu
Zhangyin Feng
Duyu Tang
Shujie Liu
Long Zhou
Nan Duan
Alexey Svyatkovskiy
Shengyu Fu
Michele Tufano
Shao Kun Deng
Colin B. Clement
Dawn Drain
Neel Sundaresan
Jian Yin
Daxin Jiang
M. Zhou
Papers citing "GraphCodeBERT: Pre-training Code Representations with Data Flow"
50 / 403 papers shown
Instruction-Driven Game Engines on Large Language Models
Hongqiu Wu
Xing-Chen Liu
Haizhen Zhao
Min Zhang
32
1
0
30 Mar 2024
SCALE: Constructing Structured Natural Language Comment Trees for Software Vulnerability Detection
Xinjie Wen
Cuiyun Gao
Shuzheng Gao
Yang Xiao
Michael R. Lyu
22
5
0
28 Mar 2024
Vulnerability Detection with Code Language Models: How Far Are We?
Yangruibo Ding
Yanjun Fu
Omniyyah Ibrahim
Chawin Sitawarin
Xinyun Chen
Basel Alomair
David A. Wagner
Baishakhi Ray
Yizheng Chen
AAML
41
43
0
27 Mar 2024
ProCQA: A Large-scale Community-based Programming Question Answering Dataset for Code Search
Zehan Li
Jianfei Zhang
Chuantao Yin
Y. Ouyang
Wenge Rong
21
1
0
25 Mar 2024
CodeS: Natural Language to Code Repository via Multi-Layer Sketch
Daoguang Zan
Ailun Yu
Wei Liu
Dong Chen
Bo Shen
...
Bei Guan
Zhiguang Yang
Yongji Wang
Qianxiang Wang
Li-zhen Cui
20
14
0
25 Mar 2024
Reasoning Runtime Behavior of a Program with LLM: How Far Are We?
Junkai Chen
Zhiyuan Pan
Xing Hu
Zhenhao Li
Ge Li
Xin Xia
LRM
32
20
0
25 Mar 2024
A hybrid LLM workflow can help identify user privilege related variables in programs of any size
Haizhou Wang
Zhilong Wang
Peng Liu
14
3
0
23 Mar 2024
An Exploratory Investigation into Code License Infringements in Large Language Model Training Datasets
J. Katzy
R. Popescu
A. van Deursen
M. Izadi
30
5
0
22 Mar 2024
Investigating the Performance of Language Models for Completing Code in Functional Programming Languages: a Haskell Case Study
Tim van Dam
Frank van der Heijden
Philippe de Bekker
Berend Nieuwschepen
Marc Otten
M. Izadi
33
5
0
22 Mar 2024
Genetic Auto-prompt Learning for Pre-trained Code Intelligence Language Models
Chengzhe Feng
Yanan Sun
Ke Li
Pan Zhou
Jiancheng Lv
Aojun Lu
46
1
0
20 Mar 2024
CatCode: A Comprehensive Evaluation Framework for LLMs On the Mixture of Code and Text
Zhenru Lin
Yiqun Yao
Yang Yuan
ELM
18
0
0
04 Mar 2024
Retrieval-Augmented Generation for AI-Generated Content: A Survey
Penghao Zhao
Hailin Zhang
Qinhan Yu
Zhengren Wang
Yunteng Geng
Fangcheng Fu
Ling Yang
Wentao Zhang
Jie Jiang
Bin Cui
3DV
110
220
0
29 Feb 2024
Language Models for Code Completion: A Practical Evaluation
M. Izadi
J. Katzy
Tim van Dam
Marc Otten
R. Popescu
A. van Deursen
ALM
ELM
39
22
0
25 Feb 2024
Proof-of-concept: Using ChatGPT to Translate and Modernize an Earth System Model from Fortran to Python/JAX
Anthony Zhou
Linnia Hawkins
Pierre Gentine
9
1
0
13 Feb 2024
Text-to-Code Generation with Modality-relative Pre-training
Fenia Christopoulou
Guchun Zhang
Gerasimos Lampouras
AI4TS
13
1
0
08 Feb 2024
Rocks Coding, Not Development--A Human-Centric, Experimental Evaluation of LLM-Supported SE Tasks
Wei Wang
Huilong Ning
Gaowei Zhang
Libo Liu
Yi Wang
24
11
0
08 Feb 2024
Studying Vulnerable Code Entities in R
Zixiao Zhao
Millon Madhur Das
Fatemeh H. Fard
AAML
51
0
0
06 Feb 2024
Make Every Move Count: LLM-based High-Quality RTL Code Generation Using MCTS
Matthew DeLorenzo
A. B. Chowdhury
Vasudev Gohil
Shailja Thakur
Ramesh Karri
Siddharth Garg
Jeyavijayan Rajendran
26
30
0
05 Feb 2024
Solution-oriented Agent-based Models Generation with Verifier-assisted Iterative In-context Learning
Tong Niu
Weihao Zhang
Rong Zhao
LLMAG
17
2
0
04 Feb 2024
EffiBench: Benchmarking the Efficiency of Automatically Generated Code
Dong Huang
Yuhao Qing
Weiyi Shang
Heming Cui
Jie M. Zhang
77
30
0
03 Feb 2024
The Landscape and Challenges of HPC Research and LLMs
Le Chen
Nesreen K. Ahmed
Akashnil Dutta
Arijit Bhattacharjee
Sixing Yu
...
Vy A. Vo
J. P. Muñoz
Ted Willke
Tim Mattson
Ali Jannesari
AI4CE
29
20
0
03 Feb 2024
Code Representation Learning At Scale
Dejiao Zhang
W. Ahmad
Ming Tan
Hantian Ding
Ramesh Nallapati
Dan Roth
Xiaofei Ma
Bing Xiang
OffRL
10
8
0
02 Feb 2024
COMET: Generating Commit Messages using Delta Graph Context Representation
Abhinav Reddy Mandli
Saurabhsingh Rajput
Tushar Sharma
31
1
0
02 Feb 2024
Security and Privacy Challenges of Large Language Models: A Survey
B. Das
M. H. Amini
Yanzhao Wu
PILM
ELM
19
101
0
30 Jan 2024
PPM: Automated Generation of Diverse Programming Problems for Benchmarking Code Generation Models
Simin Chen
Xiaoning Feng
Xiao Han
Cong Liu
Wei Yang
40
3
0
28 Jan 2024
A Systematic Literature Review on Explainability for Machine/Deep Learning-based Software Engineering Research
Sicong Cao
Xiaobing Sun
Ratnadira Widyasari
David Lo
Xiaoxue Wu
...
Jiale Zhang
Bin Li
Wei Liu
Di Wu
Yixin Chen
24
6
0
26 Jan 2024
Investigating the Efficacy of Large Language Models for Code Clone Detection
Mohamad Khajezade
Jie Wu
Fatemeh H. Fard
Gema Rodríguez-Pérez
Mohamed Sami Shehata
19
16
0
24 Jan 2024
Structured Code Representations Enable Data-Efficient Adaptation of Code Language Models
Mayank Agarwal
Yikang Shen
Bailin Wang
Yoon Kim
Jie Chen
37
5
0
19 Jan 2024
A Novel Approach for Automatic Program Repair using Round-Trip Translation with Large Language Models
Fernando Vallecillos Ruiz
Anastasiia Grishina
Max Hort
Leon Moonen
LRM
28
4
0
15 Jan 2024
Survey of Natural Language Processing for Education: Taxonomy, Systematic Review, and Future Trends
Yunshi Lan
Xinyuan Li
Hanyue Du
Xuesong Lu
Ming Gao
Weining Qian
Aoying Zhou
33
1
0
15 Jan 2024
Rewriting the Code: A Simple Method for Large Language Model Augmented Code Search
Haochen Li
Xin Zhou
Zhiqi Shen
29
9
0
09 Jan 2024
Enhanced Automated Code Vulnerability Repair using Large Language Models
David de-Fitero-Dominguez
Eva García-López
Antonio Garcia-Cabot
J. Martínez-Herráiz
19
11
0
08 Jan 2024
AST-T5: Structure-Aware Pretraining for Code Generation and Understanding
Linyuan Gong
Mostafa Elhoushi
Alvin Cheung
29
11
0
05 Jan 2024
Deep Learning for Code Intelligence: Survey, Benchmark and Toolkit
Yao Wan
Yang He
Zhangqian Bi
Jianguo Zhang
Hongyu Zhang
Yulei Sui
Guandong Xu
Hai Jin
Philip S. Yu
27
20
0
30 Dec 2023
Source Code is a Graph, Not a Sequence: A Cross-Lingual Perspective on Code Clone Detection
Mohammed Ataaur Rahaman
Julia Ive
11
0
0
27 Dec 2023
A Prompt Learning Framework for Source Code Summarization
Weisong Sun
Chunrong Fang
Yudu You
Yuchen Chen
Yi Liu
...
Quanjun Zhang
Hanwei Qian
Wei-Ye Zhao
Yang Liu
Zhenyu Chen
LLMAG
37
13
0
26 Dec 2023
AgentCoder: Multi-Agent-based Code Generation with Iterative Testing and Optimisation
Dong Huang
Jie M. Zhang
Michael Luck
Qi Bu
Yuhao Qing
Heming Cui
LLMAG
25
0
0
20 Dec 2023
A Case Study on Test Case Construction with Large Language Models: Unveiling Practical Insights and Challenges
Roberto Francisco de Lima Junior
Luiz Fernando Paes de Barros Presta
Lucca Santos Borborema
Vanderson Nogueira da Silva
Marcio Dahia
Anderson Carlos Sousa e Santos
11
2
0
19 Dec 2023
Binary Code Summarization: Benchmarking ChatGPT/GPT-4 and Other Large Language Models
Xin Jin
Jonathan Larson
Weiwei Yang
Zhiqiang Lin
ELM
13
20
0
15 Dec 2023
Towards Trustworthy AI Software Development Assistance
Daniel Maninger
Krishna Narasimhan
Mira Mezini
22
3
0
14 Dec 2023
INSPECT: Intrinsic and Systematic Probing Evaluation for Code Transformers
Anjan Karmakar
Romain Robbes
22
4
0
08 Dec 2023
Code Search Debiasing: Improve Search Results beyond Overall Ranking Performance
Sheng Zhang
Hui Li
Yanlin Wang
Zhao Wei
Yong Xiu
Juhong Wang
Rongong Ji
11
2
0
25 Nov 2023
Naturalness of Attention: Revisiting Attention in Code Language Models
M. Saad
Tushar Sharma
28
2
0
22 Nov 2023
Large Language Model-Enhanced Algorithm Selection: Towards Comprehensive Algorithm Representation
Xingyu Wu
Yan Zhong
Jibin Wu
Bingbing Jiang
Kay Chen Tan
25
5
0
22 Nov 2023
GenCodeSearchNet: A Benchmark Test Suite for Evaluating Generalization in Programming Language Understanding
Andor Diera
Abdelhalim Hafedh Dahou
Lukas Galke
Fabian Karl
Florian Sihler
A. Scherp
ELM
30
4
0
16 Nov 2023
An Extensive Study on Adversarial Attack against Pre-trained Models of Code
Xiaohu Du
Ming Wen
Zichao Wei
Shangwen Wang
Hai Jin
AAML
27
15
0
13 Nov 2023
AdaCCD: Adaptive Semantic Contrasts Discovery Based Cross Lingual Adaptation for Code Clone Detection
Yangkai Du
Tengfei Ma
Lingfei Wu
Xuhong Zhang
Shouling Ji
27
3
0
13 Nov 2023
DocGen: Generating Detailed Parameter Docstrings in Python
Vatsal Venkatkrishna
Durga Shree Nagabushanam
Emmanuel Iko-Ojo Simon
M. Vidoni
8
0
0
11 Nov 2023
TransformCode: A Contrastive Learning Framework for Code Embedding via Subtree Transformation
Zixiang Xian
Rubing Huang
Dave Towey
Chunrong Fang
Zhenyu Chen
10
5
0
10 Nov 2023
Noisy Pair Corrector for Dense Retrieval
Hang Zhang
Yeyun Gong
Xingwei He
Dayiheng Liu
Daya Guo
Jiancheng Lv
Jian Guo
24
5
0
07 Nov 2023