Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2009.08366
Cited By
GraphCodeBERT: Pre-training Code Representations with Data Flow
17 September 2020
Daya Guo
Shuo Ren
Shuai Lu
Zhangyin Feng
Duyu Tang
Shujie Liu
Long Zhou
Nan Duan
Alexey Svyatkovskiy
Shengyu Fu
Michele Tufano
Shao Kun Deng
Colin B. Clement
Dawn Drain
Neel Sundaresan
Jian Yin
Daxin Jiang
M. Zhou
Re-assign community
ArXiv
PDF
HTML
Papers citing
"GraphCodeBERT: Pre-training Code Representations with Data Flow"
50 / 403 papers shown
Title
A Study of Variable-Role-based Feature Enrichment in Neural Models of Code
Aftab Hussain
Md Rafiqul Islam Rabin
Bowen Xu
David Lo
Mohammad Amin Alipour
34
3
0
08 Mar 2023
Judging Adam: Studying the Performance of Optimization Methods on ML4SE Tasks
D. Pasechnyuk
Anton Prazdnichnykh
Mikhail Evtikhiev
T. Bryksin
22
1
0
06 Mar 2023
APIContext2Com: Code Comment Generation by Incorporating Pre-Defined API Documentation
Ramin Shahbazi
Fatemeh H. Fard
23
2
0
03 Mar 2023
Power Constrained Autotuning using Graph Neural Networks
Akashnil Dutta
JeeWhan Choi
Ali Jannesari
30
5
0
22 Feb 2023
On ML-Based Program Translation: Perils and Promises
Aniketh Malyala
K. Zhou
Baishakhi Ray
Saikat Chakraborty
24
5
0
21 Feb 2023
Transformadores: Fundamentos teoricos y Aplicaciones
J. D. L. Torre
63
0
0
18 Feb 2023
Automating Code-Related Tasks Through Transformers: The Impact of Pre-training
Rosalia Tufano
L. Pascarella
Gabriele Bavota
25
19
0
08 Feb 2023
CrossCodeBench: Benchmarking Cross-Task Generalization of Source Code Models
Changan Niu
Chuanyi Li
Vincent Ng
Bin Luo
ELM
ALM
32
9
0
08 Feb 2023
CodeLMSec Benchmark: Systematically Evaluating and Finding Security Vulnerabilities in Black-Box Code Language Models
Hossein Hajipour
Keno Hassler
Thorsten Holz
Lea Schonherr
Mario Fritz
ELM
34
19
0
08 Feb 2023
Syntax and Domain Aware Model for Unsupervised Program Translation
Fang Liu
Jia Li
Li Zhang
17
18
0
08 Feb 2023
Toward a Theory of Causation for Interpreting Neural Code Models
David Nader-Palacio
Alejandro Velasco
Nathan Cooper
Á. Rodríguez
Kevin Moran
Denys Poshyvanyk
18
16
0
07 Feb 2023
Exploring Data Augmentation for Code Generation Tasks
Pinzhen Chen
Gerasimos Lampouras
29
9
0
05 Feb 2023
LExecutor: Learning-Guided Execution
Beatriz Souza
Michael Pradel
SILM
ELM
21
13
0
05 Feb 2023
Transformers Meet Directed Graphs
Simon Geisler
Yujia Li
D. Mankowitz
A. Cemgil
Stephan Günnemann
Cosmin Paduraru
22
35
0
31 Jan 2023
Execution-based Code Generation using Deep Reinforcement Learning
Parshin Shojaee
Aneesh Jain
Sindhu Tipirneni
Chandan K. Reddy
23
50
0
31 Jan 2023
FLAME: A small language model for spreadsheet formulas
Harshit Joshi
Abishai Ebenezer
J. Cambronero
Sumit Gulwani
Aditya Kanade
Vu Le
Ivan Radivcek
Gust Verbruggen
LMTD
29
12
0
31 Jan 2023
Which Features are Learned by CodeBert: An Empirical Study of the BERT-based Source Code Representation Learning
Lan Zhang
Chen Cao
Zhilong Wang
Peng Liu
SSL
6
3
0
20 Jan 2023
Recommending Root-Cause and Mitigation Steps for Cloud Incidents using Large Language Models
Toufique Ahmed
Supriyo Ghosh
Chetan Bansal
Thomas Zimmermann
Xuchao Zhang
Saravan Rajmohan
AI4CE
30
52
0
10 Jan 2023
Learning Compiler Pass Orders using Coreset and Normalized Value Prediction
Youwei Liang
Kevin R. Stone
A. Shameli
Chris Cummins
Mostafa Elhoushi
...
Benoit Steiner
Xiaomeng Yang
P. Xie
Hugh Leather
Yuandong Tian
9
9
0
09 Jan 2023
Stealthy Backdoor Attack for Code Models
Zhou Yang
Bowen Xu
Jie M. Zhang
Hong Jin Kang
Jieke Shi
Junda He
David Lo
AAML
13
65
0
06 Jan 2023
Code Difference Guided Adversarial Example Generation for Deep Code Models
Zhao Tian
Junjie Chen
Zhi Jin
AAML
18
17
0
06 Jan 2023
Serenity: Library Based Python Code Analysis for Code Completion and Automated Machine Learning
Wenting Zhao
Ibrahim Abdelaziz
Julian T Dolby
Kavitha Srinivas
M. Helali
Essam Mansour
11
0
0
05 Jan 2023
Extending Source Code Pre-Trained Language Models to Summarise Decompiled Binaries
Ali Al-Kaswan
Toufique Ahmed
M. Izadi
A. Sawant
Prem Devanbu
A. van Deursen
SyDa
96
32
0
04 Jan 2023
A Survey on Knowledge-Enhanced Pre-trained Language Models
Chaoqi Zhen
Yanlei Shang
Xiangyu Liu
Yifei Li
Yong Chen
Dell Zhang
VLM
KELM
24
3
0
27 Dec 2022
Improving Automated Program Repair with Domain Adaptation
Armin Zirak
Hadi Hemmati
16
9
0
21 Dec 2022
Generation-Augmented Query Expansion For Code Retrieval
Dong Li
Yelong Shen
Ruoming Jin
Yi Mao
Kuan-Chieh Jackson Wang
Weizhu Chen
RALM
26
8
0
20 Dec 2022
ReCode: Robustness Evaluation of Code Generation Models
Shiqi Wang
Zheng Li
Haifeng Qian
Cheng Yang
Zijian Wang
...
Parminder Bhatia
Ramesh Nallapati
M. K. Ramanathan
Dan Roth
Bing Xiang
13
80
0
20 Dec 2022
A Survey on Pretrained Language Models for Neural Code Intelligence
Yichen Xu
Yanqiao Zhu
4
17
0
20 Dec 2022
Unveiling Code Pre-Trained Models: Investigating Syntax and Semantics Capacities
Wei Ma
Shangqing Liu
Mengjie Zhao
Xiaofei Xie
Wenhan Wang
Q. Hu
Jiexin Zhang
Yang Liu
19
16
0
20 Dec 2022
CoCoMIC: Code Completion By Jointly Modeling In-file and Cross-file Context
Yangruibo Ding
Zijian Wang
Wasi Uddin Ahmad
M. K. Ramanathan
Ramesh Nallapati
Parminder Bhatia
Dan Roth
Bing Xiang
16
68
0
20 Dec 2022
MultiCoder: Multi-Programming-Lingual Pre-Training for Low-Resource Code Completion
Zi Gong
Yinpeng Guo
Pingyi Zhou
Cuiyun Gao
Yasheng Wang
Zenglin Xu
12
8
0
19 Dec 2022
JEMMA: An Extensible Java Dataset for ML4Code Applications
Anjan Karmakar
Miltiadis Allamanis
Romain Robbes
VLM
21
3
0
18 Dec 2022
DexBERT: Effective, Task-Agnostic and Fine-grained Representation Learning of Android Bytecode
Tiezhu Sun
Kevin Allix
Kisub Kim
Xin Zhou
Dongsun Kim
David Lo
Tegawende F. Bissyande
Jacques Klein
8
11
0
12 Dec 2022
Parameter-Efficient Finetuning of Transformers for Source Code
Shamil Ayupov
Nadezhda Chirkova
8
17
0
12 Dec 2022
A Survey on Natural Language Processing for Programming
Qingfu Zhu
Xianzhen Luo
Fang Liu
Cuiyun Gao
Wanxiang Che
23
1
0
12 Dec 2022
Codex Hacks HackerRank: Memorization Issues and a Framework for Code Synthesis Evaluation
Anjan Karmakar
Julian Aron Prenner
Marco DÁmbros
Romain Robbes
ELM
11
17
0
06 Dec 2022
Detect-Localize-Repair: A Unified Framework for Learning to Debug with CodeT5
Nghi D. Q. Bui
Yue Wang
Steven C. H. Hoi
18
15
0
27 Nov 2022
DeepVulSeeker: A Novel Vulnerability Identification Framework via Code Graph Structure and Pre-training Mechanism
Jin Wang
Hui Xiao
Shuwen Zhong
Yinhao Xiao
34
11
0
23 Nov 2022
When Language Model Meets Private Library
Daoguang Zan
Bei Chen
Zeqi Lin
Bei Guan
Yongji Wang
Jian-Guang Lou
ALM
74
71
0
31 Oct 2022
CodeEditor: Learning to Edit Source Code with Pre-trained Models
Jia Li
Ge Li
Zhuo Li
Zhi Jin
Xing Hu
Kechi Zhang
Zhiyi Fu
KELM
11
23
0
31 Oct 2022
Multi-lingual Evaluation of Code Generation Models
Ben Athiwaratkun
Sanjay Krishna Gouda
Zijian Wang
Xiaopeng Li
Yuchen Tian
...
Baishakhi Ray
Parminder Bhatia
Sudipta Sengupta
Dan Roth
Bing Xiang
ELM
112
160
0
26 Oct 2022
Benchmarking Language Models for Code Syntax Understanding
Da Shen
Xinyun Chen
Chenguang Wang
Koushik Sen
Dawn Song
ELM
14
16
0
26 Oct 2022
Global Contrastive Batch Sampling via Optimization on Sample Permutations
Vin Sachidananda
Ziyi Yang
Chenguang Zhu
8
4
0
23 Oct 2022
Exploring Representation-Level Augmentation for Code Search
Haochen Li
Chun Miao
Cyril Leung
Yanxian Huang
Yuan Huang
Hongyu Zhang
Yanlin Wang
45
19
0
21 Oct 2022
Soft-Labeled Contrastive Pre-training for Function-level Code Representation
Xiaonan Li
Daya Guo
Yeyun Gong
Yun Lin
Yelong Shen
Xipeng Qiu
Daxin Jiang
Weizhu Chen
Nan Duan
21
17
0
18 Oct 2022
Code Recommendation for Open Source Software Developers
Yiqiao Jin
Yunsheng Bai
Yanqiao Zhu
Yizhou Sun
Wei Wang
20
24
0
15 Oct 2022
Leveraging Artificial Intelligence on Binary Code Comprehension
Yifan Zhang
24
3
0
11 Oct 2022
Pre-Training Representations of Binary Code Using Contrastive Learning
Yifan Zhang
Chen Huang
Yueke Zhang
Kevin Cao
Scott Thomas Andersen
Huajie Shao
Kevin Leach
Yu Huang
37
3
0
11 Oct 2022
SimSCOOD: Systematic Analysis of Out-of-Distribution Generalization in Fine-tuned Source Code Models
Hossein Hajipour
Ning Yu
Cristian-Alexandru Staicu
Mario Fritz
OODD
19
4
0
10 Oct 2022
BAFFLE: Hiding Backdoors in Offline Reinforcement Learning Datasets
Chen Gong
Zhou Yang
Yunru Bai
Junda He
Jieke Shi
...
Arunesh Sinha
Bowen Xu
Xinwen Hou
David Lo
Guoliang Fan
AAML
OffRL
16
7
0
07 Oct 2022
Previous
1
2
3
4
5
6
7
8
9
Next