ResearchTrend.AI
GraphCodeBERT: Pre-training Code Representations with Data Flow


17 September 2020
Daya Guo
Shuo Ren
Shuai Lu
Zhangyin Feng
Duyu Tang
Shujie Liu
Long Zhou
Nan Duan
Alexey Svyatkovskiy
Shengyu Fu
Michele Tufano
Shao Kun Deng
Colin B. Clement
Dawn Drain
Neel Sundaresan
Jian Yin
Daxin Jiang
M. Zhou

Papers citing "GraphCodeBERT: Pre-training Code Representations with Data Flow"

50 / 403 papers shown
CAT-probing: A Metric-based Approach to Interpret How Pre-trained Models for Programming Language Attend Code Structure
Nuo Chen
Qiushi Sun
Renyu Zhu
Xiang Li
Xuesong Lu
Ming Gao
36
10
0
07 Oct 2022
MIXCODE: Enhancing Code Classification by Mixup-Based Data Augmentation
Zeming Dong
Qiang Hu
Yuejun Guo
Maxime Cordy
Mike Papadakis
Zhenya Zhang
Yves Le Traon
Jianjun Zhao
23
8
0
06 Oct 2022
ContraCLM: Contrastive Learning For Causal Language Model
Nihal Jain
Dejiao Zhang
Wasi Uddin Ahmad
Zijian Wang
Feng Nan
...
Ramesh Nallapati
Baishakhi Ray
Parminder Bhatia
Xiaofei Ma
Bing Xiang
23
16
0
03 Oct 2022
CodeQueries: A Dataset of Semantic Queries over Code
Surya Prakash Sahu
Madhurima Mandal
Shikhar Bharadwaj
Aditya Kanade
Petros Maniatis
S. Shevade
17
4
0
17 Sep 2022
Semantic-Preserving Adversarial Code Comprehension
Yiyang Li
Hongqiu Wu
Hai Zhao
AAML
14
7
0
12 Sep 2022
Generalizability of Code Clone Detection on CodeBERT
Tim Sonnekalb
Bernd Gruner
C. Brust
Patrick Mäder
12
14
0
26 Aug 2022
Topical: Learning Repository Embeddings from Source Code using Attention
Agathe Lherondelle
Varun Babbar
Yash Satsangi
Fran Silavong
Shaltiel Eloul
Sean J. Moran
19
0
0
19 Aug 2022
Learning Program Representations with a Tree-Structured Transformer
Wenhan Wang
Kechi Zhang
Ge Li
Shangqing Liu
Anran Li
Zhi Jin
Yang Liu
31
5
0
18 Aug 2022
CommitBART: A Large Pre-trained Model for GitHub Commits
Shangqing Liu
Yanzhou Li
Xiaofei Xie
Yang Liu
VLM
AI4TS
21
18
0
17 Aug 2022
A Library for Representing Python Programs as Graphs for Machine Learning
David Bieber
Kensen Shi
Petros Maniatis
Charles Sutton
Vincent J. Hellendoorn
Daniel D. Johnson
Daniel Tarlow
GNN
AI4CE
15
5
0
15 Aug 2022
Finding Reusable Machine Learning Components to Build Programming Language Processing Pipelines
Patrick Flynn
T. Vanderbruggen
C. Liao
Pei-Hung Lin
M. Emani
Xipeng Shen
19
4
0
11 Aug 2022
CoditT5: Pretraining for Source Code and Natural Language Editing
Jiyang Zhang
Sheena Panthaplackel
Pengyu Nie
Junyi Jessy Li
Miloš Gligorić
KELM
17
88
0
10 Aug 2022
Multi-View Pre-Trained Model for Code Vulnerability Identification
Xuxia Jiang
Yinhao Xiao
Jun Wang
Wei Zhang
30
1
0
10 Aug 2022
Learning to Learn to Predict Performance Regressions in Production at Meta
M. Beller
Hongyu Li
V. Nair
V. Murali
Imad Ahmad
Jürgen Cito
Drew Carlson
Gareth Ari Aye
Wes Dyer
26
5
0
08 Aug 2022
Code Comment Inconsistency Detection with BERT and Longformer
Theo Steiner
Rui Zhang
23
4
0
29 Jul 2022
No More Fine-Tuning? An Experimental Evaluation of Prompt Tuning in Code Intelligence
Chaozheng Wang
Yuanhang Yang
Cuiyun Gao
Yun Peng
Hongyu Zhang
Michael R. Lyu
AAML
52
134
0
24 Jul 2022
PanGu-Coder: Program Synthesis with Function-Level Language Modeling
Fenia Christopoulou
Gerasimos Lampouras
Milan Gritta
Guchun Zhang
Yinpeng Guo
...
Guangtai Liang
Jia Wei
Xin Jiang
Qianxiang Wang
Qun Liu
ELM
SyDa
ALM
27
74
0
22 Jul 2022
What does Transformer learn about source code?
Kechi Zhang
Ge Li
Zhi Jin
ViT
14
8
0
18 Jul 2022
Few-shot training LLMs for project-specific code-summarization
Toufique Ahmed
Prem Devanbu
179
213
0
09 Jul 2022
Repository-Level Prompt Generation for Large Language Models of Code
Disha Shrivastava
Hugo Larochelle
Daniel Tarlow
15
137
0
26 Jun 2022
AST-Probe: Recovering abstract syntax trees from hidden representations of pre-trained language models
José Antonio Hernández López
M. Weyssow
Jesús Sánchez Cuadrado
H. Sahraoui
22
22
0
23 Jun 2022
NatGen: Generative pre-training by "Naturalizing" source code
Saikat Chakraborty
Toufique Ahmed
Yangruibo Ding
Prem Devanbu
Baishakhi Ray
AI4CE
48
116
0
15 Jun 2022
CERT: Continual Pre-Training on Sketches for Library-Oriented Code Generation
Daoguang Zan
Bei Chen
Dejian Yang
Zeqi Lin
Minsu Kim
Bei Guan
Yongji Wang
Weizhu Chen
Jian-Guang Lou
12
120
0
14 Jun 2022
Multimodal Learning with Transformers: A Survey
P. Xu
Xiatian Zhu
David A. Clifton
ViT
41
525
0
13 Jun 2022
CodeS: Towards Code Model Generalization Under Distribution Shift
Qiang Hu
Yuejun Guo
Xiaofei Xie
Maxime Cordy
Lei Ma
Mike Papadakis
Yves Le Traon
OOD
28
10
0
11 Jun 2022
StructCoder: Structure-Aware Transformer for Code Generation
Sindhu Tipirneni
Ming Zhu
Chandan K. Reddy
28
55
0
10 Jun 2022
Fault-Aware Neural Code Rankers
J. Inala
Chenglong Wang
Mei Yang
Andrés Codas
Mark Encarnación
Shuvendu K. Lahiri
Madan Musuvathi
Jianfeng Gao
ALM
16
41
0
04 Jun 2022
Code Generation Tools (Almost) for Free? A Study of Few-Shot, Pre-Trained Language Models on Code
Patrick Bareiss
Beatriz Souza
Marcelo d’Amorim
Michael Pradel
ELM
10
76
0
02 Jun 2022
Learning code summarization from a small and local dataset
Toufique Ahmed
Prem Devanbu
35
9
0
02 Jun 2022
CodeAttack: Code-Based Adversarial Attacks for Pre-trained Programming Language Models
Akshita Jha
Chandan K. Reddy
SILM
ELM
AAML
25
58
0
31 May 2022
HierarchyNet: Learning to Summarize Source Code with Heterogeneous Representations
Minh Huynh Nguyen
Nghi D. Q. Bui
Truong Son-Hy
Long Tran-Thanh
Tien N. Nguyen
32
4
0
31 May 2022
Understanding Long Programming Languages with Structure-Aware Sparse Attention
Tingting Liu
Chengyu Wang
Cen Chen
Ming Gao
Aoying Zhou
17
3
0
27 May 2022
VulBERTa: Simplified Source Code Pre-Training for Vulnerability Detection
Hazim Hanif
S. Maffeis
58
95
0
25 May 2022
Deep Learning Meets Software Engineering: A Survey on Pre-Trained Models of Source Code
Changan Niu
Chuanyi Li
Bin Luo
Vincent Ng
SyDa
VLM
47
48
0
24 May 2022
Summarize and Generate to Back-translate: Unsupervised Translation of Programming Languages
Wasi Uddin Ahmad
Saikat Chakraborty
Baishakhi Ray
Kai-Wei Chang
36
27
0
23 May 2022
AdaptivePaste: Code Adaptation through Learning Semantics-aware Variable Usage Representations
Xiaoyu Liu
Jinu Jang
Neel Sundaresan
Miltiadis Allamanis
Alexey Svyatkovskiy
8
2
0
23 May 2022
NS3: Neuro-Symbolic Semantic Code Search
Shushan Arakelyan
Anna Hakhverdyan
Miltiadis Allamanis
Luis Garcia
Christophe Hauser
Xiang Ren
81
9
0
21 May 2022
CODE-MVP: Learning to Represent Source Code from Multiple Views with Contrastive Pre-Training
Xin Wang
Yasheng Wang
Yao Wan
Jiawei Wang
Pingyi Zhou
Li Li
Hao Wu
Jin Liu
19
33
0
04 May 2022
A Survey of Deep Learning Models for Structural Code Understanding
Ruoting Wu
Yu-xin Zhang
Qibiao Peng
Liang Chen
Zibin Zheng
14
6
0
03 May 2022
On The Cross-Modal Transfer from Natural Language to Code through Adapter Modules
Divyam Goel
Raman Grover
Fatemeh H. Fard
20
18
0
19 Apr 2022
Addressing Leakage in Self-Supervised Contextualized Code Retrieval
Johannes Villmow
Viola Campos
A. Ulges
Ulrich Schwanecke
14
3
0
17 Apr 2022
Fix Bugs with Transformer through a Neural-Symbolic Edit Grammar
Yaojie Hu
Xingjian Shi
Qiang Zhou
Lee Pike
KELM
6
13
0
13 Apr 2022
Characterizing and Understanding the Behavior of Quantized Models for Reliable Deployment
Qiang Hu
Yuejun Guo
Maxime Cordy
Xiaofei Xie
Wei Ma
Mike Papadakis
Yves Le Traon
MQ
28
1
0
08 Apr 2022
CoCoSoDa: Effective Contrastive Learning for Code Search
Ensheng Shi
Yanlin Wang
Wenchao Gu
Lun Du
Hongyu Zhang
Shi Han
Dongmei Zhang
Hongbin Sun
28
32
0
07 Apr 2022
Transformer-Based Language Models for Software Vulnerability Detection
Chandra Thapa
Seung Ick Jang
Muhammad Ejaz Ahmed
S. Çamtepe
J. Pieprzyk
Surya Nepal
26
96
0
07 Apr 2022
An Exploratory Study on Code Attention in BERT
Rishab Sharma
Fuxiang Chen
Fatemeh H. Fard
David Lo
19
25
0
05 Apr 2022
On the Transferability of Pre-trained Language Models for Low-Resource Programming Languages
Fuxiang Chen
F. Fard
David Lo
T. Bryksin
18
43
0
05 Apr 2022
Accelerating Code Search with Deep Hashing and Code Classification
Wenchao Gu
Yanlin Wang
Lun Du
Hongyu Zhang
Shi Han
Dongmei Zhang
Michael R. Lyu
22
16
0
29 Mar 2022
Graph-Text Multi-Modal Pre-training for Medical Representation Learning
Sungjin Park
Seongsu Bae
Jiho Kim
Tackeun Kim
E. Choi
17
16
0
18 Mar 2022
Automating Code Review Activities by Large-Scale Pre-training
Zhiyu Li
Shuai Lu
Daya Guo
Nan Duan
Shailesh Jannu
...
Deep Majumder
Jared Green
Alexey Svyatkovskiy
Shengyu Fu
Neel Sundaresan
VLM
18
138
0
17 Mar 2022