ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.08366
  4. Cited By
GraphCodeBERT: Pre-training Code Representations with Data Flow

GraphCodeBERT: Pre-training Code Representations with Data Flow

17 September 2020
Daya Guo
Shuo Ren
Shuai Lu
Zhangyin Feng
Duyu Tang
Shujie Liu
Long Zhou
Nan Duan
Alexey Svyatkovskiy
Shengyu Fu
Michele Tufano
Shao Kun Deng
Colin B. Clement
Dawn Drain
Neel Sundaresan
Jian Yin
Daxin Jiang
M. Zhou
ArXivPDFHTML

Papers citing "GraphCodeBERT: Pre-training Code Representations with Data Flow"

50 / 403 papers shown
Title
Enhancing Pre-Trained Language Models for Vulnerability Detection via
  Semantic-Preserving Data Augmentation
Enhancing Pre-Trained Language Models for Vulnerability Detection via Semantic-Preserving Data Augmentation
Weiliang Qi
Jiahao Cao
Darsh Poddar
Sophia Li
Xinda Wang
19
0
0
30 Sep 2024
zsLLMCode: An Effective Approach for Code Embedding via LLM with Zero-Shot Learning
zsLLMCode: An Effective Approach for Code Embedding via LLM with Zero-Shot Learning
Zixiang Xian
Chenhui Cui
Rubing Huang
Chunrong Fang
Zhenyu Chen
16
0
0
23 Sep 2024
HexaCoder: Secure Code Generation via Oracle-Guided Synthetic Training
  Data
HexaCoder: Secure Code Generation via Oracle-Guided Synthetic Training Data
Hossein Hajipour
Lea Schönherr
Thorsten Holz
Mario Fritz
AAML
SyDa
26
0
0
10 Sep 2024
HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks
  at Scale
HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale
Huy N. Phan
Phong X. Nguyen
Nghi D. Q. Bui
LLMAG
33
10
0
09 Sep 2024
GALLa: Graph Aligned Large Language Models for Improved Source Code
  Understanding
GALLa: Graph Aligned Large Language Models for Improved Source Code Understanding
Ziyin Zhang
Hang Yu
Shijie Li
Peng Di
Jianguo Li
Rui Wang
25
2
0
06 Sep 2024
Unintentional Security Flaws in Code: Automated Defense via Root Cause
  Analysis
Unintentional Security Flaws in Code: Automated Defense via Root Cause Analysis
Nafis Tanveer Islam
Mazal Bethany
Dylan Manuel
Murtuza Jadliwala
Peyman Najafirad
25
0
0
30 Aug 2024
A Joint Learning Model with Variational Interaction for Multilingual
  Program Translation
A Joint Learning Model with Variational Interaction for Multilingual Program Translation
Yali Du
Hui Sun
Ming Li
25
2
0
25 Aug 2024
Understanding Defects in Generated Codes by Language Models
Understanding Defects in Generated Codes by Language Models
Ali Mohammadi Esfahani
N. Kahani
S. Ajila
20
1
0
23 Aug 2024
Top Pass: Improve Code Generation by Pass@k-Maximized Code Ranking
Top Pass: Improve Code Generation by Pass@k-Maximized Code Ranking
Zhi-Cun Lyu
Xin-Ye Li
Zheng Xie
Ming Li
42
7
0
11 Aug 2024
ViC: Virtual Compiler Is All You Need For Assembly Code Search
ViC: Virtual Compiler Is All You Need For Assembly Code Search
Zeyu Gao
Hao Wang
Yuanda Wang
Chao Zhang
28
1
0
10 Aug 2024
Retrieval-augmented code completion for local projects using large
  language models
Retrieval-augmented code completion for local projects using large language models
Marko Hostnik
Marko Robnik-Sikonja
RALM
27
0
0
09 Aug 2024
From Generalist to Specialist: Exploring CWE-Specific Vulnerability
  Detection
From Generalist to Specialist: Exploring CWE-Specific Vulnerability Detection
Syafiq Al Atiiq
Christian Gehrmann
Kevin Dahlén
Karim Khalil
21
1
0
05 Aug 2024
LLM Agents Improve Semantic Code Search
LLM Agents Improve Semantic Code Search
Sarthak Jain
Aditya Dora
Ka Seng Sam
Prabhat Singh
AIFin
26
5
0
05 Aug 2024
Vulnerability Detection in Ethereum Smart Contracts via Machine
  Learning: A Qualitative Analysis
Vulnerability Detection in Ethereum Smart Contracts via Machine Learning: A Qualitative Analysis
Dalila Ressi
Alvise Spanò
Lorenzo Benetollo
Carla Piazza
M. Bugliesi
Sabina Rossi
29
1
0
26 Jul 2024
BLAZE: Cross-Language and Cross-Project Bug Localization via Dynamic
  Chunking and Hard Example Learning
BLAZE: Cross-Language and Cross-Project Bug Localization via Dynamic Chunking and Hard Example Learning
Partha Chakraborty
Mahmoud Alfadel
Mei Nagappan
12
2
0
24 Jul 2024
Comparison of Static Application Security Testing Tools and Large
  Language Models for Repo-level Vulnerability Detection
Comparison of Static Application Security Testing Tools and Large Language Models for Repo-level Vulnerability Detection
Xin Zhou
Duc-Manh Tran
Thanh Le-Cong
Ting Zhang
I. Irsan
Joshua Sumarlin
Bach Le
David Lo
ELM
16
10
0
23 Jul 2024
Curriculum Learning for Small Code Language Models
Curriculum Learning for Small Code Language Models
Marwa Nair
K. Yamani
Lynda Said Lhadj
Riyadh Baghdadi
24
3
0
14 Jul 2024
DeCE: Deceptive Cross-Entropy Loss Designed for Defending Backdoor
  Attacks
DeCE: Deceptive Cross-Entropy Loss Designed for Defending Backdoor Attacks
Guang Yang
Yu Zhou
Xiang Chen
Xiangyu Zhang
Terry Yue Zhuo
David Lo
Taolue Chen
AAML
47
4
0
12 Jul 2024
DeepCodeProbe: Towards Understanding What Models Trained on Code Learn
DeepCodeProbe: Towards Understanding What Models Trained on Code Learn
Vahid Majdinasab
Amin Nikanjam
Foutse Khomh
38
1
0
11 Jul 2024
Prompting Techniques for Secure Code Generation: A Systematic Investigation
Prompting Techniques for Secure Code Generation: A Systematic Investigation
Catherine Tony
Nicolás E. Díaz Ferreyra
Markus Mutas
Salem Dhiff
Riccardo Scandariato
SILM
69
9
0
09 Jul 2024
Looking into Black Box Code Language Models
Looking into Black Box Code Language Models
Muhammad Umair Haider
Umar Farooq
A. B. Siddique
Mark Marron
39
2
0
05 Jul 2024
ESALE: Enhancing Code-Summary Alignment Learning for Source Code
  Summarization
ESALE: Enhancing Code-Summary Alignment Learning for Source Code Summarization
Chunrong Fang
Weisong Sun
Yuchen Chen
Xiao Chen
Zhao Wei
Quanjun Zhang
Yudu You
Bin Luo
Yang Liu
Zhenyu Chen
AI4TS
40
12
0
01 Jul 2024
GraphArena: Evaluating and Exploring Large Language Models on Graph Computation
GraphArena: Evaluating and Exploring Large Language Models on Graph Computation
Jianheng Tang
Qifan Zhang
Yuhan Li
Nuo Chen
Jia Li
19
2
0
29 Jun 2024
NARRepair: Non-Autoregressive Code Generation Model for Automatic
  Program Repair
NARRepair: Non-Autoregressive Code Generation Model for Automatic Program Repair
Zhenyu Yang
Zhen Yang
Zhongxing Yu
32
1
0
24 Jun 2024
SimClone: Detecting Tabular Data Clones using Value Similarity
SimClone: Detecting Tabular Data Clones using Value Similarity
Xu Yang
Gopi Krishnan Rajbahadur
Dayi Lin
Shaowei Wang
Zhen Ming
Jiang
18
1
0
24 Jun 2024
Toward Exploring the Code Understanding Capabilities of Pre-trained Code
  Generation Models
Toward Exploring the Code Understanding Capabilities of Pre-trained Code Generation Models
Jiayi Lin
Yutao Xie
Yue Yu
Yibiao Yang
Lei Zhang
SyDa
19
0
0
18 Jun 2024
A Critical Study of What Code-LLMs (Do Not) Learn
A Critical Study of What Code-LLMs (Do Not) Learn
Abhinav Anand
Shweta Verma
Krishna Narasimhan
Mira Mezini
40
4
0
17 Jun 2024
AgileCoder: Dynamic Collaborative Agents for Software Development based
  on Agile Methodology
AgileCoder: Dynamic Collaborative Agents for Software Development based on Agile Methodology
Minh Huynh Nguyen
Thang Phan Chau
Phong X. Nguyen
Nghi D. Q. Bui
26
11
0
16 Jun 2024
Cross-Modality Program Representation Learning for Electronic Design
  Automation with High-Level Synthesis
Cross-Modality Program Representation Learning for Electronic Design Automation with High-Level Synthesis
Zongyue Qin
Yunsheng Bai
Atefeh Sohrabizadeh
Zijian Ding
Ziniu Hu
Yizhou Sun
Jason Cong
28
1
0
13 Jun 2024
Estimating Difficulty Levels of Programming Problems with Pre-trained
  Model
Estimating Difficulty Levels of Programming Problems with Pre-trained Model
Zhiyuan Wang
Wei Zhang
Jun Wang
21
0
0
13 Jun 2024
Scaling Automatic Extraction of Pseudocode
Scaling Automatic Extraction of Pseudocode
Levent Toksoz
Gang Tan
C. L. Giles
25
0
0
07 Jun 2024
Enhancing Size Generalization in Graph Neural Networks through
  Disentangled Representation Learning
Enhancing Size Generalization in Graph Neural Networks through Disentangled Representation Learning
Zheng Huang
Qihui Yang
Dawei Zhou
Yujun Yan
AI4CE
28
2
0
07 Jun 2024
Generalization-Enhanced Code Vulnerability Detection via Multi-Task
  Instruction Fine-Tuning
Generalization-Enhanced Code Vulnerability Detection via Multi-Task Instruction Fine-Tuning
Xiaohu Du
Ming Wen
Jiahao Zhu
Zifan Xie
Bin Ji
Huijun Liu
Xuanhua Shi
Hai Jin
34
14
0
06 Jun 2024
Enhancing Repository-Level Code Generation with Integrated Contextual
  Information
Enhancing Repository-Level Code Generation with Integrated Contextual Information
Zhiyuan Pan
Xing Hu
Xin Xia
Xiaohu Yang
26
3
0
05 Jun 2024
Focus on the Core: Efficient Attention via Pruned Token Compression for
  Document Classification
Focus on the Core: Efficient Attention via Pruned Token Compression for Document Classification
Jungmin Yun
Mihyeon Kim
Youngbin Kim
69
9
0
03 Jun 2024
A Survey on Large Language Models for Code Generation
A Survey on Large Language Models for Code Generation
Juyong Jiang
Fan Wang
Jiasi Shen
Sungju Kim
Sunghun Kim
40
159
0
01 Jun 2024
Uncovering LLM-Generated Code: A Zero-Shot Synthetic Code Detector via
  Code Rewriting
Uncovering LLM-Generated Code: A Zero-Shot Synthetic Code Detector via Code Rewriting
Tong Ye
Yangkai Du
Tengfei Ma
Lingfei Wu
Xuhong Zhang
Shouling Ji
Wenhai Wang
DeLMO
35
6
0
25 May 2024
Large Language Models for Cyber Security: A Systematic Literature Review
Large Language Models for Cyber Security: A Systematic Literature Review
HanXiang Xu
Shenao Wang
Ningke Li
K. Wang
Yanjie Zhao
Kai Chen
Ting Yu
Yang Janet Liu
H. Wang
29
23
0
08 May 2024
Refining Joint Text and Source Code Embeddings for Retrieval Task with
  Parameter-Efficient Fine-Tuning
Refining Joint Text and Source Code Embeddings for Retrieval Task with Parameter-Efficient Fine-Tuning
Karim Galliamov
Leila Khaertdinova
Karina Denisova
29
1
0
07 May 2024
Advanced Detection of Source Code Clones via an Ensemble of Unsupervised
  Similarity Measures
Advanced Detection of Source Code Clones via an Ensemble of Unsupervised Similarity Measures
Jorge Martínez Gil
15
4
0
03 May 2024
On the Limitations of Embedding Based Methods for Measuring Functional
  Correctness for Code Generation
On the Limitations of Embedding Based Methods for Measuring Functional Correctness for Code Generation
Atharva Naik
38
2
0
26 Apr 2024
Graph Neural Networks for Vulnerability Detection: A Counterfactual
  Explanation
Graph Neural Networks for Vulnerability Detection: A Counterfactual Explanation
Zhaoyang Chu
Yao Wan
Qian Li
Yang Wu
Hongyu Zhang
Yulei Sui
Guandong Xu
Hai Jin
AAML
36
9
0
24 Apr 2024
VulEval: Towards Repository-Level Evaluation of Software Vulnerability
  Detection
VulEval: Towards Repository-Level Evaluation of Software Vulnerability Detection
Xinjie Wen
Xinchen Wang
Yujia Chen
Ruida Hu
David Lo
Cuiyun Gao
27
6
0
24 Apr 2024
On Unified Prompt Tuning for Request Quality Assurance in Public Code
  Review
On Unified Prompt Tuning for Request Quality Assurance in Public Code Review
Xinyu Chen
Lin Li
Rui Zhang
Peng Liang
27
1
0
11 Apr 2024
Structure-aware Fine-tuning for Code Pre-trained Models
Structure-aware Fine-tuning for Code Pre-trained Models
Jiayi Wu
Renyu Zhu
Nuo Chen
Qiushi Sun
Xiang Li
Ming Gao
35
2
0
11 Apr 2024
Analyzing the Performance of Large Language Models on Code Summarization
Analyzing the Performance of Large Language Models on Code Summarization
Rajarshi Haldar
J. Hockenmaier
38
16
0
10 Apr 2024
Open-Source AI-based SE Tools: Opportunities and Challenges of
  Collaborative Software Learning
Open-Source AI-based SE Tools: Opportunities and Challenges of Collaborative Software Learning
Zhihao Lin
Wei Ma
Tao Lin
Yaowen Zheng
Jingquan Ge
Jun Wang
Jacques Klein
Tegawende F. Bissyande
Yang Liu
Li Li
VLM
30
4
0
09 Apr 2024
Multi-modal Learning for WebAssembly Reverse Engineering
Multi-modal Learning for WebAssembly Reverse Engineering
Hanxian Huang
Jishen Zhao
27
2
0
04 Apr 2024
CSEPrompts: A Benchmark of Introductory Computer Science Prompts
CSEPrompts: A Benchmark of Introductory Computer Science Prompts
Md. Nishat Raihan
Dhiman Goswami
Sadiya Sayara Chowdhury Puspo
Christian D. Newman
Tharindu Ranasinghe
Marcos Zampieri
ELM
26
2
0
03 Apr 2024
An Empirical Study of Automated Vulnerability Localization with Large
  Language Models
An Empirical Study of Automated Vulnerability Localization with Large Language Models
Jian Zhang
Chong Wang
Anran Li
Weisong Sun
Cen Zhang
Wei Ma
Yang Liu
39
17
0
30 Mar 2024
Previous
123456789
Next