2002.08155
Cited By
CodeBERT: A Pre-Trained Model for Programming and Natural Languages
19 February 2020
Zhangyin Feng
Daya Guo
Duyu Tang
Nan Duan
Xiaocheng Feng
Ming Gong
Linjun Shou
Bing Qin
Ting Liu
Daxin Jiang
Ming Zhou
Papers citing
"CodeBERT: A Pre-Trained Model for Programming and Natural Languages"
50 / 222 papers shown
Title
GitHub Copilot AI pair programmer: Asset or Liability?
Arghavan Moradi Dakhel
Vahid Majdinasab
Amin Nikanjam
Foutse Khomh
Michel C. Desmarais
Zhen Ming (Jack) Jiang
26
331
0
30 Jun 2022
Test2Vec: An Execution Trace Embedding for Test Case Prioritization
E. Jabbar
Soheila Zangeneh
Hadi Hemmati
R. Feldt
41
4
0
28 Jun 2022
InvAASTCluster: On Applying Invariant-Based Program Clustering to Introductory Programming Assignments
Pedro Orvalho
Mikoláš Janota
Vasco M. Manquinho
19
7
0
28 Jun 2022
An Extractive-and-Abstractive Framework for Source Code Summarization
Weisong Sun
Chunrong Fang
Yuchen Chen
Quanjun Zhang
Guanhong Tao
Tingxu Han
Yifei Ge
Yudu You
Bin Luo
16
29
0
15 Jun 2022
CERT: Continual Pre-Training on Sketches for Library-Oriented Code Generation
Daoguang Zan
Bei Chen
Dejian Yang
Zeqi Lin
Minsu Kim
Bei Guan
Yongji Wang
Weizhu Chen
Jian-Guang Lou
12
120
0
14 Jun 2022
CodeS: Towards Code Model Generalization Under Distribution Shift
Qiang Hu
Yuejun Guo
Xiaofei Xie
Maxime Cordy
Lei Ma
Mike Papadakis
Yves Le Traon
OOD
28
10
0
11 Jun 2022
StructCoder: Structure-Aware Transformer for Code Generation
Sindhu Tipirneni
Ming Zhu
Chandan K. Reddy
28
55
0
10 Jun 2022
Code-DKT: A Code-based Knowledge Tracing Model for Programming Tasks
Yang Shi
Min Chi
Tiffany Barnes
T. Price
AI4Ed
31
23
0
07 Jun 2022
CodeAttack: Code-Based Adversarial Attacks for Pre-trained Programming Language Models
Akshita Jha
Chandan K. Reddy
SILM
ELM
AAML
25
58
0
31 May 2022
Leveraging Causal Inference for Explainable Automatic Program Repair
Jianzong Wang
Shijing Si
Z. Zhu
Xiaoyang Qu
Zhenhou Hong
Jing Xiao
6
3
0
26 May 2022
How to Find Actionable Static Analysis Warnings: A Case Study with FindBugs
Rahul Yedida
Hong Jin Kang
Huy Tu
Xueqi Yang
David Lo
Tim Menzies
17
12
0
21 May 2022
A Neural Network Architecture for Program Understanding Inspired by Human Behaviors
Renyu Zhu
Lei Yuan
Xiang Li
Ming Gao
Wenyuan Cai
19
8
0
10 May 2022
CODE-MVP: Learning to Represent Source Code from Multiple Views with Contrastive Pre-Training
Xin Wang
Yasheng Wang
Yao Wan
Jiawei Wang
Pingyi Zhou
Li Li
Hao Wu
Jin Liu
19
33
0
04 May 2022
InCoder: A Generative Model for Code Infilling and Synthesis
Daniel Fried
Armen Aghajanyan
Jessy Lin
Sida I. Wang
Eric Wallace
Freda Shi
Ruiqi Zhong
Wen-tau Yih
Luke Zettlemoyer
M. Lewis
SyDa
22
625
0
12 Apr 2022
CoCoSoDa: Effective Contrastive Learning for Code Search
Ensheng Shi
Yanlin Wang
Wenchao Gu
Lun Du
Hongyu Zhang
Shi Han
Dongmei Zhang
Hongbin Sun
28
32
0
07 Apr 2022
An Exploratory Study on Code Attention in BERT
Rishab Sharma
Fuxiang Chen
Fatemeh H. Fard
David Lo
19
25
0
05 Apr 2022
LAMNER: Code Comment Generation Using Character Language Model and Named Entity Recognition
Rishab Sharma
Fuxiang Chen
Fatemeh H. Fard
34
2
0
05 Apr 2022
Accelerating Code Search with Deep Hashing and Code Classification
Wenchao Gu
Yanlin Wang
Lun Du
Hongyu Zhang
Shi Han
Dongmei Zhang
Michael R. Lyu
22
16
0
29 Mar 2022
CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis
Erik Nijkamp
Bo Pang
Hiroaki Hayashi
Lifu Tu
Haiquan Wang
Yingbo Zhou
Silvio Savarese
Caiming Xiong
ELM
52
967
0
25 Mar 2022
LineVD: Statement-level Vulnerability Detection using Graph Neural Networks
David Hin
Andrey Kan
Huaming Chen
M. Babar
26
158
0
10 Mar 2022
UniXcoder: Unified Cross-Modal Pre-training for Code Representation
Daya Guo
Shuai Lu
Nan Duan
Yanlin Wang
Ming Zhou
Jian Yin
4
554
0
08 Mar 2022
From Natural Language to Simulations: Applying GPT-3 Codex to Automate Simulation Modeling of Logistics Systems
I. Jackson
M. J. Sáenz
6
8
0
24 Feb 2022
Neural Program Repair: Systems, Challenges and Solutions
Wenkang Zhong
Chuanyi Li
Jidong Ge
B. Luo
21
13
0
22 Feb 2022
Probing Pretrained Models of Source Code
Sergey Troshin
Nadezhda Chirkova
ELM
25
38
0
16 Feb 2022
Better Together? An Evaluation of AI-Supported Code Translation
Justin D. Weisz
Michael J. Muller
Steven I. Ross
Fernando Martinez
Stephanie Houde
Mayank Agarwal
Kartik Talamadupula
John T. Richards
29
67
0
15 Feb 2022
What Do They Capture? -- A Structural Analysis of Pre-Trained Language Models for Source Code
Yao Wan
Wei-Ye Zhao
Hongyu Zhang
Yulei Sui
Guandong Xu
Hairong Jin
27
105
0
14 Feb 2022
Competition-Level Code Generation with AlphaCode
Yujia Li
David Choi
Junyoung Chung
Nate Kushman
Julian Schrittwieser
...
Esme Sutherland Robson
Pushmeet Kohli
Nando de
Koray Kavukcuoglu
Oriol Vinyals
19
1,290
0
08 Feb 2022
Pre-Trained Neural Language Models for Automatic Mobile App User Feedback Answer Generation
Yue Cao
Fatemeh H. Fard
14
7
0
04 Feb 2022
Text and Code Embeddings by Contrastive Pre-Training
Arvind Neelakantan
Tao Xu
Raul Puri
Alec Radford
Jesse Michael Han
...
Tabarak Khan
Toki Sherbakov
Joanne Jang
Peter Welinder
Lilian Weng
SSL
AI4TS
213
421
0
24 Jan 2022
AstBERT: Enabling Language Model for Financial Code Understanding with Abstract Syntax Trees
Rong Liang
Tiehu Zhang
Y. Lu
Yuze Liu
Zhengqing Huang
Xin Chen
14
3
0
20 Jan 2022
Cross-Language Binary-Source Code Matching with Intermediate Representations
Yi Gui
Yao Wan
Hongyu Zhang
Huifang Huang
Yulei Sui
Guandong Xu
Zhiyuan Shao
Hai Jin
20
31
0
19 Jan 2022
Assemble Foundation Models for Automatic Code Summarization
Jian Gu
P. Salza
H. Gall
25
34
0
13 Jan 2022
VarCLR: Variable Semantic Representation Pre-training via Contrastive Learning
Qibin Chen
Jeremy Lacomis
Edward J. Schwartz
Graham Neubig
Bogdan Vasilescu
Claire Le Goues
VLM
13
33
0
05 Dec 2021
Bridging Pre-trained Models and Downstream Tasks for Source Code Understanding
Deze Wang
Zhouyang Jia
Shanshan Li
Yue Yu
Yun Xiong
Wei Dong
Xiangke Liao
17
80
0
04 Dec 2021
Multilingual training for Software Engineering
Toufique Ahmed
Prem Devanbu
57
73
0
03 Dec 2021
NER-BERT: A Pre-trained Model for Low-Resource Entity Tagging
Zihan Liu
Feijun Jiang
Yuxiang Hu
Chen Shi
Pascale Fung
16
37
0
01 Dec 2021
Federated Data Science to Break Down Silos [Vision]
Essam Mansour
Kavitha Srinivas
K. Hose
FedML
AI4CE
17
8
0
25 Nov 2021
DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing
Pengcheng He
Jianfeng Gao
Weizhu Chen
18
1,113
0
18 Nov 2021
FACOS: Finding API Relevant Contents on Stack Overflow with Semantic and Syntactic Analysis
K. Luong
M. Hadi
Ferdian Thung
Fatemeh H. Fard
David Lo
14
4
0
14 Nov 2021
Recent Advances in Natural Language Processing via Large Pre-Trained Language Models: A Survey
Bonan Min
Hayley L Ross
Elior Sulem
Amir Pouran Ben Veyseh
Thien Huu Nguyen
Oscar Sainz
Eneko Agirre
Ilana Heinz
Dan Roth
LM&MA
VLM
AI4CE
55
1,029
0
01 Nov 2021
Neural Program Generation Modulo Static Analysis
Rohan Mukherjee
Yeming Wen
Dipak Chaudhari
Thomas W. Reps
Swarat Chaudhuri
C. Jermaine
20
23
0
26 Oct 2021
Towards Learning (Dis)-Similarity of Source Code from Program Contrasts
Yangruibo Ding
Luca Buratti
Saurabh Pujar
Alessandro Morari
Baishakhi Ray
Saikat Chakraborty
8
36
0
08 Oct 2021
Learning Bill Similarity with Annotated and Augmented Corpora of Bills
Jiseon Kim
Elden Griggs
In Song Kim
Alice H. Oh
AILaw
18
5
0
14 Sep 2021
Software Vulnerability Detection via Deep Learning over Disaggregated Code Graph Representation
Yufan Zhuang
Sahil Suneja
Veronika Thost
Giacomo Domeniconi
Alessandro Morari
Jim Laredo
GNN
22
15
0
07 Sep 2021
CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation
Yue Wang
Weishi Wang
Shafiq R. Joty
S. Hoi
210
1,489
0
02 Sep 2021
AVATAR: A Parallel Corpus for Java-Python Program Translation
W. Ahmad
Md Golam Rahman Tushar
Saikat Chakraborty
Kai-Wei Chang
30
78
0
26 Aug 2021
What do pre-trained code models know about code?
Anjan Karmakar
Romain Robbes
ELM
18
86
0
25 Aug 2021
On Multi-Modal Learning of Editing Source Code
Saikat Chakraborty
Baishakhi Ray
KELM
16
58
0
15 Aug 2021
Predicting Patch Correctness Based on the Similarity of Failing Test Cases
Haoye Tian
Yinghua Li
Weiguo Pian
Abdoul Kader Kaboré
Kui Liu
Andrew Habib
Jacques Klein
Tegawende F. Bissyande
27
29
0
28 Jul 2021
ProtoTransformer: A Meta-Learning Approach to Providing Student Feedback
Mike Wu
Noah D. Goodman
Chris Piech
Chelsea Finn
16
19
0
23 Jul 2021