arXiv: 2002.08155
CodeBERT: A Pre-Trained Model for Programming and Natural Languages
19 February 2020
Zhangyin Feng
Daya Guo
Duyu Tang
Nan Duan
Xiaocheng Feng
Ming Gong
Linjun Shou
Bing Qin
Ting Liu
Daxin Jiang
Ming Zhou
Papers citing "CodeBERT: A Pre-Trained Model for Programming and Natural Languages"
50 / 222 papers shown
JaCoText: A Pretrained Model for Java Code-Text Generation
Jessica Nayeli López Espejel
Mahaman Sanoussi Yahaya Alassan
Walid Dahhane
E. Ettifouri
27
3
0
22 Mar 2023
Implant Global and Local Hierarchy Information to Sequence based Code Representation Models
Kechi Zhang
Zhuo Li
Zhi Jin
Ge Li
21
6
0
14 Mar 2023
A Study of Variable-Role-based Feature Enrichment in Neural Models of Code
Aftab Hussain
Md Rafiqul Islam Rabin
Bowen Xu
David Lo
Mohammad Amin Alipour
34
3
0
08 Mar 2023
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT
Yihan Cao
Siyu Li
Yixin Liu
Zhiling Yan
Yutong Dai
Philip S. Yu
Lichao Sun
24
501
0
07 Mar 2023
ADELT: Transpilation Between Deep Learning Frameworks
Linyuan Gong
Jiayi Wang
Alvin Cheung
30
3
0
07 Mar 2023
On ML-Based Program Translation: Perils and Promises
Aniketh Malyala
K. Zhou
Baishakhi Ray
Saikat Chakraborty
24
5
0
21 Feb 2023
CodeBERTScore: Evaluating Code Generation with Pretrained Models of Code
Shuyan Zhou
Uri Alon
Sumit Agarwal
Graham Neubig
ELM
ALM
22
98
0
10 Feb 2023
CrossCodeBench: Benchmarking Cross-Task Generalization of Source Code Models
Changan Niu
Chuanyi Li
Vincent Ng
Bin Luo
ELM
ALM
32
9
0
08 Feb 2023
CodeLMSec Benchmark: Systematically Evaluating and Finding Security Vulnerabilities in Black-Box Code Language Models
Hossein Hajipour
Keno Hassler
Thorsten Holz
Lea Schonherr
Mario Fritz
ELM
34
19
0
08 Feb 2023
Exploring Data Augmentation for Code Generation Tasks
Pinzhen Chen
Gerasimos Lampouras
29
9
0
05 Feb 2023
VuLASTE: Long Sequence Model with Abstract Syntax Tree Embedding for vulnerability Detection
Botong Zhu
Huobin Tan
15
0
0
05 Feb 2023
Which Features are Learned by CodeBert: An Empirical Study of the BERT-based Source Code Representation Learning
Lan Zhang
Chen Cao
Zhilong Wang
Peng Liu
SSL
6
3
0
20 Jan 2023
Learning Compiler Pass Orders using Coreset and Normalized Value Prediction
Youwei Liang
Kevin R. Stone
A. Shameli
Chris Cummins
Mostafa Elhoushi
...
Benoit Steiner
Xiaomeng Yang
P. Xie
Hugh Leather
Yuandong Tian
9
9
0
09 Jan 2023
TrojanPuzzle: Covertly Poisoning Code-Suggestion Models
H. Aghakhani
Wei Dai
Andre Manoel
Xavier Fernandes
Anant Kharkar
Christopher Kruegel
Giovanni Vigna
David E. Evans
B. Zorn
Robert Sim
SILM
19
33
0
06 Jan 2023
Don't Generate, Discriminate: A Proposal for Grounding Language Models to Real-World Environments
Yu Gu
Xiang Deng
Yu-Chuan Su
LLMAG
26
52
0
19 Dec 2022
MultiCoder: Multi-Programming-Lingual Pre-Training for Low-Resource Code Completion
Zi Gong
Yinpeng Guo
Pingyi Zhou
Cuiyun Gao
Yasheng Wang
Zenglin Xu
12
8
0
19 Dec 2022
JEMMA: An Extensible Java Dataset for ML4Code Applications
Anjan Karmakar
Miltiadis Allamanis
Romain Robbes
VLM
21
3
0
18 Dec 2022
Plansformer: Generating Symbolic Plans using Transformers
Vishal Pallagani
Bharath Muppasani
K. Murugesan
F. Rossi
L. Horesh
Biplav Srivastava
F. Fabiano
Andrea Loreggia
LM&Ro
LLMAG
OffRL
15
35
0
16 Dec 2022
An Empirical Study of Deep Learning Models for Vulnerability Detection
Benjamin Steenhoek
Md. Mahbubur Rahman
Richard Jiles
Wei Le
ELM
AAML
18
77
0
15 Dec 2022
Dataflow Analysis-Inspired Deep Learning for Efficient Vulnerability Detection
Benjamin Steenhoek
Hongyang Gao
Wei Le
27
27
0
15 Dec 2022
ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages
Yekun Chai
Shuohuan Wang
Chao Pang
Yu Sun
Hao Tian
Hua-Hong Wu
24
35
0
13 Dec 2022
DexBERT: Effective, Task-Agnostic and Fine-grained Representation Learning of Android Bytecode
Tiezhu Sun
Kevin Allix
Kisub Kim
Xin Zhou
Dongsun Kim
David Lo
Tegawende F. Bissyande
Jacques Klein
8
11
0
12 Dec 2022
A Survey on Natural Language Processing for Programming
Qingfu Zhu
Xianzhen Luo
Fang Liu
Cuiyun Gao
Wanxiang Che
23
1
0
12 Dec 2022
Evaluating How Fine-tuning on Bimodal Data Effects Code Generation
Gabriel Orlanski
Seonhye Yang
Michael Healy
ALM
21
5
0
15 Nov 2022
MPCFormer: fast, performant and private Transformer inference with MPC
Dacheng Li
Rulin Shao
Hongyi Wang
Han Guo
Eric P. Xing
Haotong Zhang
13
79
0
02 Nov 2022
A Simple, Yet Effective Approach to Finding Biases in Code Generation
Spyridon Mouselinos
Mateusz Malinowski
Henryk Michalewski
10
7
0
31 Oct 2022
Poison Attack and Defense on Deep Source Code Processing Models
Jia Li
Zhuo Li
Huangzhao Zhang
Ge Li
Zhi Jin
Xing Hu
Xin Xia
AAML
33
16
0
31 Oct 2022
Reading Between the Lines: Modeling User Behavior and Costs in AI-Assisted Programming
Hussein Mozannar
Gagan Bansal
Adam Fourney
Eric Horvitz
49
109
0
25 Oct 2022
ObSynth: An Interactive Synthesis System for Generating Object Models from Natural Language Specifications
Alex Gu
Tamara Mitrovska
D. Vélez
Jacob Andreas
Armando Solar-Lezama
SyDa
25
1
0
20 Oct 2022
Code Recommendation for Open Source Software Developers
Yiqiao Jin
Yunsheng Bai
Yanqiao Zhu
Yizhou Sun
Wei Wang
20
24
0
15 Oct 2022
Leveraging Artificial Intelligence on Binary Code Comprehension
Yifan Zhang
24
3
0
11 Oct 2022
BAFFLE: Hiding Backdoors in Offline Reinforcement Learning Datasets
Chen Gong
Zhou Yang
Yunru Bai
Junda He
Jieke Shi
...
Arunesh Sinha
Bowen Xu
Xinwen Hou
David Lo
Guoliang Fan
AAML
OffRL
16
7
0
07 Oct 2022
CAT-probing: A Metric-based Approach to Interpret How Pre-trained Models for Programming Language Attend Code Structure
Nuo Chen
Qiushi Sun
Renyu Zhu
Xiang Li
Xuesong Lu
Ming Gao
36
10
0
07 Oct 2022
MIXCODE: Enhancing Code Classification by Mixup-Based Data Augmentation
Zeming Dong
Qiang Hu
Yuejun Guo
Maxime Cordy
Mike Papadakis
Zhenya Zhang
Yves Le Traon
Jianjun Zhao
23
8
0
06 Oct 2022
Statement-Level Vulnerability Detection: Learning Vulnerability Patterns Through Information Theory and Contrastive Learning
Van Nguyen
Trung Le
C. Tantithamthavorn
Michael Fu
John C. Grundy
Hung Nguyen
S. Çamtepe
Paul Quirk
Dinh Q. Phung
39
4
0
20 Sep 2022
Malicious Source Code Detection Using Transformer
Chen Tsfaty
Michael Fire
29
4
0
16 Sep 2022
Exploring Code Style Transfer with Neural Networks
Karl Munson
Anish Savla
Chih-Kai Ting
Serenity Wade
Kiran Kate
Kavitha Srinivas
CLIP
8
0
0
13 Sep 2022
Don't Complete It! Preventing Unhelpful Code Completion for Productive and Sustainable Neural Code Completion Systems
Zhensu Sun
Xiaoning Du
Fu Song
Shangwen Wang
Mingze Ni
Li Li
21
10
0
13 Sep 2022
VulCurator: A Vulnerability-Fixing Commit Detector
Truong-Giang Nguyen
Thanh Le-Cong
Hong Jin Kang
X. Le
David Lo
13
20
0
07 Sep 2022
AutoPruner: Transformer-Based Call Graph Pruning
Thanh Le-Cong
Hong Jin Kang
Truong-Giang Nguyen
S. A. Haryono
David Lo
X. Le
H. Thang
25
19
0
07 Sep 2022
Lost at C: A User Study on the Security Implications of Large Language Model Code Assistants
Gustavo Sandoval
Hammond Pearce
Teo Nys
Ramesh Karri
S. Garg
Brendan Dolan-Gavitt
ELM
14
90
0
20 Aug 2022
MultiPL-E: A Scalable and Extensible Approach to Benchmarking Neural Code Generation
Federico Cassano
John Gouwar
Daniel Nguyen
S. Nguyen
Luna Phipps-Costin
...
Carolyn Jane Anderson
Molly Q. Feldman
Arjun Guha
Michael Greenberg
Abhinav Jangda
ELM
22
81
0
17 Aug 2022
CORNET: Learning Table Formatting Rules By Example
Mukul Singh
J. Cambronero
Sumit Gulwani
Vu Le
Carina Negreanu
Mohammad Raza
Gust Verbruggen
LMTD
43
8
0
11 Aug 2022
CSSAM:Code Search via Attention Matching of Code Semantics and Structures
Y. Hu
Bowen Cai
Yaoxiang Yu
13
3
0
08 Aug 2022
Code Comment Inconsistency Detection with BERT and Longformer
Theo Steiner
Rui Zhang
23
4
0
29 Jul 2022
Neurosymbolic Repair for Low-Code Formula Languages
Rohan Bavishi
Harshit Joshi
José Pablo Cambronero Sánchez
Anna Fariha
Sumit Gulwani
Vu Le
Ivan Radicek
A. Tiwari
11
13
0
24 Jul 2022
PanGu-Coder: Program Synthesis with Function-Level Language Modeling
Fenia Christopoulou
Gerasimos Lampouras
Milan Gritta
Guchun Zhang
Yinpeng Guo
...
Guangtai Liang
Jia Wei
Xin Jiang
Qianxiang Wang
Qun Liu
ELM
SyDa
ALM
27
74
0
22 Jul 2022
What does Transformer learn about source code?
Kechi Zhang
Ge Li
Zhi Jin
ViT
14
8
0
18 Jul 2022
Few-shot training LLMs for project-specific code-summarization
Toufique Ahmed
Prem Devanbu
179
213
0
09 Jul 2022
CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning
Hung Le
Yue Wang
Akhilesh Deepak Gotmare
Silvio Savarese
S. Hoi
SyDa
ALM
124
237
0
05 Jul 2022