Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2002.08155
Cited By
v1
v2
v3
v4 (latest)
CodeBERT: A Pre-Trained Model for Programming and Natural Languages
Findings (Findings), 2020
19 February 2020
Zhangyin Feng
Daya Guo
Duyu Tang
Nan Duan
Xiaocheng Feng
Ming Gong
Linjun Shou
Bing Qin
Ting Liu
Daxin Jiang
Ming Zhou
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (2 upvotes)
Papers citing
"CodeBERT: A Pre-Trained Model for Programming and Natural Languages"
50 / 1,094 papers shown
Beyond Final Code: A Process-Oriented Error Analysis of Software Development Agents in Real-World GitHub Scenarios
Zhi Chen
Wei Ma
Lingxiao Jiang
LLMAG
ELM
LRM
348
5
0
10 Apr 2026
Natural Language Summarization Enables Multi-Repository Bug Localization by LLMs in Microservice Architectures
Amirkia Rafiei Oskooei
S. Selcan Yukcu
Mehmet Cevheri Bozoglan
Mehmet S. Aktas
113
1
0
05 Dec 2025
Learning to Code with Context: A Study-Based Approach
Uwe M. Borghoff
Mark Minas
Jannis Schopp
43
0
0
04 Dec 2025
MANTRA: a Framework for Multi-stage Adaptive Noise TReAtment During Training
ZiXiao Zhao
Fatemeh H. Fard
Jie JW Wu
NoLa
204
0
0
03 Dec 2025
EmoRAG: Evaluating RAG Robustness to Symbolic Perturbations
Xinyun Zhou
Xinfeng Li
Yinan Peng
Ming Xu
X. Zhang
...
X. Jia
Kun Wang
Qingsong Wen
Xiaofeng Wang
Wei Dong
AAML
184
2
0
01 Dec 2025
Beyond Code Pairs: Dialogue-Based Data Generation for LLM Code Translation
Le Chen
Nuo Xu
Winson X. Chen
Bin Lei
Pei-Hung Lin
Dunzhi Zhou
R. Thakur
Caiwen Ding
Ali Jannesari
Chunhua Liao
103
2
0
29 Nov 2025
Evaluating LLMs for One-Shot Patching of Real and Artificial Vulnerabilities
Aayush Garg
Zanis Ali Khan
Renzo Degiovanni
Qiang Tang
AAML
177
0
0
28 Nov 2025
CodeFlowLM: Incremental Just-In-Time Defect Prediction with Pretrained Language Models and Exploratory Insights into Defect Localization
Monique Louise Monteiro
George G. Cabral
Adriano L. I. OLiveira
97
0
0
28 Nov 2025
Retrieval-Augmented Few-Shot Prompting Versus Fine-Tuning for Code Vulnerability Detection
Fouad Trad
Ali Chehab
AAML
100
0
0
28 Nov 2025
BRIDGE: Building Representations In Domain Guided Program Synthesis
Robert Joseph George
Carson Eisenach
Udaya Ghai
Dominique C. Perrault-Joncas
A. Anandkumar
Dean Phillips Foster
ALM
LRM
488
0
0
26 Nov 2025
Optimizing LLM Code Suggestions: Feedback-Driven Timing with Lightweight State Bounds
Mohammad Nour Al Awad
Sergey Ivanov
Olga Tikhonova
99
0
0
24 Nov 2025
From Code Foundation Models to Agents and Applications: A Comprehensive Survey and Practical Guide to Code Intelligence
J. Yang
Wei Emma Zhang
Shark Liu
J. Wu
Shawn Guo
...
Zizheng Zhan
Jiajun Zhang
Jie Zhang
Zhaoxiang Zhang
Bo Zheng
LLMAG
ALM
ELM
877
0
0
23 Nov 2025
Effective Code Membership Inference for Code Completion Models via Adversarial Prompts
Yuan Jiang
Zehao Li
Shan Huang
Christoph Treude
Xiaohong Su
Tiantian Wang
AAML
340
1
0
19 Nov 2025
RulePilot: An LLM-Powered Agent for Security Rule Generation
Hongtai Wang
Ming Xu
Yanpei Guo
Weili Han
Hoon Wei Lim
Jin Song Dong
127
1
0
15 Nov 2025
A Metamorphic Testing Perspective on Knowledge Distillation for Language Models of Code: Does the Student Deeply Mimic the Teacher?
Md. Abdul Awal
Mrigank Rochan
Chanchal K. Roy
235
1
0
07 Nov 2025
Software Defined Vehicle Code Generation: A Few-Shot Prompting Approach
Quang-Dung Nguyen
Tri-Dung Tran
Thanh-Hieu Chu
Hoang-Loc Tran
Xiangwei Cheng
Dirk Slama
170
1
0
06 Nov 2025
Understanding Robustness of Model Editing in Code LLMs: An Empirical Study
Vinaik Chhetri
A.B. Siddique
Umar Farooq
KELM
205
0
0
05 Nov 2025
OMPILOT: Harnessing Transformer Models for Auto Parallelization to Shared Memory Computing Paradigms
Arijit Bhattacharjee
Ali TehraniJamsaz
Le Chen
N. Hasabnis
Mihai Capota
Nesreen K. Ahmed
Ali Jannesari
207
0
0
05 Nov 2025
Rescuing the Unpoisoned: Efficient Defense against Knowledge Corruption Attacks on RAG Systems
Minseok Kim
Hankook Lee
Hyungjoon Koo
AAML
SILM
254
2
0
03 Nov 2025
Context-Guided Decompilation: A Step Towards Re-executability
Xiaohan Wang
Yuxin Hu
Kevin Leach
152
1
0
03 Nov 2025
DRAMA: Unifying Data Retrieval and Analysis for Open-Domain Analytic Queries
Chuxuan Hu
Maxwell Yang
James Weiland
Yeji Lim
Suhas Palawala
Daniel Kang
165
0
0
31 Oct 2025
A Survey on Deep Text Hashing: Efficient Semantic Text Retrieval with Binary Representation
Liyang He
Zhenya Huang
Cheng Yang
Rui Li
Zheng Zhang
Kai Zhang
Zhi Li
Qi Liu
Enhong Chen
3DV
452
0
0
31 Oct 2025
On the Difficulty of Selecting Few-Shot Examples for Effective LLM-based Vulnerability Detection
Md Abdul Hannan
Ronghao Ni
Chi Zhang
Limin Jia
Ravi Mangal
Corina S. Pasareanu
187
0
0
31 Oct 2025
Autograder+: A Multi-Faceted AI Framework for Rich Pedagogical Feedback in Programming Education
Vikrant Sahu
Gagan Raj Gupta
Raghav Borikar
Nitin Mane
AI4Ed
453
0
0
30 Oct 2025
SecureReviewer: Enhancing Large Language Models for Secure Code Review through Secure-aware Fine-tuning
Fang Liu
Simiao Liu
Yinghao Zhu
Xiaoli Lian
Li Zhang
AAML
159
3
0
30 Oct 2025
MAGNET: A Multi-Graph Attentional Network for Code Clone Detection
Zixian Zhang
Takfarinas Saber
3DPC
221
0
0
28 Oct 2025
Wisdom and Delusion of LLM Ensembles for Code Generation and Repair
Fernando Vallecillos Ruiz
Max Hort
Leon Moonen
192
3
0
24 Oct 2025
Practical Code RAG at Scale: Task-Aware Retrieval Design Choices under Compute Budgets
Timur Galimzyanov
Olga Kolomyttseva
Egor Bogomolov
306
1
0
23 Oct 2025
SBAN: A Framework & Multi-Dimensional Dataset for Large Language Model Pre-Training and Software Code Mining
Hamed Jelodar
Mohammad Meymani
Samita Bai
Roozbeh Razavi-Far
Ali Ghorbani
273
3
0
21 Oct 2025
DuoLens: A Framework for Robust Detection of Machine-Generated Multilingual Text and Code
Shriyansh Agrawal
Aidan Lau
Sanyam Shah
Ahan M R
Kevin Zhu
Sunishchal Dev
Vasu Sharma
DeLMO
275
0
0
21 Oct 2025
Reasoning Distillation and Structural Alignment for Improved Code Generation
Amir Jalilifard
Anderson de Rezende Rocha
Marcos Medeiros Raimundo
OffRL
LRM
162
0
0
20 Oct 2025
HGAdapter: Hypergraph-based Adapters in Language Models for Code Summarization and Clone Detection
Guang Yang
Yujie Zhu
177
0
0
20 Oct 2025
SpecAgent: A Speculative Retrieval and Forecasting Agent for Code Completion
George Ma
Anurag Koul
Qi Chen
Y. Wu
Sachit Kuhar
Yu Yu
Aritra Sengupta
Varun Kumar
M. K. Ramanathan
166
0
0
20 Oct 2025
All You Need is One: Capsule Prompt Tuning with a Single Vector
Yiyang Liu
James Chenhao Liang
Heng Fan
Wenhao Yang
Yiming Cui
Xiaotian Han
Lifu Huang
Dongfang Liu
Qifan Wang
Cheng Han
VLM
198
5
0
19 Oct 2025
MLCPD: A Unified Multi-Language Code Parsing Dataset with Universal AST Schema
Jugal Gajjar
Kamalasankari Subramaniakuppusamy
134
0
0
18 Oct 2025
Selecting and Combining Large Language Models for Scalable Code Clone Detection
Muslim Chochlov
Gul Aftab Ahmed
James Vincent Patten
Yuanhua Han
Guoxian Lu
David Gregg
J. Buckley
186
0
0
17 Oct 2025
Rewiring Experts on the Fly:Continuous Rerouting for Better Online Adaptation in Mixture-of-Expert models
Guinan Su
Yanwu Yang
Li Shen
Lu Yin
Shiwei Liu
Jonas Geiping
MoE
KELM
244
2
0
16 Oct 2025
Beyond Function-Level Search: Repository-Aware Dual-Encoder Code Retrieval with Adversarial Verification
Aofan Liu
Shiyuan Song
Haoxuan Li
Cehao Yang
Yiyan Qi
174
1
0
16 Oct 2025
Signature in Code Backdoor Detection, how far are we?
Quoc Hung Le
Thanh Le-Cong
Bach Le
Bowen Xu
AAML
111
0
0
15 Oct 2025
Enhancing Neural Code Representation with Additional Context
Huy Nguyen
Christoph Treude
Patanamon Thongtanunam
149
0
0
14 Oct 2025
ContractEval: A Benchmark for Evaluating Contract-Satisfying Assertions in Code Generation
Soohan Lim
Joonghyuk Hahn
Hyunwoo Park
Sang-Ki Ko
Yo-Sub Han
ALM
273
0
0
14 Oct 2025
A Hierarchical Quantized Tokenization Framework for Task-Adaptive Graph Representation Learning
Yang Xiang
Li Fan
Chenke Yin
Chengtao Ji
169
0
0
14 Oct 2025
Scalable and Explainable Enterprise Knowledge Discovery Using Graph-Centric Hybrid Retrieval
Nilima Rao
Jagriti Srivastava
Pradeep Kumar Sharma
Hritvik Shrivastava
128
1
0
13 Oct 2025
Large Language Models Are Effective Code Watermarkers
Rui Xu
Jiawei Chen
Z. Yin
Cong Kong
Xinpeng Zhang
WaLM
244
0
0
13 Oct 2025
Evaluating Line-level Localization Ability of Learning-based Code Vulnerability Detection Models
Marco Pintore
Giorgio Piras
Angelo Sotgiu
Maura Pintor
Battista Biggio
AAML
124
0
0
13 Oct 2025
ECO: Enhanced Code Optimization via Performance-Aware Prompting for Code-LLMs
Su-Hyeon Kim
Joonghyuk Hahn
Sooyoung Cha
Yo-Sub Han
152
0
0
12 Oct 2025
The Hidden DNA of LLM-Generated JavaScript: Structural Patterns Enable High-Accuracy Authorship Attribution
Norbert Tihanyi
Bilel Cherif
Richard A. Dubniczky
M. Ferrag
Tamás Bisztray
DeLMO
449
1
0
12 Oct 2025
LLM Based Long Code Translation using Identifier Replacement
Manojit Chakraborty
Madhusudan Ghosh
Rishabh Gupta
121
0
0
10 Oct 2025
RAG4Tickets: AI-Powered Ticket Resolution via Retrieval-Augmented Generation on JIRA and GitHub Data
Mohammad Baqar
70
0
0
09 Oct 2025
Fortifying LLM-Based Code Generation with Graph-Based Reasoning on Secure Coding Practices
Rupam Patir
Keyan Guo
Haipeng Cai
Hongxin Hu
LRM
118
0
0
08 Oct 2025
1
2
3
4
...
20
21
22
Next
Page 1 of 22
Page
of 22
Go