Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1909.09436
Cited By
CodeSearchNet Challenge: Evaluating the State of Semantic Code Search
20 September 2019
Hamel Husain
Hongqiu Wu
Tiferet Gazit
Miltiadis Allamanis
Marc Brockschmidt
ELM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"CodeSearchNet Challenge: Evaluating the State of Semantic Code Search"
50 / 471 papers shown
Title
Enhancing Code Generation via Bidirectional Comment-Level Mutual Grounding
Yifeng Di
Tianyi Zhang
26
0
0
12 May 2025
SweRank: Software Issue Localization with Code Ranking
R. Reddy
Tarun Suresh
JaeHyeok Doo
Y. Liu
Xuan-Phi Nguyen
Yingbo Zhou
Semih Yavuz
Caiming Xiong
Heng Ji
Shafiq R. Joty
24
0
0
07 May 2025
BiGSCoder: State Space Model for Code Understanding
Shweta Verma
Abhinav Anand
Mira Mezini
Mamba
46
0
0
02 May 2025
An Empirical Study on the Effectiveness of Large Language Models for Binary Code Understanding
Xiuwei Shang
Zhenkan Fu
Shaoyin Cheng
Guoqiang Chen
Gangyang Li
Li Hu
W. Zhang
N. Yu
62
0
0
30 Apr 2025
Large Language Models are Qualified Benchmark Builders: Rebuilding Pre-Training Datasets for Advancing Code Intelligence Tasks
Kang Yang
Xinjun Mao
Shangwen Wang
Y. Wang
Tanghaoran Zhang
Bo Lin
Yihao Qin
Zhang Zhang
Yao Lu
Kamal Al-Sabahi
ALM
149
1
0
28 Apr 2025
Evaluate-and-Purify: Fortifying Code Language Models Against Adversarial Attacks Using LLM-as-a-Judge
Wenhan Mu
Ling Xu
Shuren Pei
Le Mi
Huichi Zhou
AAML
ELM
53
0
0
28 Apr 2025
NoEsis: Differentially Private Knowledge Transfer in Modular LLM Adaptation
Rob Romijnders
Stefanos Laskaridis
Ali Shahin Shamsabadi
Hamed Haddadi
64
0
0
25 Apr 2025
Towards Leveraging Large Language Model Summaries for Topic Modeling in Source Code
Michele Carissimi
Martina Saletta
C. Ferretti
39
0
0
24 Apr 2025
Give LLMs a Security Course: Securing Retrieval-Augmented Code Generation via Knowledge Injection
Bo Lin
Shangwen Wang
Yihao Qin
Liqian Chen
Xiaoguang Mao
SILM
31
0
0
23 Apr 2025
A Large-scale Class-level Benchmark Dataset for Code Generation with LLMs
Musfiqur Rahman
SayedHassan Khatoonabadi
Emad Shihab
ALM
36
0
0
22 Apr 2025
Manipulating Multimodal Agents via Cross-Modal Prompt Injection
Le Wang
Zonghao Ying
Tianyuan Zhang
Siyuan Liang
Shengshan Hu
Mingchuan Zhang
A. Liu
Xianglong Liu
AAML
33
1
0
19 Apr 2025
FreshStack: Building Realistic Benchmarks for Evaluating Retrieval on Technical Documents
Nandan Thakur
Jimmy J. Lin
Sam Havens
Michael Carbin
Omar Khattab
Andrew Drozdov
36
2
0
17 Apr 2025
AI-Driven Code Refactoring: Using Graph Neural Networks to Enhance Software Maintainability
Gopichand Bandarupalli
16
1
0
14 Apr 2025
The Code Barrier: What LLMs Actually Understand?
Serge Lionel Nikiema
Jordan Samhi
A. Kaboré
Jacques Klein
Tegawende F. Bissyande
ELM
27
1
0
14 Apr 2025
Code-Craft: Hierarchical Graph-Based Code Summarization for Enhanced Context Retrieval
David Sounthiraraj
Jared Hancock
Yassin Kortam
Ashok Javvaji
Prabhat Singh
Shaila Shankar
21
0
0
11 Apr 2025
Bringing Structure to Naturalness: On the Naturalness of ASTs
Profir-Petru Pârţachi
Mahito Sugiyama
27
0
0
11 Apr 2025
Towards an Understanding of Context Utilization in Code Intelligence
Yanlin Wang
Kefeng Duan
Dewu Zheng
Ensheng Shi
F. Zhang
...
Xilin Liu
Yuchi Ma
Hongyu Zhang
Qianxiang Wang
Zibin Zheng
29
0
0
11 Apr 2025
Zero-Shot Cross-Domain Code Search without Fine-Tuning
Keyu Liang
Z. Liu
Chao Liu
Zhiyuan Wan
David Lo
Xiaohu Yang
26
0
0
10 Apr 2025
DeCoMa: Detecting and Purifying Code Dataset Watermarks through Dual Channel Code Abstraction
Yuan Xiao
Yuchen Chen
Shiqing Ma
Haocheng Huang
Chunrong Fang
Y. Chen
Weisong Sun
Yunfeng Zhu
X. Zhang
Zhenyu Chen
31
0
0
09 Apr 2025
RETROcode: Leveraging a Code Database for Improved Natural Language to Code Generation
Nathanael Beau
Benoît Crabbé
23
0
0
08 Apr 2025
On Benchmarking Code LLMs for Android Malware Analysis
Yiling He
Hongyu She
Xingzhi Qian
Xinran Zheng
Zhuo Chen
Z. Qin
Lorenzo Cavallaro
ELM
50
1
0
01 Apr 2025
Training a Utility-based Retriever Through Shared Context Attribution for Retrieval-Augmented Language Models
Yilong Xu
Jinhua Gao
Xiaoming Yu
Yuanhai Xue
Baolong Bi
Huawei Shen
Xueqi Cheng
RALM
64
0
0
01 Apr 2025
Improving the Context Length and Efficiency of Code Retrieval for Tracing Security Vulnerability Fixes
Xueqing Liu
Jiangrui Zheng
Guanqun Yang
Siyan Wen
Qiushi Liu
48
0
0
29 Mar 2025
RustEvo^2: An Evolving Benchmark for API Evolution in LLM-based Rust Code Generation
Linxi Liang
Jing Gong
Mingwei Liu
Chong Wang
Guangsheng Ou
Yanlin Wang
Xin Peng
Zibin Zheng
ALM
64
0
0
21 Mar 2025
Large Language Models (LLMs) for Source Code Analysis: applications, models and datasets
Hamed Jelodar
Mohammad Meymani
Roozbeh Razavi-Far
42
0
0
21 Mar 2025
XOXO: Stealthy Cross-Origin Context Poisoning Attacks against AI Coding Assistants
Adam Storek
Mukur Gupta
Noopur Bhatt
Aditya Gupta
Janie Kim
Prashast Srivastava
Suman Jana
AAML
69
0
0
18 Mar 2025
CoDet-M4: Detecting Machine-Generated Code in Multi-Lingual, Multi-Generator and Multi-Domain Settings
Daniil Orel
Dilshod Azizov
Preslav Nakov
DeLMO
50
0
0
17 Mar 2025
OASIS: Order-Augmented Strategy for Improved Code Search
Zuchen Gao
Zizheng Zhan
Xianming Li
Erxin Yu
Haotian Zhang
Bin Chen
Yuqun Zhang
Jing Li
66
0
0
11 Mar 2025
ProjectEval: A Benchmark for Programming Agents Automated Evaluation on Project-Level Code Generation
Kaiyuan Liu
Youcheng Pan
J. Li
Daojing He
Yang Xiang
Yexing Du
Tianrun Gao
LLMAG
ELM
59
1
0
10 Mar 2025
Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Enhancement Protocol
Roham Koohestani
Philippe de Bekker
M. Izadi
VLM
45
0
0
07 Mar 2025
EuroBERT: Scaling Multilingual Encoders for European Languages
Nicolas Boizard
Hippolyte Gisserot-Boukhlef
Duarte M. Alves
André F. T. Martins
Ayoub Hammal
...
Maxime Peyrard
Nuno M. Guerreiro
Patrick Fernandes
Ricardo Rei
Pierre Colombo
122
1
0
07 Mar 2025
LoRACode: LoRA Adapters for Code Embeddings
Saumya Chaturvedi
Aman Chadha
Laurent Bindschaedler
63
0
0
07 Mar 2025
The Challenge of Identifying the Origin of Black-Box Large Language Models
Ziqing Yang
Yixin Wu
Yun Shen
Wei Dai
Michael Backes
Yang Zhang
AAML
42
0
0
06 Mar 2025
One Model to Train them All: Hierarchical Self-Distillation for Enhanced Early Layer Embeddings
Andrea Gurioli
Federico Pennino
João Monteiro
Maurizio Gabbrielli
46
0
0
04 Mar 2025
Multimodal Learning for Just-In-Time Software Defect Prediction in Autonomous Driving Systems
Faisal Mohammad
Duksan Ryu
64
0
0
28 Feb 2025
GNN-Coder: Boosting Semantic Code Retrieval with Combined GNNs and Transformer
Yufan Ye
Pu Pang
Ting Zhang
Hua Huang
71
0
0
24 Feb 2025
Code Summarization Beyond Function Level
Vladimir Makharev
Vladimir Ivanov
45
0
0
23 Feb 2025
Can LLMs Reason About Program Semantics? A Comprehensive Evaluation of LLMs on Formal Specification Inference
Thanh Le-Cong
Bach Le
Toby Murray
LRM
47
1
0
22 Feb 2025
Eliminating Backdoors in Neural Code Models for Secure Code Understanding
Weisong Sun
Yuchen Chen
Chunrong Fang
Yebo Feng
Yuan Xiao
An Guo
Quanjun Zhang
Yang Liu
Baowen Xu
Zhenyu Chen
AAML
111
1
0
21 Feb 2025
Show Me Your Code! Kill Code Poisoning: A Lightweight Method Based on Code Naturalness
Weisong Sun
Yuchen Chen
Mengzhe Yuan
Chunrong Fang
Zhenpeng Chen
Chong Wang
Yang Liu
Baowen Xu
Zhenyu Chen
AAML
36
1
0
20 Feb 2025
DeepRTL: Bridging Verilog Understanding and Generation with a Unified Representation Model
Yi Liu
Changran Xu
Yunhao Zhou
Z. Li
Qiang Xu
VLM
48
4
0
20 Feb 2025
Energy-Conscious LLM Decoding: Impact of Text Generation Strategies on GPU Energy Consumption
Alireza Nik
Michael A. Riegler
P. Halvorsen
46
0
0
17 Feb 2025
SURGE: On the Potential of Large Language Models as General-Purpose Surrogate Code Executors
Bohan Lyu
Siqiao Huang
Zichen Liang
Qi-An Sun
Jiaming Zhang
ELM
LRM
57
0
0
16 Feb 2025
SoK: Where to Fuzz? Assessing Target Selection Methods in Directed Fuzzing
Felix Weissberg
Jonas Moller
Tom Ganz
Erik Imgrund
Lukas Pirch
Lukas Seidel
Moritz Schloegel
Thorsten Eisenhofer
Konrad Rieck
99
3
0
12 Feb 2025
URECA: The Chain of Two Minimum Set Cover Problems exists behind Adaptation to Shifts in Semantic Code Search
Seok-Ung Choi
Joonghyuk Hahn
Yo-Sub Han
51
0
0
11 Feb 2025
CoDocBench: A Dataset for Code-Documentation Alignment in Software Maintenance
Kunal Suresh Pai
Premkumar Devanbu
Toufique Ahmed
63
1
0
01 Feb 2025
From Critique to Clarity: A Pathway to Faithful and Personalized Code Explanations with Large Language Models
Zexing Xu
Zhuang Luo
Yichuan Li
Kyumin Lee
S. Rasoul Etesami
38
0
0
28 Jan 2025
How Should We Build A Benchmark? Revisiting 274 Code-Related Benchmarks For LLMs
Jialun Cao
Yuk-Kit Chan
Zixuan Ling
Wenxuan Wang
Shuqing Li
...
Pinjia He
Shuai Wang
Zibin Zheng
Michael R. Lyu
S. Cheung
ALM
69
1
0
18 Jan 2025
AllSpark: A Multimodal Spatio-Temporal General Intelligence Model with Ten Modalities via Language as a Reference Framework
Run Shao
Cheng Yang
Qiujun Li
Qing Zhu
Yongjun Zhang
...
Yu Liu
Yong Tang
Dapeng Liu
Shizhong Yang
Haifeng Li
111
1
0
08 Jan 2025
CodeIP: A Grammar-Guided Multi-Bit Watermark for Large Language Models of Code
Batu Guan
Yao Wan
Zhangqian Bi
Zheng Wang
Hongyu Zhang
Yulei Sui
Pan Zhou
37
8
0
31 Dec 2024
1
2
3
4
...
8
9
10
Next