1909.09436
CodeSearchNet Challenge: Evaluating the State of Semantic Code Search
20 September 2019
Hamel Husain
Hongqiu Wu
Tiferet Gazit
Miltiadis Allamanis
Marc Brockschmidt
ELM
Papers citing
"CodeSearchNet Challenge: Evaluating the State of Semantic Code Search"
50 / 472 papers shown
Title
Senatus -- A Fast and Accurate Code-to-Code Recommendation Engine
Fran Silavong
Sean J. Moran
Antonios Georgiadis
Rohan Saphal
R. Otter
20
9
0
05 Nov 2021
GraphSearchNet: Enhancing GNNs via Capturing Global Dependencies for Semantic Code Search
Shangqing Liu
Xiaofei Xie
J. Siow
L. Ma
Guozhu Meng
Yang Liu
GNN
23
53
0
04 Nov 2021
Text Classification for Task-based Source Code Related Questions
Sairamvinay Vijayaraghavan
Jinxiao Song
David A. Tomassi
Siddhartha Punj
Jailan Sabet
19
0
0
31 Oct 2021
CoProtector: Protect Open-Source Code against Unauthorized Training Usage with Data Poisoning
Zhensu Sun
Xiaoning Du
Fu Song
Mingze Ni
Li Li
28
67
0
25 Oct 2021
AugmentedCode: Examining the Effects of Natural Language Resources in Code Retrieval Models
M. Bahrami
N. Shrikanth
Yuji Mizobuchi
Lei Liu
M. Fukuyori
Wei-Peng Chen
Kazuki Munakata
26
3
0
16 Oct 2021
Cascaded Fast and Slow Models for Efficient Semantic Code Search
Akhilesh Deepak Gotmare
Junnan Li
Shafiq R. Joty
S. Hoi
33
10
0
15 Oct 2021
Using Document Similarity Methods to create Parallel Datasets for Code Translation
Mayank Agarwal
Kartik Talamadupula
Fernando Martinez
Stephanie Houde
Michael J. Muller
John T. Richards
Steven I. Ross
Justin D. Weisz
SyDa
31
8
0
11 Oct 2021
Long-Range Modeling of Source Code Files with eWASH: Extended Window Access by Syntax Hierarchy
Colin B. Clement
Shuai Lu
Xiaoyu Liu
Michele Tufano
Dawn Drain
Nan Duan
Neel Sundaresan
Alexey Svyatkovskiy
16
27
0
17 Sep 2021
CodeQA: A Question Answering Dataset for Source Code Comprehension
Chenxiao Liu
Xiaojun Wan
37
27
0
17 Sep 2021
Context-NER : Contextual Phrase Generation at Scale
Himanshu Gupta
Shreyas Verma
Santosh Mashetty
Swaroop Mishra
19
10
0
16 Sep 2021
Can Machines Read Coding Manuals Yet? -- A Benchmark for Building Better Language Models for Code Understanding
Ibrahim Abdelaziz
Julian T Dolby
Jamie McCusker
Kavitha Srinivas
ELM
36
5
0
15 Sep 2021
CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation
Yue Wang
Weishi Wang
Shafiq R. Joty
S. Hoi
235
1,489
0
02 Sep 2021
Lyra: A Benchmark for Turducken-Style Code Generation
Qingyuan Liang
Zeyu Sun
Qihao Zhu
Wenjie Zhang
Lian Yu
Yingfei Xiong
Lu Zhang
11
13
0
27 Aug 2021
Retrieval Augmented Code Generation and Summarization
Md. Rizwan Parvez
W. Ahmad
Saikat Chakraborty
Baishakhi Ray
Kai-Wei Chang
26
183
0
26 Aug 2021
What do pre-trained code models know about code?
Anjan Karmakar
Romain Robbes
ELM
24
87
0
25 Aug 2021
Program Synthesis with Large Language Models
Jacob Austin
Augustus Odena
Maxwell Nye
Maarten Bosma
Henryk Michalewski
...
Ellen Jiang
Carrie J. Cai
Michael Terry
Quoc V. Le
Charles Sutton
ELM
AIMat
ReCod
ALM
30
1,743
0
16 Aug 2021
Natural Language-Guided Programming
Geert Heyman
Rafael Huysegems
P. Justen
Tom Van Cutsem
21
12
0
11 Aug 2021
SynCoBERT: Syntax-Guided Multi-Modal Contrastive Pre-Training for Code Representation
Xin Wang
Yasheng Wang
Fei Mi
Pingyi Zhou
Yao Wan
Xiao Liu
Li Li
Hao Wu
Jin Liu
Xin Jiang
31
111
0
10 Aug 2021
Distilling Transformers for Neural Cross-Domain Search
Colin B. Clement
Chen Henry Wu
Dawn Drain
Neel Sundaresan
23
1
0
06 Aug 2021
Dialogue Management for Interactive API Search
Zachary Eberhart
Collin McMillan
14
4
0
26 Jul 2021
ProtoTransformer: A Meta-Learning Approach to Providing Student Feedback
Mike Wu
Noah D. Goodman
Chris Piech
Chelsea Finn
24
19
0
23 Jul 2021
On the Evaluation of Neural Code Summarization
Ensheng Shi
Yanlin Wang
Lun Du
Junjie Chen
Shi Han
Hongyu Zhang
Dongmei Zhang
Hongbin Sun
ELM
119
86
0
15 Jul 2021
DeepMutants: Training neural bug detectors with contextual mutations
Cedric Richter
Heike Wehrheim
19
3
0
14 Jul 2021
Is a Single Model Enough? MuCoS: A Multi-Model Ensemble Learning for Semantic Code Search
Lun Du
Xiaozhou Shi
Yanlin Wang
Ensheng Shi
Shi Han
Dongmei Zhang
24
4
0
10 Jul 2021
Evaluating Large Language Models Trained on Code
Mark Chen
Jerry Tworek
Heewoo Jun
Qiming Yuan
Henrique Pondé
...
Bob McGrew
Dario Amodei
Sam McCandlish
Ilya Sutskever
Wojciech Zaremba
ELM
ALM
57
5,055
0
07 Jul 2021
Multimodal Representation for Neural Code Search
Jian Gu
Zimin Chen
Martin Monperrus
13
43
0
02 Jul 2021
Cross-Lingual Transfer Learning for Statistical Type Inference
Zhiming Li
Xiaofei Xie
Haoliang Li
Zhengzi Xu
Yi Li
Yang Liu
8
2
0
01 Jul 2021
Memorization and Generalization in Neural Code Intelligence Models
Md Rafiqul Islam Rabin
Aftab Hussain
Mohammad Amin Alipour
Vincent J. Hellendoorn
TDI
35
40
0
16 Jun 2021
Code to Comment Translation: A Comparative Study on Model Effectiveness & Errors
Junayed Mahmud
Fahim Faisal
Raihan Islam Arnob
Antonios Anastasopoulos
Kevin Moran
19
20
0
15 Jun 2021
Programming Puzzles
Tal Schuster
A. Kalyan
Oleksandr Polozov
Adam Tauman Kalai
ELM
15
32
0
10 Jun 2021
Reading StackOverflow Encourages Cheating: Adding Question Text Improves Extractive Code Generation
Gabriel Orlanski
Alex Gittens
29
20
0
08 Jun 2021
CommitBERT: Commit Message Generation Using Pre-Trained Programming Language Model
Tae-Hwan Jung
VLM
22
28
0
29 May 2021
CoDesc: A Large Code-Description Parallel Dataset
Masum Hasan
Tanveer Muttaqueen
Abdullah Al Ishtiaq
Kazi Sajeed Mehrab
Md. Mahim Anjum Haque
Tahmid Hasan
Wasi Uddin Ahmad
Anindya Iqbal
Rifat Shahriyar
11
30
0
29 May 2021
CoSQA: 20,000+ Web Queries for Code Search and Question Answering
Junjie Huang
Duyu Tang
Linjun Shou
Ming Gong
Ke Xu
Daxin Jiang
Ming Zhou
Nan Duan
28
111
0
27 May 2021
CodeNet: A Large-Scale AI for Code Dataset for Learning a Diversity of Coding Tasks
Ruchi Puri
David S. Kung
G. Janssen
Wei Zhang
Giacomo Domeniconi
...
Saurabh Pujar
Shyam Ramji
Ulrich Finkler
Susan Malaika
Frederick Reiss
29
225
0
25 May 2021
DeepDebug: Fixing Python Bugs Using Stack Traces, Backtranslation, and Code Skeletons
Dawn Drain
Colin B. Clement
Guillermo Serrato
Neel Sundaresan
17
31
0
19 May 2021
CoTexT: Multi-task Learning with Code-Text Transformer
Long Phan
H. Tran
Daniel Le
Hieu Duy Nguyen
J. Anibal
Alec Peltekian
Yanfang Ye
19
135
0
18 May 2021
Shellcode_IA32: A Dataset for Automatic Shellcode Generation
Pietro Liguori
Erfan Al-Hossami
Domenico Cotroneo
R. Natella
B. Cukic
Samira Shaikh
34
27
0
27 Apr 2021
BERT2Code: Can Pretrained Language Models be Leveraged for Code Search?
Abdullah Al Ishtiaq
Masum Hasan
Md. Mahim Anjum Haque
Kazi Sajeed Mehrab
Tanveer Muttaqueen
Tahmid Hasan
Anindya Iqbal
Rifat Shahriyar
6
5
0
16 Apr 2021
ProphetNet-X: Large-Scale Pre-training Models for English, Chinese, Multi-lingual, Dialog, and Code Generation
Weizhen Qi
Yeyun Gong
Yu Yan
Can Xu
Bolun Yao
...
Daxin Jiang
Jiusheng Chen
Ruofei Zhang
Houqiang Li
Nan Duan
26
51
0
16 Apr 2021
CodeTrans: Towards Cracking the Language of Silicon's Code Through Self-Supervised Deep Learning and High Performance Computing
Ahmed Elnaggar
Wei Ding
Llion Jones
Tom Gibbs
Tamas B. Fehér
Christoph Angerer
Silvia Severini
Florian Matthes
B. Rost
9
72
0
06 Apr 2021
HAConvGNN: Hierarchical Attention Based Convolutional Graph Neural Network for Code Documentation Generation in Jupyter Notebooks
Xuye Liu
Dakuo Wang
A. Wang
Yufang Hou
Lingfei Wu
22
23
0
31 Mar 2021
deGraphCS: Embedding Variable-based Flow Graph for Neural Code Search
Chen Zeng
Yue Yu
Shanshan Li
Xin Xia
Zhiming Wang
Mingyang Geng
Linxiao Bai
Wei Dong
Xiangke Liao
GNN
31
36
0
24 Mar 2021
Language-Agnostic Representation Learning of Source Code from Structure and Context
Daniel Zügner
Tobias Kirschstein
Michele Catasta
J. Leskovec
Stephan Günnemann
30
119
0
21 Mar 2021
API2Com: On the Improvement of Automatically Generated Code Comments Using API Documentations
Ramin Shahbazi
Rishab Sharma
Fatemeh H. Fard
19
25
0
19 Mar 2021
Unified Pre-training for Program Understanding and Generation
Wasi Uddin Ahmad
Saikat Chakraborty
Baishakhi Ray
Kai-Wei Chang
18
749
0
10 Mar 2021
NeurIPS 2020 NLC2CMD Competition: Translating Natural Language to Bash Commands
Mayank Agarwal
Tathagata Chakraborti
Quchen Fu
David Gros
Xi Victoria Lin
Jaron Maene
Kartik Talamadupula
Zhongwei Teng
Jules White
16
15
0
03 Mar 2021
Simplified Data Wrangling with ir_datasets
Sean MacAvaney
Andrew Yates
Sergey Feldman
Doug Downey
Arman Cohan
Nazli Goharian
19
108
0
03 Mar 2021
Neural Code Summarization
Piyush Shrivastava
20
2
0
26 Feb 2021
Automatic Code Generation using Pre-Trained Language Models
Luis Perez
Lizi Ottens
Sudharshan Viswanathan
SyDa
ALM
16
22
0
21 Feb 2021