ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1709.06182
  4. Cited By
A Survey of Machine Learning for Big Code and Naturalness
v1v2 (latest)

A Survey of Machine Learning for Big Code and Naturalness

18 September 2017
Miltiadis Allamanis
Earl T. Barr
Premkumar T. Devanbu
Charles Sutton
ArXiv (abs)PDFHTML

Papers citing "A Survey of Machine Learning for Big Code and Naturalness"

50 / 298 papers shown
Title
Code Sophistication: From Code Recommendation to Logic Recommendation
Code Sophistication: From Code Recommendation to Logic Recommendation
Jessie Galasso
Michalis Famelis
H. Sahraoui
146
1
0
19 Jan 2022
Borrowing from Similar Code: A Deep Learning NLP-Based Approach for Log
  Statement Automation
Borrowing from Similar Code: A Deep Learning NLP-Based Approach for Log Statement Automation
Sina Gholamian
Paul A. S. Ward
104
3
0
02 Dec 2021
Leveraging Unsupervised Learning to Summarize APIs Discussed in Stack
  Overflow
Leveraging Unsupervised Learning to Summarize APIs Discussed in Stack OverflowIEEE Working Conference on Source Code Analysis and Manipulation (SCAM), 2021
AmirHossein Naghshzan
Latifa Guerrouj
Olga Baysal
141
18
0
27 Nov 2021
Senatus -- A Fast and Accurate Code-to-Code Recommendation Engine
Senatus -- A Fast and Accurate Code-to-Code Recommendation Engine
Fran Silavong
Sean J. Moran
Antonios Georgiadis
Rohan Saphal
R. Otter
113
11
0
05 Nov 2021
Code2Snapshot: Using Code Snapshots for Learning Representations of
  Source Code
Code2Snapshot: Using Code Snapshots for Learning Representations of Source CodeInternational Conference on Machine Learning and Applications (ICMLA), 2021
Md Rafiqul Islam Rabin
Mohammad Amin Alipour
200
5
0
01 Nov 2021
A Survey on Machine Learning Techniques for Source Code Analysis
A Survey on Machine Learning Techniques for Source Code Analysis
Tushar Sharma
M. Kechagia
Stefanos Georgiou
Rohit Tiwari
Indira Vats
Hadi Moazen
Federica Sarro
226
72
0
18 Oct 2021
Using Document Similarity Methods to create Parallel Datasets for Code
  Translation
Using Document Similarity Methods to create Parallel Datasets for Code Translation
Mayank Agarwal
Kartik Talamadupula
Fernando Martinez
Stephanie Houde
Michael J. Muller
John T. Richards
Steven I. Ross
Justin D. Weisz
SyDa
110
9
0
11 Oct 2021
Capturing Structural Locality in Non-parametric Language Models
Capturing Structural Locality in Non-parametric Language Models
Frank F. Xu
Junxian He
Graham Neubig
Vincent J. Hellendoorn
266
14
0
06 Oct 2021
Learning to Superoptimize Real-world Programs
Learning to Superoptimize Real-world Programs
Alex Shypula
Pengcheng Yin
Jeremy Lacomis
Claire Le Goues
Edward N. Schwartz
Graham Neubig
NAI
307
10
0
28 Sep 2021
Self-Supervised Learning to Prove Equivalence Between Straight-Line
  Programs via Rewrite Rules
Self-Supervised Learning to Prove Equivalence Between Straight-Line Programs via Rewrite Rules
Steve Kommrusch
Monperrus Martin
L. Pouchet
246
10
0
22 Sep 2021
CompilerGym: Robust, Performant Compiler Optimization Environments for
  AI Research
CompilerGym: Robust, Performant Compiler Optimization Environments for AI Research
Chris Cummins
Bram Wasti
Jiadong Guo
Brandon Cui
Jason Ansel
...
Jia-Wei Liu
O. Teytaud
Benoit Steiner
Yuandong Tian
Hugh Leather
224
99
0
17 Sep 2021
Leveraging Code Clones and Natural Language Processing for Log Statement
  Prediction
Leveraging Code Clones and Natural Language Processing for Log Statement PredictionInternational Conference on Automated Software Engineering (ASE), 2021
Sina Gholamian
76
10
0
08 Sep 2021
Program Merge Conflict Resolution via Neural Transformers
Program Merge Conflict Resolution via Neural Transformers
Alexey Svyatkovskiy
Sarah Fakhoury
Negar Ghorbani
Todd Mytkowicz
Elizabeth Dinella
Christian Bird
Jinu Jang
Neel Sundaresan
Shuvendu K. Lahiri
MoMe
239
30
0
31 Aug 2021
Retrieval Augmented Code Generation and Summarization
Retrieval Augmented Code Generation and SummarizationConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Md. Rizwan Parvez
W. Ahmad
Saikat Chakraborty
Baishakhi Ray
Kai-Wei Chang
244
232
0
26 Aug 2021
Learning GraphQL Query Costs (Extended Version)
Learning GraphQL Query Costs (Extended Version)
Georgios Mavroudeas
Guillaume Baudart
Alan Cha
Martin Hirzel
Jim Laredo
M. Magdon-Ismail
Louis Mandel
Erik Wittern
141
3
0
25 Aug 2021
Program Synthesis with Large Language Models
Program Synthesis with Large Language Models
Jacob Austin
Augustus Odena
Maxwell Nye
Maarten Bosma
Henryk Michalewski
...
Ellen Jiang
Carrie J. Cai
Michael Terry
Quoc V. Le
Charles Sutton
ELMAIMatReCodALM
402
2,808
0
16 Aug 2021
ProtoTransformer: A Meta-Learning Approach to Providing Student Feedback
ProtoTransformer: A Meta-Learning Approach to Providing Student Feedback
Mike Wu
Noah D. Goodman
Chris Piech
Chelsea Finn
239
22
0
23 Jul 2021
DeepMutants: Training neural bug detectors with contextual mutations
DeepMutants: Training neural bug detectors with contextual mutations
Cedric Richter
Heike Wehrheim
144
3
0
14 Jul 2021
Memorization and Generalization in Neural Code Intelligence Models
Memorization and Generalization in Neural Code Intelligence Models
Md Rafiqul Islam Rabin
Aftab Hussain
Mohammad Amin Alipour
Vincent J. Hellendoorn
TDI
229
48
0
16 Jun 2021
Assessing the Effectiveness of Syntactic Structure to Learn Code Edit
  Representations
Assessing the Effectiveness of Syntactic Structure to Learn Code Edit Representations
Syed Arbaaz Qureshi
Sonu Mehta
Ranjita Bhagwan
Rahul Kumar
139
3
0
11 Jun 2021
Energy-Based Models for Code Generation under Compilability Constraints
Energy-Based Models for Code Generation under Compilability Constraints
Tomasz Korbak
Hady ElSahar
Marc Dymetman
Germán Kruszewski
215
14
0
09 Jun 2021
Understanding Neural Code Intelligence Through Program Simplification
Understanding Neural Code Intelligence Through Program Simplification
Md Rafiqul Islam Rabin
Vincent J. Hellendoorn
Mohammad Amin Alipour
AAML
233
69
0
07 Jun 2021
Proving Equivalence Between Complex Expressions Using Graph-to-Sequence
  Neural Models
Proving Equivalence Between Complex Expressions Using Graph-to-Sequence Neural Models
Steven J Kommrusch
Théo Barollet
L. Pouchet
113
6
0
01 Jun 2021
Learning to Extend Program Graphs to Work-in-Progress Code
Learning to Extend Program Graphs to Work-in-Progress Code
Xuechen Li
Chris J. Maddison
Daniel Tarlow
168
2
0
28 May 2021
CoSQA: 20,000+ Web Queries for Code Search and Question Answering
CoSQA: 20,000+ Web Queries for Code Search and Question AnsweringAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Junjie Huang
Duyu Tang
Linjun Shou
Ming Gong
Ke Xu
Daxin Jiang
Ming Zhou
Nan Duan
266
137
0
27 May 2021
Directed Acyclic Graph Network for Conversational Emotion Recognition
Directed Acyclic Graph Network for Conversational Emotion RecognitionAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Weizhou Shen
Siyue Wu
Yunyi Yang
Xiaojun Quan
306
294
0
27 May 2021
Self-Supervised Bug Detection and Repair
Self-Supervised Bug Detection and RepairNeural Information Processing Systems (NeurIPS), 2021
Miltiadis Allamanis
Henry Jackson-Flux
Marc Brockschmidt
250
127
0
26 May 2021
CodeNet: A Large-Scale AI for Code Dataset for Learning a Diversity of
  Coding Tasks
CodeNet: A Large-Scale AI for Code Dataset for Learning a Diversity of Coding Tasks
Ruchi Puri
David S. Kung
G. Janssen
Wei Zhang
Giacomo Domeniconi
...
Saurabh Pujar
Shyam Ramji
Ulrich Finkler
Susan Malaika
Frederick Reiss
183
321
0
25 May 2021
Neural Transfer Learning for Repairing Security Vulnerabilities in C
  Code
Neural Transfer Learning for Repairing Security Vulnerabilities in C CodeIEEE Transactions on Software Engineering (TSE), 2021
Zimin Chen
Steve Kommrusch
Monperrus Martin
251
153
0
16 Apr 2021
Perfection Not Required? Human-AI Partnerships in Code Translation
Perfection Not Required? Human-AI Partnerships in Code TranslationInternational Conference on Intelligent User Interfaces (IUI), 2021
Justin D. Weisz
Michael J. Muller
Stephanie Houde
John T. Richards
Steven I. Ross
Fernando Martinez
Mayank Agarwal
Kartik Talamadupula
187
152
0
08 Apr 2021
Variable Name Recovery in Decompiled Binary Code using Constrained
  Masked Language Modeling
Variable Name Recovery in Decompiled Binary Code using Constrained Masked Language Modeling
Pratyay Banerjee
Kuntal Kumar Pal
Fish Wang
Chitta Baral
155
17
0
23 Mar 2021
Project-Level Encoding for Neural Source Code Summarization of
  Subroutines
Project-Level Encoding for Neural Source Code Summarization of SubroutinesIEEE International Conference on Program Comprehension (ICPC), 2021
Aakash Bansal
S. Haque
Collin McMillan
144
55
0
22 Mar 2021
API2Com: On the Improvement of Automatically Generated Code Comments
  Using API Documentations
API2Com: On the Improvement of Automatically Generated Code Comments Using API DocumentationsIEEE International Conference on Program Comprehension (ICPC), 2021
Ramin Shahbazi
Rishab Sharma
Fatemeh H. Fard
196
31
0
19 Mar 2021
Generating Adversarial Computer Programs using Optimized Obfuscations
Generating Adversarial Computer Programs using Optimized ObfuscationsInternational Conference on Learning Representations (ICLR), 2021
Shashank Srikant
Sijia Liu
Tamara Mitrovska
Shiyu Chang
Quanfu Fan
Gaoyuan Zhang
Una-May O’Reilly
AAML
194
55
0
18 Mar 2021
Code Completion by Modeling Flattened Abstract Syntax Trees as Graphs
Code Completion by Modeling Flattened Abstract Syntax Trees as GraphsAAAI Conference on Artificial Intelligence (AAAI), 2021
Yanlin Wang
Hui Li
191
93
0
17 Mar 2021
Embedding Code Contexts for Cryptographic API Suggestion:New
  Methodologies and Comparisons
Embedding Code Contexts for Cryptographic API Suggestion:New Methodologies and Comparisons
Ya Xiao
Salman Ahmed
Wen-Kai Song
Xinyang Ge
Bimal Viswanath
D. Yao
111
5
0
15 Mar 2021
Mining Program Properties From Neural Networks Trained on Source Code
  Embeddings
Mining Program Properties From Neural Networks Trained on Source Code Embeddings
Martina Saletta
C. Ferretti
61
1
0
09 Mar 2021
NeurIPS 2020 NLC2CMD Competition: Translating Natural Language to Bash
  Commands
NeurIPS 2020 NLC2CMD Competition: Translating Natural Language to Bash CommandsNeural Information Processing Systems (NeurIPS), 2021
Mayank Agarwal
Tathagata Chakraborti
Quchen Fu
David Gros
Xi Lin
Jaron Maene
Kartik Talamadupula
Zhongwei Teng
Jules White
181
16
0
03 Mar 2021
Learning to Make Compiler Optimizations More Effective
Learning to Make Compiler Optimizations More Effective
Rahim Mammadli
Marija Selakovic
F. Wolf
Michael Pradel
199
18
0
24 Feb 2021
Automatic Code Generation using Pre-Trained Language Models
Automatic Code Generation using Pre-Trained Language Models
Luis Perez
Lizi Ottens
Sudharshan Viswanathan
SyDaALM
102
26
0
21 Feb 2021
A Survey of Machine Learning for Computer Architecture and Systems
A Survey of Machine Learning for Computer Architecture and SystemsACM Computing Surveys (CSUR), 2021
Nan Wu
Yuan Xie
AI4TSAI4CE
224
183
0
16 Feb 2021
PalmTree: Learning an Assembly Language Model for Instruction Embedding
PalmTree: Learning an Assembly Language Model for Instruction EmbeddingConference on Computer and Communications Security (CCS), 2021
Xuezixiang Li
Qu Yu
Heng Yin
261
191
0
21 Jan 2021
Content-Based Textual File Type Detection at Scale
Content-Based Textual File Type Detection at ScaleInternational Conference on Machine Learning and Computing (ICMLC), 2021
Francesca Del Bonifro
M. Gabbrielli
Stefano Zacchiroli
66
4
0
21 Jan 2021
Directed Acyclic Graph Neural Networks
Directed Acyclic Graph Neural NetworksInternational Conference on Learning Representations (ICLR), 2021
Veronika Thost
Jie Chen
GNNAI4CE
652
127
0
20 Jan 2021
Advances in Electron Microscopy with Deep Learning
Advances in Electron Microscopy with Deep Learning
Jeffrey M. Ede
624
3
0
04 Jan 2021
Stack-based Buffer Overflow Detection using Recurrent Neural Networks
Stack-based Buffer Overflow Detection using Recurrent Neural Networks
W. A. Dahl
L. Erdődi
Fabio Massimo Zennaro
152
16
0
30 Dec 2020
Trex: Learning Execution Semantics from Micro-Traces for Binary
  Similarity
Trex: Learning Execution Semantics from Micro-Traces for Binary Similarity
Kexin Pei
Zhou Xuan
Junfeng Yang
Suman Jana
Baishakhi Ray
410
111
0
16 Dec 2020
Deep Data Flow Analysis
Deep Data Flow Analysis
Chris Cummins
Hugh Leather
Zacharias V. Fisches
Tal Ben-Nun
Torsten Hoefler
Michael F. P. O'Boyle
125
6
0
21 Nov 2020
GRAPHSPY: Fused Program Semantic-Level Embedding via Graph Neural
  Networks for Dead Store Detection
GRAPHSPY: Fused Program Semantic-Level Embedding via Graph Neural Networks for Dead Store Detection
Yixin Guo
Pengcheng Li
Yingwei Luo
Xiaolin Wang
Zhenlin Wang
GNN
81
1
0
18 Nov 2020
Neural Software Analysis
Neural Software AnalysisCommunications of the ACM (CACM), 2020
Michael Pradel
S. Chandra
NAI
215
36
0
16 Nov 2020
Previous
123456
Next