A Survey of Machine Learning for Big Code and Naturalness

18 September 2017
Miltiadis Allamanis
Earl T. Barr
Premkumar T. Devanbu
Charles Sutton
arXiv:1709.06182

Papers citing "A Survey of Machine Learning for Big Code and Naturalness"

50 / 298 papers shown
Linguacodus: A Synergistic Framework for Transformative Code Generation in Machine Learning Pipelines
PeerJ Computer Science (PeerJ Comput. Sci.), 2024
Ekaterina Trofimova
Emil Sataev
Andrey E. Ustyuzhanin
286
0
0
18 Mar 2024
Studying LLM Performance on Closed- and Open-source Data
Toufique Ahmed
Christian Bird
Prem Devanbu
Saikat Chakraborty
228
17
0
23 Feb 2024
Efficient and Universal Watermarking for LLM-Generated Code Detection
Boquan Li
Mengdi Zhang
Peixin Zhang
Jun Sun
Xingmei Wang
WaLM
571
3
0
12 Feb 2024
On the Effectiveness of Machine Learning-based Call Graph Pruning: An Empirical Study
IEEE Working Conference on Mining Software Repositories (MSR), 2024
A. Mir
Mehdi Keshani
Sebastian Proksch
126
3
0
11 Feb 2024
CONCORD: Towards a DSL for Configurable Graph Code Representation
M. Saad
Tushar Sharma
243
2
0
31 Jan 2024
Evaluation of large language models for assessing code maintainability
Marc Dillmann
Julien Siebert
Adam Trendowicz
150
5
0
23 Jan 2024
Assessing the Latent Automated Program Repair Capabilities of Large Language Models using Round-Trip Translation
ACM Transactions on Software Engineering and Methodology (TOSEM), 2024
Fernando Vallecillos Ruiz
Anastasiia Grishina
Max Hort
Leon Moonen
LRM
272
6
0
15 Jan 2024
Deep Learning for Code Intelligence: Survey, Benchmark and Toolkit
ACM Computing Surveys (ACM Comput. Surv.), 2023
Yao Wan
Yang He
Zhangqian Bi
Jianguo Zhang
Hongyu Zhang
Yulei Sui
Guandong Xu
Hai Jin
Philip S. Yu
268
40
0
30 Dec 2023
The Next 700 ML-Enabled Compiler Optimizations
S. VenkataKeerthy
Siddharth Jain
Umesh Kalvakuntla
Pranav Sai Gorantla
R. Chitale
E. Brevdo
Albert Cohen
Mircea Trofin
Ramakrishna Upadrasta
128
5
0
17 Nov 2023
Learning Generalizable Program and Architecture Representations for Performance Modeling
International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2023
Lingda Li
T. Flynn
A. Hoisie
181
5
0
25 Oct 2023
Automatic Unit Test Data Generation and Actor-Critic Reinforcement Learning for Code Synthesis
P. Gorinski
Matthieu Zimmer
Gerasimos Lampouras
Derrick-Goh-Xin Deik
Ignacio Iacobacci
ALM OffRL
171
5
0
20 Oct 2023
Large Language Model-Aware In-Context Learning for Code Generation
Jia Li
Ge Li
Chongyang Tao
Jia Li
Huangzhao Zhang
Fang Liu
Zhi Jin
162
59
0
15 Oct 2023
GNNX-BENCH: Unravelling the Utility of Perturbation-based GNN Explainers through In-depth Benchmarking
International Conference on Learning Representations (ICLR), 2023
Mert Kosan
S. Verma
Burouj Armgaan
Khushbu Pahwa
Ambuj K. Singh
Sourav Medya
Jignesh M. Patel
374
19
0
03 Oct 2023
Pop Quiz! Do Pre-trained Code Models Possess Knowledge of Correct API Names?
Terry Yue Zhuo
Xiaoning Du
Zhenchang Xing
Jiamou Sun
Haowei Quan
Li Li
Liming Zhu
193
2
0
14 Sep 2023
Distilled GPT for Source Code Summarization
International Conference on Automated Software Engineering (ASE), 2023
Chia-Yi Su
Collin McMillan
243
53
0
28 Aug 2023
EditSum: A Retrieve-and-Edit Framework for Source Code Summarization
International Conference on Automated Software Engineering (ASE), 2021
Jia Li
Yongming Li
Ge Li
Xing Hu
Xin Xia
Zhi Jin
200
81
0
26 Aug 2023
SoTaNa: The Open-Source Software Development Assistant
Ensheng Shi
Fengji Zhang
Yanlin Wang
B. Chen
Lun Du
Hongyu Zhang
Shi Han
Dongmei Zhang
Hongbin Sun
200
13
0
25 Aug 2023
RatGPT: Turning online LLMs into Proxies for Malware Attacks
Mika Beckerich
L. Plein
Sergio Coronado
SILM
131
38
0
17 Aug 2023
AST-MHSA : Code Summarization using Multi-Head Self-Attention
Y. Nagaraj
U. Gupta
134
1
0
10 Aug 2023
Understanding User Intent Modeling for Conversational Recommender Systems: A Systematic Literature Review
Siamak Farshidi
Kiyan Rezaee
Sara Mazaheri
Amir Rahimi
Ali Dadashzadeh
Morteza Ziabakhsh
S. Eskandari
S. Jansen
138
11
0
05 Aug 2023
Statement-based Memory for Neural Source Code Summarization
Aakash Bansal
Siyuan Jiang
S. Haque
Collin McMillan
127
3
0
21 Jul 2023
COMEX: A Tool for Generating Customized Source Code Representations
International Conference on Automated Software Engineering (ASE), 2023
Debeshee Das
N. Mathews
Alex Mathai
Srikanth G. Tamilselvam
Kranthi Sedamaki
S. Chimalakonda
Atul Kumar
VLM
118
8
0
10 Jul 2023
Natural Language Generation and Understanding of Big Code for AI-Assisted Programming: A Review
Entropy (Entropy), 2023
M. Wong
Shangxin Guo
Ching Nam Hang
Siu-Wai Ho
C. Tan
219
121
0
04 Jul 2023
ARIST: An Effective API Argument Recommendation Approach
Journal of Systems and Software (JSS), 2023
Son Nguyen
C. T. Manh
T. Tran
Tan M. Nguyen
Thu-Trang Nguyen
Kien-Tuan Ngo
H. Vo
134
14
0
11 Jun 2023
Large Language Models of Code Fail at Completing Code with Potential Bugs
Neural Information Processing Systems (NeurIPS), 2023
Tuan Dinh
Jinman Zhao
Samson Tan
Renato M. P. Negrinho
Leonard Lausen
Sheng Zha
George Karypis
LRM
211
42
0
06 Jun 2023
LambdaBeam: Neural Program Search with Higher-Order Functions and Lambdas
Neural Information Processing Systems (NeurIPS), 2023
Kensen Shi
H. Dai
Wen-Ding Li
Kevin Ellis
Charles Sutton
200
8
0
03 Jun 2023
Better Context Makes Better Code Language Models: A Case Study on Function Call Argument Completion
AAAI Conference on Artificial Intelligence (AAAI), 2023
Hengzhi Pei
Jinman Zhao
Leonard Lausen
Sheng Zha
George Karypis
ELM LRM
112
26
0
01 Jun 2023
Feature Engineering-Based Detection of Buffer Overflow Vulnerability in Source Code Using Neural Networks
Annual International Computer Software and Applications Conference (COMPSAC), 2023
Mst. Shapna Akter
Hossain Shahriar
Juan Rodriguez Cardenas
S. Ahamed
A. Cuzzocrea
192
2
0
01 Jun 2023
PERFOGRAPH: A Numerical Aware Program Graph Representation for Performance Optimization and Program Analysis
Neural Information Processing Systems (NeurIPS), 2023
Ali TehraniJamsaz
Quazi Ishtiaque Mahmud
Le Chen
Nasreen K. Ahmed
Ali Jannesari
196
11
0
31 May 2023
Evaluating GPT's Programming Capability through CodeWars' Katas
Knowledge Science, Engineering and Management (KSEM), 2023
Zizhuo Zhang
Lian Wen
Shaoyang Zhang
David Chen
Yanfei Jiang
ELM
148
4
0
31 May 2023
TransCoder: Towards Unified Transferable Code Representation Learning Inspired by Human Skills
International Conference on Language Resources and Evaluation (LREC), 2023
Qiushi Sun
Polydoros Giannouris
Jiadong Wang
Xiang Li
Ming Gao
256
12
0
23 May 2023
Neural Machine Translation for Code Generation
K. Dharma
Clayton T. Morrison
300
7
0
22 May 2023
Towards Code Generation from BDD Test Case Specifications: A Vision
Leon Chemnitz
David Reichenbach
Hani Aldebes
Mariam Naveed
Krishna Narasimhan
Mira Mezini
122
4
0
19 May 2023
Modelling Concurrency Bugs Using Machine Learning
Teodor Rares Begu
106
0
0
08 May 2023
The EarlyBIRD Catches the Bug: On Exploiting Early Layers of Encoder Models for More Efficient Code Classification
Anastasiia Grishina
Max Hort
Leon Moonen
188
12
0
08 May 2023
Using Large Language Models to Generate JUnit Tests: An Empirical Study
International Conference on Evaluation & Assessment in Software Engineering (EASE), 2023
Mohammed Latif Siddiq
Joanna C. S. Santos
Ridwanul Hasan Tanvir
Noshin Ulfat
Fahmid Al Rifat
Vinicius Carvalho Lopes
ELM
412
96
0
30 Apr 2023
A Review of ChatGPT Applications in Education, Marketing, Software Engineering, and Healthcare: Benefits, Drawbacks, and Research Directions
Mohammad Fraiwan
Natheer Khasawneh
207
57
0
29 Apr 2023
ICE-Score: Instructing Large Language Models to Evaluate Code
Findings (Findings), 2023
Terry Yue Zhuo
ELM ALM
315
64
0
27 Apr 2023
Performance Optimization using Multimodal Modeling and Heterogeneous GNN
IEEE International Symposium on High-Performance Parallel Distributed Computing (HPDC), 2023
Akashnil Dutta
J. Alcaraz
Ali TehraniJamsaz
E. César
A. Sikora
Ali Jannesari
173
12
0
25 Apr 2023
Program Comprehension Does Not Primarily Rely On the Language Centers of the Human Brain
Shashank Srikant
Anna A. Ivanova
Yotaro Sueoka
Hope H. Kean
Riva Dhamala
Evelina Fedorenko
M. U. Bers
Una-May O’Reilly
70
0
0
11 Apr 2023
CrossCode: Multi-level Visualization of Program Execution
International Conference on Human Factors in Computing Systems (CHI), 2023
Devamardeep Hayatpur
Haijun Xia
Daniel J. Wigdor
111
16
0
07 Apr 2023
Implant Global and Local Hierarchy Information to Sequence based Code Representation Models
IEEE International Conference on Program Comprehension (ICPC), 2023
Kechi Zhang
Zhuo Li
Zhi Jin
Ge Li
172
9
0
14 Mar 2023
xASTNN: Improved Code Representations for Industrial Practice
Zhiwei Xu
Min Zhou
Xibin Zhao
Yang Chen
Xi Cheng
Hongyu Zhang
AI4TS
166
7
0
13 Mar 2023
Study of Distractors in Neural Models of Code
Md Rafiqul Islam Rabin
Aftab Hussain
Sahil Suneja
Mohammad Amin Alipour
AAML
140
6
0
03 Mar 2023
Power Constrained Autotuning using Graph Neural Networks
IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2023
Akashnil Dutta
JeeWhan Choi
Ali Jannesari
164
6
0
22 Feb 2023
The Programmer's Assistant: Conversational Interaction with a Large Language Model for Software Development
International Conference on Intelligent User Interfaces (IUI), 2023
Steven I. Ross
Fernando Martinez
Stephanie Houde
Michael J. Muller
Justin D. Weisz
217
288
0
14 Feb 2023
CodeBERTScore: Evaluating Code Generation with Pretrained Models of Code
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Shuyan Zhou
Uri Alon
Sumit Agarwal
Graham Neubig
ELM ALM
249
149
0
10 Feb 2023
VuLASTE: Long Sequence Model with Abstract Syntax Tree Embedding for vulnerability Detection
Botong Zhu
Huobin Tan
99
1
0
05 Feb 2023
Beware of the Unexpected: Bimodal Taint Analysis
International Symposium on Software Testing and Analysis (ISSTA), 2023
Yiu Wai Chow
Max Schäfer
Michael Pradel
166
23
0
25 Jan 2023
CaRE: Finding Root Causes of Configuration Issues in Highly-Configurable Robots
IEEE Robotics and Automation Letters (RA-L), 2023
Md. Abir Hossen
Sonam Kharade
B. Schmerl
Javier Cámara
Jason M. O'Kane
E. Czaplinski
K. Dzurilla
David Garlan
Pooyan Jamshidi
185
13
0
18 Jan 2023