ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1709.06182
  4. Cited By
A Survey of Machine Learning for Big Code and Naturalness

A Survey of Machine Learning for Big Code and Naturalness

18 September 2017
Miltiadis Allamanis
Earl T. Barr
Premkumar T. Devanbu
Charles Sutton
ArXivPDFHTML

Papers citing "A Survey of Machine Learning for Big Code and Naturalness"

50 / 115 papers shown
Title
A Large-scale Class-level Benchmark Dataset for Code Generation with LLMs
A Large-scale Class-level Benchmark Dataset for Code Generation with LLMs
Musfiqur Rahman
SayedHassan Khatoonabadi
Emad Shihab
ALM
39
0
0
22 Apr 2025
Bringing Structure to Naturalness: On the Naturalness of ASTs
Bringing Structure to Naturalness: On the Naturalness of ASTs
Profir-Petru Pârţachi
Mahito Sugiyama
27
0
0
11 Apr 2025
Deep Learning-based Intrusion Detection Systems: A Survey
Deep Learning-based Intrusion Detection Systems: A Survey
Zhiwei Xu
Yujuan Wu
Shiheng Wang
Jiabao Gao
Tian Qiu
Ziqi Wang
Hai Wan
Xibin Zhao
26
1
0
10 Apr 2025
Enhancing Code LLM Training with Programmer Attention
Enhancing Code LLM Training with Programmer Attention
Y. Zhang
Chen Huang
Z. Karas
Dung T. Nguyen
Kevin Leach
Yu Huang
75
0
0
19 Mar 2025
Aligning Crowd-sourced Human Feedback for Reinforcement Learning on Code Generation by Large Language Models
Aligning Crowd-sourced Human Feedback for Reinforcement Learning on Code Generation by Large Language Models
M. Wong
C. Tan
ALM
83
4
0
19 Mar 2025
Fully Autonomous Programming using Iterative Multi-Agent Debugging with Large Language Models
Anastasiia Grishina
Vadim Liventsev
Aki Härmä
Leon Moonen
ELM
87
0
0
10 Mar 2025
Empirical evaluation of LLMs in predicting fixes of Configuration bugs in Smart Home System
Empirical evaluation of LLMs in predicting fixes of Configuration bugs in Smart Home System
Sheikh Moonwara Anjum Monisha
Atul Bharadwaj
49
0
0
16 Feb 2025
Process-Supervised Reinforcement Learning for Code Generation
Process-Supervised Reinforcement Learning for Code Generation
Yufan Ye
Ting Zhang
Wenbin Jiang
Hua Huang
OffRL
LRM
SyDa
63
1
0
03 Feb 2025
From Critique to Clarity: A Pathway to Faithful and Personalized Code Explanations with Large Language Models
Zexing Xu
Zhuang Luo
Yichuan Li
Kyumin Lee
S. Rasoul Etesami
38
0
0
28 Jan 2025
Deep Learning-Based Identification of Inconsistent Method Names: How Far Are We?
Deep Learning-Based Identification of Inconsistent Method Names: How Far Are We?
Taiming Wang
Yuxia Zhang
Lin Jiang
Yi Tang
Guangjie Li
Hui Liu
88
1
0
22 Jan 2025
Enhancing Code LLMs with Reinforcement Learning in Code Generation: A Survey
Enhancing Code LLMs with Reinforcement Learning in Code Generation: A Survey
Junqiao Wang
Zeng Zhang
Yangfan He
Yuyang Song
Tianyu Shi
...
Hengyuan Xu
Kunyu Wu
Guangwu Qian
Qiuwu Chen
Lewei He
38
11
0
03 Jan 2025
Mastering the Craft of Data Synthesis for CodeLLMs
Mastering the Craft of Data Synthesis for CodeLLMs
Meng Chen
Philip Arthur
Qianyu Feng
Cong Duy Vu Hoang
Yu-Heng Hong
...
Mark Johnson
Kemal Kurniawan
Don Dharmasiri
Long Duong
Yuan-Fang Li
SyDa
60
1
0
16 Oct 2024
A Systematic Evaluation of Large Language Models for Natural Language
  Generation Tasks
A Systematic Evaluation of Large Language Models for Natural Language Generation Tasks
Xuanfan Ni
Piji Li
ELM
LRM
34
8
0
16 May 2024
Deep Learning for Code Intelligence: Survey, Benchmark and Toolkit
Deep Learning for Code Intelligence: Survey, Benchmark and Toolkit
Yao Wan
Yang He
Zhangqian Bi
Jianguo Zhang
Hongyu Zhang
Yulei Sui
Guandong Xu
Hai Jin
Philip S. Yu
35
20
0
30 Dec 2023
Learning Generalizable Program and Architecture Representations for
  Performance Modeling
Learning Generalizable Program and Architecture Representations for Performance Modeling
Lingda Li
T. Flynn
A. Hoisie
26
1
0
25 Oct 2023
EditSum: A Retrieve-and-Edit Framework for Source Code Summarization
EditSum: A Retrieve-and-Edit Framework for Source Code Summarization
Jia Li
Yongming Li
Ge Li
Xing Hu
Xin Xia
Zhi Jin
25
65
0
26 Aug 2023
AST-MHSA : Code Summarization using Multi-Head Self-Attention
AST-MHSA : Code Summarization using Multi-Head Self-Attention
Y. Nagaraj
U. Gupta
20
1
0
10 Aug 2023
COMEX: A Tool for Generating Customized Source Code Representations
COMEX: A Tool for Generating Customized Source Code Representations
Debeshee Das
N. Mathews
Alex Mathai
Srikanth G. Tamilselvam
Kranthi Sedamaki
S. Chimalakonda
Atul Kumar
VLM
23
5
0
10 Jul 2023
Natural Language Generation and Understanding of Big Code for
  AI-Assisted Programming: A Review
Natural Language Generation and Understanding of Big Code for AI-Assisted Programming: A Review
M. Wong
Shangxin Guo
Ching Nam Hang
Siu-Wai Ho
C. Tan
42
78
0
04 Jul 2023
Neural Machine Translation for Code Generation
Neural Machine Translation for Code Generation
K. Dharma
Clayton T. Morrison
32
4
0
22 May 2023
Towards Code Generation from BDD Test Case Specifications: A Vision
Towards Code Generation from BDD Test Case Specifications: A Vision
Leon Chemnitz
David Reichenbach
Hani Aldebes
Mariam Naveed
Krishna Narasimhan
Mira Mezini
16
3
0
19 May 2023
Modelling Concurrency Bugs Using Machine Learning
Modelling Concurrency Bugs Using Machine Learning
Teodor Rares Begu
13
0
0
08 May 2023
The EarlyBIRD Catches the Bug: On Exploiting Early Layers of Encoder
  Models for More Efficient Code Classification
The EarlyBIRD Catches the Bug: On Exploiting Early Layers of Encoder Models for More Efficient Code Classification
Anastasiia Grishina
Max Hort
Leon Moonen
22
6
0
08 May 2023
Implant Global and Local Hierarchy Information to Sequence based Code
  Representation Models
Implant Global and Local Hierarchy Information to Sequence based Code Representation Models
Kechi Zhang
Zhuo Li
Zhi Jin
Ge Li
29
7
0
14 Mar 2023
xASTNN: Improved Code Representations for Industrial Practice
xASTNN: Improved Code Representations for Industrial Practice
Zhiwei Xu
Min Zhou
Xibin Zhao
Yang Chen
Xi Cheng
Hongyu Zhang
AI4TS
29
5
0
13 Mar 2023
CodeBERTScore: Evaluating Code Generation with Pretrained Models of Code
CodeBERTScore: Evaluating Code Generation with Pretrained Models of Code
Shuyan Zhou
Uri Alon
Sumit Agarwal
Graham Neubig
ELM
ALM
40
98
0
10 Feb 2023
VuLASTE: Long Sequence Model with Abstract Syntax Tree Embedding for
  vulnerability Detection
VuLASTE: Long Sequence Model with Abstract Syntax Tree Embedding for vulnerability Detection
Botong Zhu
Huobin Tan
29
0
0
05 Feb 2023
Serenity: Library Based Python Code Analysis for Code Completion and
  Automated Machine Learning
Serenity: Library Based Python Code Analysis for Code Completion and Automated Machine Learning
Wenting Zhao
Ibrahim Abdelaziz
Julian T Dolby
Kavitha Srinivas
M. Helali
Essam Mansour
24
0
0
05 Jan 2023
JEMMA: An Extensible Java Dataset for ML4Code Applications
JEMMA: An Extensible Java Dataset for ML4Code Applications
Anjan Karmakar
Miltiadis Allamanis
Romain Robbes
VLM
29
3
0
18 Dec 2022
DexBERT: Effective, Task-Agnostic and Fine-grained Representation
  Learning of Android Bytecode
DexBERT: Effective, Task-Agnostic and Fine-grained Representation Learning of Android Bytecode
Tiezhu Sun
Kevin Allix
Kisub Kim
Xin Zhou
Dongsun Kim
David Lo
Tegawende F. Bissyande
Jacques Klein
24
11
0
12 Dec 2022
Machine Learning for Software Engineering: A Tertiary Study
Machine Learning for Software Engineering: A Tertiary Study
Zoe Kotti
R. Galanopoulou
D. Spinellis
31
21
0
17 Nov 2022
Rethinking Storage Management for Data Processing Pipelines in Cloud
  Data Centers
Rethinking Storage Management for Data Processing Pipelines in Cloud Data Centers
Ubaid Ullah Hafeez
Martin Maas
Mustafa Uysal
Richard McDougall
11
0
0
04 Nov 2022
Poison Attack and Defense on Deep Source Code Processing Models
Poison Attack and Defense on Deep Source Code Processing Models
Jia Li
Zhuo Li
Huangzhao Zhang
Ge Li
Zhi Jin
Xing Hu
Xin Xia
AAML
40
16
0
31 Oct 2022
Comparing neural network training performance between Elixir and Python
Comparing neural network training performance between Elixir and Python
Lucas C. Tavano
Lucas K. Amin
Adolfo Gustavo Serra Seca Neto
13
0
0
25 Oct 2022
CAT-probing: A Metric-based Approach to Interpret How Pre-trained Models
  for Programming Language Attend Code Structure
CAT-probing: A Metric-based Approach to Interpret How Pre-trained Models for Programming Language Attend Code Structure
Nuo Chen
Qiushi Sun
Renyu Zhu
Xiang Li
Xuesong Lu
Ming Gao
44
10
0
07 Oct 2022
Finding Reusable Machine Learning Components to Build Programming
  Language Processing Pipelines
Finding Reusable Machine Learning Components to Build Programming Language Processing Pipelines
Patrick Flynn
T. Vanderbruggen
C. Liao
Pei-Hung Lin
M. Emani
Xipeng Shen
21
4
0
11 Aug 2022
Test2Vec: An Execution Trace Embedding for Test Case Prioritization
Test2Vec: An Execution Trace Embedding for Test Case Prioritization
E. Jabbar
Soheila Zangeneh
Hadi Hemmati
R. Feldt
43
5
0
28 Jun 2022
An Extractive-and-Abstractive Framework for Source Code Summarization
An Extractive-and-Abstractive Framework for Source Code Summarization
Weisong Sun
Chunrong Fang
Yuchen Chen
Quanjun Zhang
Guanhong Tao
Tingxu Han
Yifei Ge
Yudu You
Bin Luo
26
29
0
15 Jun 2022
CodeS: Towards Code Model Generalization Under Distribution Shift
CodeS: Towards Code Model Generalization Under Distribution Shift
Qiang Hu
Yuejun Guo
Xiaofei Xie
Maxime Cordy
Lei Ma
Mike Papadakis
Yves Le Traon
OOD
30
10
0
11 Jun 2022
A Neural Network Architecture for Program Understanding Inspired by
  Human Behaviors
A Neural Network Architecture for Program Understanding Inspired by Human Behaviors
Renyu Zhu
Lei Yuan
Xiang Li
Ming Gao
Wenyuan Cai
29
8
0
10 May 2022
CODE-MVP: Learning to Represent Source Code from Multiple Views with
  Contrastive Pre-Training
CODE-MVP: Learning to Represent Source Code from Multiple Views with Contrastive Pre-Training
Xin Wang
Yasheng Wang
Yao Wan
Jiawei Wang
Pingyi Zhou
Li Li
Hao Wu
Jin Liu
26
33
0
04 May 2022
Compositional Generalization and Decomposition in Neural Program
  Synthesis
Compositional Generalization and Decomposition in Neural Program Synthesis
Kensen Shi
Joey Hong
Manzil Zaheer
Pengcheng Yin
Charles Sutton
37
5
0
07 Apr 2022
CoCoSoDa: Effective Contrastive Learning for Code Search
CoCoSoDa: Effective Contrastive Learning for Code Search
Ensheng Shi
Yanlin Wang
Wenchao Gu
Lun Du
Hongyu Zhang
Shi Han
Dongmei Zhang
Hongbin Sun
41
33
0
07 Apr 2022
PACE: A Parallelizable Computation Encoder for Directed Acyclic Graphs
PACE: A Parallelizable Computation Encoder for Directed Acyclic Graphs
Zehao Dong
Muhan Zhang
Fuhai Li
Yixin Chen
CML
GNN
33
17
0
19 Mar 2022
Static Prediction of Runtime Errors by Learning to Execute Programs with
  External Resource Descriptions
Static Prediction of Runtime Errors by Learning to Execute Programs with External Resource Descriptions
David Bieber
Rishab Goel
Daniel Zheng
Hugo Larochelle
Daniel Tarlow
16
15
0
07 Mar 2022
Better Together? An Evaluation of AI-Supported Code Translation
Better Together? An Evaluation of AI-Supported Code Translation
Justin D. Weisz
Michael J. Muller
Steven I. Ross
Fernando Martinez
Stephanie Houde
Mayank Agarwal
Kartik Talamadupula
John T. Richards
29
67
0
15 Feb 2022
CodeFill: Multi-token Code Completion by Jointly Learning from Structure
  and Naming Sequences
CodeFill: Multi-token Code Completion by Jointly Learning from Structure and Naming Sequences
M. Izadi
Roberta Gismondi
Georgios Gousios
23
97
0
14 Feb 2022
A Survey on Artificial Intelligence for Source Code: A Dialogue Systems
  Perspective
A Survey on Artificial Intelligence for Source Code: A Dialogue Systems Perspective
Erfan Al-Hossami
Samira Shaikh
32
6
0
10 Feb 2022
Towards Property-Based Tests in Natural Language
Towards Property-Based Tests in Natural Language
Colin S. Gordon
ELM
14
2
0
08 Feb 2022
Featherweight Assisted Vulnerability Discovery
Featherweight Assisted Vulnerability Discovery
D. Binkley
Leon Moonen
Sibren Isaacman
AAML
11
0
0
06 Feb 2022
123
Next