1709.06182
Cited By
A Survey of Machine Learning for Big Code and Naturalness
18 September 2017
Miltiadis Allamanis
Earl T. Barr
Premkumar T. Devanbu
Charles Sutton
Papers citing
"A Survey of Machine Learning for Big Code and Naturalness"
50 / 115 papers shown
Title
A Large-scale Class-level Benchmark Dataset for Code Generation with LLMs
Musfiqur Rahman
SayedHassan Khatoonabadi
Emad Shihab
ALM
39
0
0
22 Apr 2025
Bringing Structure to Naturalness: On the Naturalness of ASTs
Profir-Petru Pârţachi
Mahito Sugiyama
27
0
0
11 Apr 2025
Deep Learning-based Intrusion Detection Systems: A Survey
Zhiwei Xu
Yujuan Wu
Shiheng Wang
Jiabao Gao
Tian Qiu
Ziqi Wang
Hai Wan
Xibin Zhao
26
1
0
10 Apr 2025
Enhancing Code LLM Training with Programmer Attention
Y. Zhang
Chen Huang
Z. Karas
Dung T. Nguyen
Kevin Leach
Yu Huang
75
0
0
19 Mar 2025
Aligning Crowd-sourced Human Feedback for Reinforcement Learning on Code Generation by Large Language Models
M. Wong
C. Tan
ALM
83
4
0
19 Mar 2025
Fully Autonomous Programming using Iterative Multi-Agent Debugging with Large Language Models
Anastasiia Grishina
Vadim Liventsev
Aki Härmä
Leon Moonen
ELM
87
0
0
10 Mar 2025
Empirical evaluation of LLMs in predicting fixes of Configuration bugs in Smart Home System
Sheikh Moonwara Anjum Monisha
Atul Bharadwaj
49
0
0
16 Feb 2025
Process-Supervised Reinforcement Learning for Code Generation
Yufan Ye
Ting Zhang
Wenbin Jiang
Hua Huang
OffRL
LRM
SyDa
63
1
0
03 Feb 2025
From Critique to Clarity: A Pathway to Faithful and Personalized Code Explanations with Large Language Models
Zexing Xu
Zhuang Luo
Yichuan Li
Kyumin Lee
S. Rasoul Etesami
38
0
0
28 Jan 2025
Deep Learning-Based Identification of Inconsistent Method Names: How Far Are We?
Taiming Wang
Yuxia Zhang
Lin Jiang
Yi Tang
Guangjie Li
Hui Liu
88
1
0
22 Jan 2025
Enhancing Code LLMs with Reinforcement Learning in Code Generation: A Survey
Junqiao Wang
Zeng Zhang
Yangfan He
Yuyang Song
Tianyu Shi
...
Hengyuan Xu
Kunyu Wu
Guangwu Qian
Qiuwu Chen
Lewei He
38
11
0
03 Jan 2025
Mastering the Craft of Data Synthesis for CodeLLMs
Meng Chen
Philip Arthur
Qianyu Feng
Cong Duy Vu Hoang
Yu-Heng Hong
...
Mark Johnson
Kemal Kurniawan
Don Dharmasiri
Long Duong
Yuan-Fang Li
SyDa
60
1
0
16 Oct 2024
A Systematic Evaluation of Large Language Models for Natural Language Generation Tasks
Xuanfan Ni
Piji Li
ELM
LRM
34
8
0
16 May 2024
Deep Learning for Code Intelligence: Survey, Benchmark and Toolkit
Yao Wan
Yang He
Zhangqian Bi
Jianguo Zhang
Hongyu Zhang
Yulei Sui
Guandong Xu
Hai Jin
Philip S. Yu
35
20
0
30 Dec 2023
Learning Generalizable Program and Architecture Representations for Performance Modeling
Lingda Li
T. Flynn
A. Hoisie
26
1
0
25 Oct 2023
EditSum: A Retrieve-and-Edit Framework for Source Code Summarization
Jia Li
Yongming Li
Ge Li
Xing Hu
Xin Xia
Zhi Jin
25
65
0
26 Aug 2023
AST-MHSA : Code Summarization using Multi-Head Self-Attention
Y. Nagaraj
U. Gupta
20
1
0
10 Aug 2023
COMEX: A Tool for Generating Customized Source Code Representations
Debeshee Das
N. Mathews
Alex Mathai
Srikanth G. Tamilselvam
Kranthi Sedamaki
S. Chimalakonda
Atul Kumar
VLM
23
5
0
10 Jul 2023
Natural Language Generation and Understanding of Big Code for AI-Assisted Programming: A Review
M. Wong
Shangxin Guo
Ching Nam Hang
Siu-Wai Ho
C. Tan
42
78
0
04 Jul 2023
Neural Machine Translation for Code Generation
K. Dharma
Clayton T. Morrison
32
4
0
22 May 2023
Towards Code Generation from BDD Test Case Specifications: A Vision
Leon Chemnitz
David Reichenbach
Hani Aldebes
Mariam Naveed
Krishna Narasimhan
Mira Mezini
16
3
0
19 May 2023
Modelling Concurrency Bugs Using Machine Learning
Teodor Rares Begu
13
0
0
08 May 2023
The EarlyBIRD Catches the Bug: On Exploiting Early Layers of Encoder Models for More Efficient Code Classification
Anastasiia Grishina
Max Hort
Leon Moonen
22
6
0
08 May 2023
Implant Global and Local Hierarchy Information to Sequence based Code Representation Models
Kechi Zhang
Zhuo Li
Zhi Jin
Ge Li
29
7
0
14 Mar 2023
xASTNN: Improved Code Representations for Industrial Practice
Zhiwei Xu
Min Zhou
Xibin Zhao
Yang Chen
Xi Cheng
Hongyu Zhang
AI4TS
29
5
0
13 Mar 2023
CodeBERTScore: Evaluating Code Generation with Pretrained Models of Code
Shuyan Zhou
Uri Alon
Sumit Agarwal
Graham Neubig
ELM
ALM
40
98
0
10 Feb 2023
VuLASTE: Long Sequence Model with Abstract Syntax Tree Embedding for vulnerability Detection
Botong Zhu
Huobin Tan
29
0
0
05 Feb 2023
Serenity: Library Based Python Code Analysis for Code Completion and Automated Machine Learning
Wenting Zhao
Ibrahim Abdelaziz
Julian T Dolby
Kavitha Srinivas
M. Helali
Essam Mansour
24
0
0
05 Jan 2023
JEMMA: An Extensible Java Dataset for ML4Code Applications
Anjan Karmakar
Miltiadis Allamanis
Romain Robbes
VLM
29
3
0
18 Dec 2022
DexBERT: Effective, Task-Agnostic and Fine-grained Representation Learning of Android Bytecode
Tiezhu Sun
Kevin Allix
Kisub Kim
Xin Zhou
Dongsun Kim
David Lo
Tegawende F. Bissyande
Jacques Klein
24
11
0
12 Dec 2022
Machine Learning for Software Engineering: A Tertiary Study
Zoe Kotti
R. Galanopoulou
D. Spinellis
31
21
0
17 Nov 2022
Rethinking Storage Management for Data Processing Pipelines in Cloud Data Centers
Ubaid Ullah Hafeez
Martin Maas
Mustafa Uysal
Richard McDougall
11
0
0
04 Nov 2022
Poison Attack and Defense on Deep Source Code Processing Models
Jia Li
Zhuo Li
Huangzhao Zhang
Ge Li
Zhi Jin
Xing Hu
Xin Xia
AAML
40
16
0
31 Oct 2022
Comparing neural network training performance between Elixir and Python
Lucas C. Tavano
Lucas K. Amin
Adolfo Gustavo Serra Seca Neto
13
0
0
25 Oct 2022
CAT-probing: A Metric-based Approach to Interpret How Pre-trained Models for Programming Language Attend Code Structure
Nuo Chen
Qiushi Sun
Renyu Zhu
Xiang Li
Xuesong Lu
Ming Gao
44
10
0
07 Oct 2022
Finding Reusable Machine Learning Components to Build Programming Language Processing Pipelines
Patrick Flynn
T. Vanderbruggen
C. Liao
Pei-Hung Lin
M. Emani
Xipeng Shen
21
4
0
11 Aug 2022
Test2Vec: An Execution Trace Embedding for Test Case Prioritization
E. Jabbar
Soheila Zangeneh
Hadi Hemmati
R. Feldt
43
5
0
28 Jun 2022
An Extractive-and-Abstractive Framework for Source Code Summarization
Weisong Sun
Chunrong Fang
Yuchen Chen
Quanjun Zhang
Guanhong Tao
Tingxu Han
Yifei Ge
Yudu You
Bin Luo
26
29
0
15 Jun 2022
CodeS: Towards Code Model Generalization Under Distribution Shift
Qiang Hu
Yuejun Guo
Xiaofei Xie
Maxime Cordy
Lei Ma
Mike Papadakis
Yves Le Traon
OOD
30
10
0
11 Jun 2022
A Neural Network Architecture for Program Understanding Inspired by Human Behaviors
Renyu Zhu
Lei Yuan
Xiang Li
Ming Gao
Wenyuan Cai
29
8
0
10 May 2022
CODE-MVP: Learning to Represent Source Code from Multiple Views with Contrastive Pre-Training
Xin Wang
Yasheng Wang
Yao Wan
Jiawei Wang
Pingyi Zhou
Li Li
Hao Wu
Jin Liu
26
33
0
04 May 2022
Compositional Generalization and Decomposition in Neural Program Synthesis
Kensen Shi
Joey Hong
Manzil Zaheer
Pengcheng Yin
Charles Sutton
37
5
0
07 Apr 2022
CoCoSoDa: Effective Contrastive Learning for Code Search
Ensheng Shi
Yanlin Wang
Wenchao Gu
Lun Du
Hongyu Zhang
Shi Han
Dongmei Zhang
Hongbin Sun
41
33
0
07 Apr 2022
PACE: A Parallelizable Computation Encoder for Directed Acyclic Graphs
Zehao Dong
Muhan Zhang
Fuhai Li
Yixin Chen
CML
GNN
33
17
0
19 Mar 2022
Static Prediction of Runtime Errors by Learning to Execute Programs with External Resource Descriptions
David Bieber
Rishab Goel
Daniel Zheng
Hugo Larochelle
Daniel Tarlow
16
15
0
07 Mar 2022
Better Together? An Evaluation of AI-Supported Code Translation
Justin D. Weisz
Michael J. Muller
Steven I. Ross
Fernando Martinez
Stephanie Houde
Mayank Agarwal
Kartik Talamadupula
John T. Richards
29
67
0
15 Feb 2022
CodeFill: Multi-token Code Completion by Jointly Learning from Structure and Naming Sequences
M. Izadi
Roberta Gismondi
Georgios Gousios
23
97
0
14 Feb 2022
A Survey on Artificial Intelligence for Source Code: A Dialogue Systems Perspective
Erfan Al-Hossami
Samira Shaikh
32
6
0
10 Feb 2022
Towards Property-Based Tests in Natural Language
Colin S. Gordon
ELM
14
2
0
08 Feb 2022
Featherweight Assisted Vulnerability Discovery
D. Binkley
Leon Moonen
Sibren Isaacman
AAML
11
0
0
06 Feb 2022