ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1709.06182
  4. Cited By
A Survey of Machine Learning for Big Code and Naturalness
v1v2 (latest)

A Survey of Machine Learning for Big Code and Naturalness

18 September 2017
Miltiadis Allamanis
Earl T. Barr
Premkumar T. Devanbu
Charles Sutton
ArXiv (abs)PDFHTML

Papers citing "A Survey of Machine Learning for Big Code and Naturalness"

50 / 298 papers shown
Title
From Code Foundation Models to Agents and Applications: A Comprehensive Survey and Practical Guide to Code Intelligence
From Code Foundation Models to Agents and Applications: A Comprehensive Survey and Practical Guide to Code Intelligence
J. Yang
Wei Emma Zhang
Shark Liu
J. Wu
Shawn Guo
...
Zizheng Zhan
Jiajun Zhang
Jie Zhang
Zhaoxiang Zhang
Bo Zheng
LLMAGALMELM
776
0
0
23 Nov 2025
Scaling Laws for Code: A More Data-Hungry Regime
Scaling Laws for Code: A More Data-Hungry Regime
Xianzhen Luo
Wenzhen Zheng
Qingfu Zhu
Rongyi Zhang
Houyi Li
Siming Huang
YuanTao Fan
Wanxiang Che
ALM
100
2
0
09 Oct 2025
DeepCodeSeek: Real-Time API Retrieval for Context-Aware Code Generation
DeepCodeSeek: Real-Time API Retrieval for Context-Aware Code Generation
Esakkivel Esakkiraja
Denis Akhiyarov
Aditya Shanmugham
Chitra Ganapathy
60
0
0
30 Sep 2025
RANGER -- Repository-Level Agent for Graph-Enhanced Retrieval
RANGER -- Repository-Level Agent for Graph-Enhanced Retrieval
Pratik Shah
Rajat Ghosh
Aryan Singhal
Debojyoti Dutta
140
0
0
27 Sep 2025
On the Soundness and Consistency of LLM Agents for Executing Test Cases Written in Natural Language
On the Soundness and Consistency of LLM Agents for Executing Test Cases Written in Natural Language
Sébastien Salva
Redha Taguelmimt
LLMAG
176
0
0
23 Sep 2025
Discovering Software Parallelization Points Using Deep Neural Networks
Discovering Software Parallelization Points Using Deep Neural Networks
Izavan dos S. Correia
Henrique C. T. Santos
Tiago Ferreira
72
0
0
05 Sep 2025
The Gold Medals in an Empty Room: Diagnosing Metalinguistic Reasoning in LLMs with Camlang
The Gold Medals in an Empty Room: Diagnosing Metalinguistic Reasoning in LLMs with Camlang
Fenghua Liu
Yulong Chen
Yixuan Liu
Zhujun Jin
Solomon Tsai
Ming Zhong
ReLMLRM
167
0
0
30 Aug 2025
Previously on... Automating Code Review
Previously on... Automating Code Review
Robert Heumüller
Frank Ortmeier
49
1
0
25 Aug 2025
The Fools are Certain; the Wise are Doubtful: Exploring LLM Confidence in Code Completion
The Fools are Certain; the Wise are Doubtful: Exploring LLM Confidence in Code Completion
Zoe Kotti
Konstantina Dritsa
D. Spinellis
Panos Louridas
104
2
0
22 Aug 2025
RAG for Geoscience: What We Expect, Gaps and Opportunities
RAG for Geoscience: What We Expect, Gaps and Opportunities
Runlong Yu
Shiyuan Luo
Rahul Ghosh
Jinkui Chi
Yiqun Xie
Xiaowei Jia
117
1
0
15 Aug 2025
Vibe Coding as a Reconfiguration of Intent Mediation in Software Development: Definition, Implications, and Research Agenda
Vibe Coding as a Reconfiguration of Intent Mediation in Software Development: Definition, Implications, and Research Agenda
Christian Meske
Tobias Hermanns
Esther von der Weiden
Kai-Uwe Loser
Thorsten Berger
143
7
0
29 Jul 2025
Automated Code Review Using Large Language Models with Symbolic Reasoning
Automated Code Review Using Large Language Models with Symbolic ReasoningInternational Service Availability Symposium (ISAS), 2025
Busra Icoz
Goksel Biricik
LRM
146
0
0
24 Jul 2025
AlgoTune: Can Language Models Speed Up General-Purpose Numerical Programs?
AlgoTune: Can Language Models Speed Up General-Purpose Numerical Programs?
Ori Press
Brandon Amos
Haoyu Zhao
Yikai Wu
Samuel K. Ainsworth
...
K. Lieret
Hanlin Zhang
Shirley Huang
Matthias Bethge
Ofir Press
ALMELMLM&MA
262
4
0
19 Jul 2025
Directed Acyclic Graph Convolutional Networks
Directed Acyclic Graph Convolutional Networks
Samuel Rey
Hamed Ajorlou
Gonzalo Mateos
GNNCMLAI4CE
176
0
0
13 Jun 2025
Deconstructing Obfuscation: A four-dimensional framework for evaluating Large Language Models assembly code deobfuscation capabilities
Deconstructing Obfuscation: A four-dimensional framework for evaluating Large Language Models assembly code deobfuscation capabilities
Anton Tkachenko
Dmitrij Suskevic
Benjamin Adolphi
279
1
0
26 May 2025
Towards Leveraging Large Language Model Summaries for Topic Modeling in Source Code
Towards Leveraging Large Language Model Summaries for Topic Modeling in Source Code
Michele Carissimi
Martina Saletta
C. Ferretti
154
1
0
24 Apr 2025
A Large-scale Class-level Benchmark Dataset for Code Generation with LLMs
A Large-scale Class-level Benchmark Dataset for Code Generation with LLMs
Musfiqur Rahman
SayedHassan Khatoonabadi
Emad Shihab
ALM
235
2
0
22 Apr 2025
TD-Suite: All Batteries Included Framework for Technical Debt Classification
TD-Suite: All Batteries Included Framework for Technical Debt Classification
Karthik Shivashankar
Antonio Martini
133
1
0
15 Apr 2025
Bringing Structure to Naturalness: On the Naturalness of ASTs
Bringing Structure to Naturalness: On the Naturalness of ASTs
Profir-Petru Pârţachi
Mahito Sugiyama
186
1
0
11 Apr 2025
Towards an Understanding of Context Utilization in Code Intelligence
Towards an Understanding of Context Utilization in Code Intelligence
Yanlin Wang
Kefeng Duan
Dewu Zheng
Ensheng Shi
F. Zhang
...
Xilin Liu
Yuchi Ma
Hongyu Zhang
Qianxiang Wang
Zibin Zheng
244
3
0
11 Apr 2025
Deep Learning-based Intrusion Detection Systems: A Survey
Deep Learning-based Intrusion Detection Systems: A Survey
Zhiwei Xu
Yujuan Wu
Shiheng Wang
Jiabao Gao
Tian Qiu
Ziqi Wang
Hai Wan
Xibin Zhao
311
15
0
10 Apr 2025
Semantic Mastery: Enhancing LLMs with Advanced Natural Language Understanding
Semantic Mastery: Enhancing LLMs with Advanced Natural Language Understanding
Mohanakrishnan Hariharan
139
2
0
01 Apr 2025
Enhancing Code LLM Training with Programmer Attention
Enhancing Code LLM Training with Programmer Attention
Y. Zhang
Chen Huang
Z. Karas
Dung T. Nguyen
Kevin Leach
Yu Huang
345
2
0
19 Mar 2025
Aligning Crowd-sourced Human Feedback for Reinforcement Learning on Code Generation by Large Language Models
Aligning Crowd-sourced Human Feedback for Reinforcement Learning on Code Generation by Large Language ModelsIEEE Transactions on Big Data (IEEE Trans. Big Data), 2025
M. Wong
C. Tan
ALM
295
15
0
19 Mar 2025
Fully Autonomous Programming using Iterative Multi-Agent Debugging with Large Language ModelsACM Transactions on Evolutionary Learning and Optimization (TELO), 2025
Anastasiia Grishina
Vadim Liventsev
Aki Härmä
Leon Moonen
ELM
331
3
0
10 Mar 2025
LoRACode: LoRA Adapters for Code Embeddings
LoRACode: LoRA Adapters for Code Embeddings
Saumya Chaturvedi
Aman Chadha
Laurent Bindschaedler
333
0
0
07 Mar 2025
Empirical evaluation of LLMs in predicting fixes of Configuration bugs in Smart Home System
Empirical evaluation of LLMs in predicting fixes of Configuration bugs in Smart Home System
Sheikh Moonwara Anjum Monisha
Atul Bharadwaj
181
0
0
16 Feb 2025
Should Code Models Learn Pedagogically? A Preliminary Evaluation of Curriculum Learning for Real-World Software Engineering Tasks
Should Code Models Learn Pedagogically? A Preliminary Evaluation of Curriculum Learning for Real-World Software Engineering TasksIEEE Working Conference on Mining Software Repositories (MSR), 2025
Kyi Shin Khant
Hong Yi Lin
Patanamon Thongtanunam
ELM
314
0
0
06 Feb 2025
Process-Supervised Reinforcement Learning for Code Generation
Process-Supervised Reinforcement Learning for Code Generation
Yufan Ye
Ting Zhang
Wenbin Jiang
Hua Huang
OffRLLRMSyDa
318
14
0
03 Feb 2025
From Critique to Clarity: A Pathway to Faithful and Personalized Code Explanations with Large Language Models
Zexing Xu
Zhuang Luo
Yichuan Li
Kyumin Lee
S. Rasoul Etesami
305
1
0
28 Jan 2025
Deep Learning-Based Identification of Inconsistent Method Names: How Far Are We?
Deep Learning-Based Identification of Inconsistent Method Names: How Far Are We?Empirical Software Engineering (EMSE), 2024
Taiming Wang
Yuxia Zhang
Lin Jiang
Yi Tang
Guangjie Li
Hui Liu
359
4
0
22 Jan 2025
Cracks in The Stack: Hidden Vulnerabilities and Licensing Risks in LLM Pre-Training Datasets
Mahmoud Jahanshahi
Audris Mockus
AAML
112
2
0
05 Jan 2025
Enhancing Code LLMs with Reinforcement Learning in Code Generation: A Survey
Enhancing Code LLMs with Reinforcement Learning in Code Generation: A Survey
Junqiao Wang
Zeng Zhang
Yangfan He
Yuyang Song
Lewei He
...
Tang Jingqun
Guangwu Qian
Keqin Li
Qiuwu Chen
Lewei He
541
83
0
29 Dec 2024
EnStack: An Ensemble Stacking Framework of Large Language Models for
  Enhanced Vulnerability Detection in Source Code
EnStack: An Ensemble Stacking Framework of Large Language Models for Enhanced Vulnerability Detection in Source CodeBigData Congress [Services Society] (BSS), 2024
Shahriyar Zaman Ridoy
Md. Shazzad Hossain Shaon
A. Cuzzocrea
Mst. Shapna Akter
217
6
0
25 Nov 2024
Mastering the Craft of Data Synthesis for CodeLLMs
Mastering the Craft of Data Synthesis for CodeLLMsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Meng Chen
Philip Arthur
Qianyu Feng
Cong Duy Vu Hoang
Yu-Heng Hong
...
Mark Johnson
Kemal Kurniawan
Don Dharmasiri
Long Duong
Yuan-Fang Li
SyDa
598
3
0
16 Oct 2024
In-Context Code-Text Learning for Bimodal Software Engineering
In-Context Code-Text Learning for Bimodal Software Engineering
Xunzhu Tang
Liran Wang
Yonghui Liu
Linzheng Chai
Jian Yang
Zhoujun Li
Haoye Tian
Jacques Klein
Tegawende F. Bissyande
278
1
0
08 Oct 2024
Leveraging Reviewer Experience in Code Review Comment Generation
Leveraging Reviewer Experience in Code Review Comment GenerationACM Transactions on Software Engineering and Methodology (TOSEM), 2024
Hong Yi Lin
Patanamon Thongtanunam
Christoph Treude
Michael W. Godfrey
Chunhua Liu
Wachiraphan Charoenwet
220
4
0
17 Sep 2024
HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale
HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale
Huy N. Phan
Phong X. Nguyen
P. Nguyen
Nghi D. Q. Bui
LLMAG
349
31
0
09 Sep 2024
A Joint Learning Model with Variational Interaction for Multilingual
  Program Translation
A Joint Learning Model with Variational Interaction for Multilingual Program TranslationInternational Conference on Automated Software Engineering (ASE), 2024
Yali Du
Hui Sun
Ming Li
324
5
0
25 Aug 2024
Is Generative AI the Next Tactical Cyber Weapon For Threat Actors?
  Unforeseen Implications of AI Generated Cyber Attacks
Is Generative AI the Next Tactical Cyber Weapon For Threat Actors? Unforeseen Implications of AI Generated Cyber Attacks
Yusuf Usman
Aadesh Upadhyay
P. Gyawali
Robin Chataut
AAML
231
6
0
23 Aug 2024
Deep Code Search with Naming-Agnostic Contrastive Multi-View Learning
Deep Code Search with Naming-Agnostic Contrastive Multi-View LearningACM Transactions on Knowledge Discovery from Data (TKDD), 2024
Jiadong Feng
Wei Li
Suhuang Wu
Zhao Wei
Yong Xu
Juhong Wang
Hui Li
165
2
0
18 Aug 2024
MIREncoder: Multi-modal IR-based Pretrained Embeddings for Performance
  Optimizations
MIREncoder: Multi-modal IR-based Pretrained Embeddings for Performance Optimizations
Akash Dutta
Ali Jannesari
219
3
0
02 Jul 2024
AgileCoder: Dynamic Collaborative Agents for Software Development based
  on Agile Methodology
AgileCoder: Dynamic Collaborative Agents for Software Development based on Agile Methodology
Minh Huynh Nguyen
Thang Phan Chau
Phong X. Nguyen
Nghi D. Q. Bui
290
36
0
16 Jun 2024
Morescient GAI for Software Engineering
Morescient GAI for Software EngineeringACM Transactions on Software Engineering and Methodology (TOSEM), 2024
Marcus Kessel
Colin Atkinson
SyDa
223
4
0
07 Jun 2024
Requirements are All You Need: The Final Frontier for End-User Software
  Engineering
Requirements are All You Need: The Final Frontier for End-User Software Engineering
Diana Robinson
Christian Cabrera
Andrew D. Gordon
Neil D. Lawrence
Lars Mennen
213
10
0
22 May 2024
A Systematic Evaluation of Large Language Models for Natural Language
  Generation Tasks
A Systematic Evaluation of Large Language Models for Natural Language Generation TasksChina National Conference on Chinese Computational Linguistics (CCL), 2024
Xuanfan Ni
Piji Li
ELMLRM
173
13
0
16 May 2024
Convolutional Learning on Directed Acyclic Graphs
Convolutional Learning on Directed Acyclic GraphsAsilomar Conference on Signals, Systems and Computers (ACSSC), 2024
Samuel Rey
Hamed Ajorlou
Gonzalo Mateos
CMLAI4CEGNN
191
7
0
05 May 2024
Exploring and Unleashing the Power of Large Language Models in Automated
  Code Translation
Exploring and Unleashing the Power of Large Language Models in Automated Code Translation
Zhen Yang
Fang Liu
Zhongxing Yu
J. Keung
Jia Li
Shuo Liu
Yifan Hong
Xiaoxue Ma
Zhi Jin
Ge Li
304
125
0
23 Apr 2024
Vulnerability Detection with Code Language Models: How Far Are We?
Vulnerability Detection with Code Language Models: How Far Are We?
Yangruibo Ding
Yanjun Fu
Omniyyah Ibrahim
Chawin Sitawarin
Xinyun Chen
Basel Alomair
David Wagner
Baishakhi Ray
Yizheng Chen
AAML
253
139
0
27 Mar 2024
Genetic Auto-prompt Learning for Pre-trained Code Intelligence Language
  Models
Genetic Auto-prompt Learning for Pre-trained Code Intelligence Language Models
Chengzhe Feng
Yanan Sun
Ke Li
Pan Zhou
Jiancheng Lv
Aojun Lu
289
3
0
20 Mar 2024
123456
Next