ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1805.08949
  4. Cited By
Learning to Mine Aligned Code and Natural Language Pairs from Stack
  Overflow

Learning to Mine Aligned Code and Natural Language Pairs from Stack Overflow

23 May 2018
Pengcheng Yin
Bowen Deng
Edgar Chen
Bogdan Vasilescu
Graham Neubig
ArXivPDFHTML

Papers citing "Learning to Mine Aligned Code and Natural Language Pairs from Stack Overflow"

43 / 43 papers shown
Title
Sigma: A dataset for text-to-code semantic parsing with statistical analysis
Sigma: A dataset for text-to-code semantic parsing with statistical analysis
Saleh Almohaimeed
Shenyang Liu
May Alsofyani
Saad Almohaimeed
Liqiang Wang
37
0
0
05 Apr 2025
ThrowBench: Benchmarking LLMs by Predicting Runtime Exceptions
Julian Aron Prenner
Romain Robbes
59
0
0
06 Mar 2025
Pragmatic Reasoning improves LLM Code Generation
Pragmatic Reasoning improves LLM Code Generation
Zhuchen Cao
Sven Apel
Adish Singla
Vera Demberg
LRM
39
0
0
20 Feb 2025
GitChameleon: Unmasking the Version-Switching Capabilities of Code
  Generation Models
GitChameleon: Unmasking the Version-Switching Capabilities of Code Generation Models
Nizar Islah
Justine Gehring
Diganta Misra
Eilif B. Muller
Irina Rish
Terry Yue Zhuo
Massimo Caccia
SyDa
38
1
0
05 Nov 2024
ConCodeEval: Evaluating Large Language Models for Code Constraints in Domain-Specific Languages
ConCodeEval: Evaluating Large Language Models for Code Constraints in Domain-Specific Languages
Mehant Kammakomati
Sameer Pimparkhede
Srikanth G. Tamilselvam
Prince Kumar
Pushpak Bhattacharyya
ALM
40
0
0
03 Jul 2024
Automatically Generating UI Code from Screenshot: A Divide-and-Conquer-Based Approach
Automatically Generating UI Code from Screenshot: A Divide-and-Conquer-Based Approach
Yuxuan Wan
Chaozheng Wang
Yi Dong
Wenxuan Wang
Shuqing Li
Yintong Huo
M. Lyu
3DV
73
10
0
24 Jun 2024
PECC: Problem Extraction and Coding Challenges
PECC: Problem Extraction and Coding Challenges
Patrick Haller
Jonas Golde
Alan Akbik
ReLM
32
5
0
29 Apr 2024
Revisiting Code Similarity Evaluation with Abstract Syntax Tree Edit
  Distance
Revisiting Code Similarity Evaluation with Abstract Syntax Tree Edit Distance
Yewei Song
Cedric Lothritz
Daniel Tang
Tegawende F. Bissyande
Jacques Klein
46
9
0
12 Apr 2024
Compositional API Recommendation for Library-Oriented Code Generation
Compositional API Recommendation for Library-Oriented Code Generation
Zexiong Ma
Shengnan An
Bing Xie
Zeqi Lin
32
17
0
29 Feb 2024
SelfEvolve: A Code Evolution Framework via Large Language Models
SelfEvolve: A Code Evolution Framework via Large Language Models
Shuyang Jiang
Yuhao Wang
Yu Wang
16
32
0
05 Jun 2023
Neural Machine Translation for Code Generation
Neural Machine Translation for Code Generation
K. Dharma
Clayton T. Morrison
32
4
0
22 May 2023
Prompting with Pseudo-Code Instructions
Prompting with Pseudo-Code Instructions
Mayank Mishra
Prince Kumar
Riyaz Ahmad Bhat
V. Rudramurthy
Danish Contractor
Srikanth G. Tamilselvam
42
13
0
19 May 2023
JaCoText: A Pretrained Model for Java Code-Text Generation
JaCoText: A Pretrained Model for Java Code-Text Generation
Jessica Nayeli López Espejel
Mahaman Sanoussi Yahaya Alassan
Walid Dahhane
E. Ettifouri
27
3
0
22 Mar 2023
CodeBERTScore: Evaluating Code Generation with Pretrained Models of Code
CodeBERTScore: Evaluating Code Generation with Pretrained Models of Code
Shuyan Zhou
Uri Alon
Sumit Agarwal
Graham Neubig
ELM
ALM
29
98
0
10 Feb 2023
JEMMA: An Extensible Java Dataset for ML4Code Applications
JEMMA: An Extensible Java Dataset for ML4Code Applications
Anjan Karmakar
Miltiadis Allamanis
Romain Robbes
VLM
21
3
0
18 Dec 2022
ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for
  Programming Languages
ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages
Yekun Chai
Shuohuan Wang
Chao Pang
Yu Sun
Hao Tian
Hua-Hong Wu
27
35
0
13 Dec 2022
Execution-based Evaluation for Data Science Code Generation Models
Execution-based Evaluation for Data Science Code Generation Models
Junjie Huang
Chenglong Wang
Jipeng Zhang
Cong Yan
Haotian Cui
J. Inala
Colin B. Clement
Nan Duan
Jianfeng Gao
ELM
30
35
0
17 Nov 2022
Evaluating How Fine-tuning on Bimodal Data Effects Code Generation
Evaluating How Fine-tuning on Bimodal Data Effects Code Generation
Gabriel Orlanski
Seonhye Yang
Michael Healy
ALM
21
5
0
15 Nov 2022
CodePAD: Sequence-based Code Generation with Pushdown Automaton
CodePAD: Sequence-based Code Generation with Pushdown Automaton
Yihong Dong
Xue Jiang
Yuchen Liu
Ge Li
Zhi Jin
20
6
0
02 Nov 2022
Multi-lingual Evaluation of Code Generation Models
Multi-lingual Evaluation of Code Generation Models
Ben Athiwaratkun
Sanjay Krishna Gouda
Zijian Wang
Xiaopeng Li
Yuchen Tian
...
Baishakhi Ray
Parminder Bhatia
Sudipta Sengupta
Dan Roth
Bing Xiang
ELM
112
160
0
26 Oct 2022
Antecedent Predictions Are More Important Than You Think: An Effective
  Method for Tree-Based Code Generation
Antecedent Predictions Are More Important Than You Think: An Effective Method for Tree-Based Code Generation
Yihong Dong
Ge Li
Xue Jiang
Zhi Jin
8
1
0
22 Aug 2022
MultiPL-E: A Scalable and Extensible Approach to Benchmarking Neural
  Code Generation
MultiPL-E: A Scalable and Extensible Approach to Benchmarking Neural Code Generation
Federico Cassano
John Gouwar
Daniel Nguyen
S. Nguyen
Luna Phipps-Costin
...
Carolyn Jane Anderson
Molly Q. Feldman
Arjun Guha
Michael Greenberg
Abhinav Jangda
ELM
22
81
0
17 Aug 2022
Transformer with Tree-order Encoding for Neural Program Generation
Transformer with Tree-order Encoding for Neural Program Generation
Klaudia Thellmann
Bernhard Stadler
Ricardo Usbeck
Jens Lehmann
17
1
0
30 May 2022
CODE-MVP: Learning to Represent Source Code from Multiple Views with
  Contrastive Pre-Training
CODE-MVP: Learning to Represent Source Code from Multiple Views with Contrastive Pre-Training
Xin Wang
Yasheng Wang
Yao Wan
Jiawei Wang
Pingyi Zhou
Li Li
Hao Wu
Jin Liu
21
33
0
04 May 2022
Natural Language to Code Translation with Execution
Natural Language to Code Translation with Execution
Freda Shi
Daniel Fried
Marjan Ghazvininejad
Luke Zettlemoyer
Sida I. Wang
33
123
0
25 Apr 2022
InCoder: A Generative Model for Code Infilling and Synthesis
InCoder: A Generative Model for Code Infilling and Synthesis
Daniel Fried
Armen Aghajanyan
Jessy Lin
Sida I. Wang
Eric Wallace
Freda Shi
Ruiqi Zhong
Wen-tau Yih
Luke Zettlemoyer
M. Lewis
SyDa
25
625
0
12 Apr 2022
MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languages
MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languages
Zhiruo Wang
Grace Cuenca
Shuyan Zhou
Frank F. Xu
Graham Neubig
21
50
0
16 Mar 2022
A Survey on Artificial Intelligence for Source Code: A Dialogue Systems
  Perspective
A Survey on Artificial Intelligence for Source Code: A Dialogue Systems Perspective
Erfan Al-Hossami
Samira Shaikh
26
6
0
10 Feb 2022
AstBERT: Enabling Language Model for Financial Code Understanding with
  Abstract Syntax Trees
AstBERT: Enabling Language Model for Financial Code Understanding with Abstract Syntax Trees
Rong Liang
Tiehu Zhang
Y. Lu
Yuze Liu
Zhengqing Huang
Xin Chen
14
3
0
20 Jan 2022
Text Classification for Task-based Source Code Related Questions
Text Classification for Task-based Source Code Related Questions
Sairamvinay Vijayaraghavan
Jinxiao Song
David A. Tomassi
Siddhartha Punj
Jailan Sabet
19
0
0
31 Oct 2021
A Survey on Machine Learning Techniques for Source Code Analysis
A Survey on Machine Learning Techniques for Source Code Analysis
Tushar Sharma
M. Kechagia
Stefanos Georgiou
Rohit Tiwari
Indira Vats
Hadi Moazen
Federica Sarro
25
61
0
18 Oct 2021
Reading StackOverflow Encourages Cheating: Adding Question Text Improves
  Extractive Code Generation
Reading StackOverflow Encourages Cheating: Adding Question Text Improves Extractive Code Generation
Gabriel Orlanski
Alex Gittens
29
20
0
08 Jun 2021
Exploring Dynamic Selection of Branch Expansion Orders for Code
  Generation
Exploring Dynamic Selection of Branch Expansion Orders for Code Generation
Hui Jiang
Chulun Zhou
Fandong Meng
Biao Zhang
Jie Zhou
Degen Huang
Qingqiang Wu
Jinsong Su
19
20
0
01 Jun 2021
Shellcode_IA32: A Dataset for Automatic Shellcode Generation
Shellcode_IA32: A Dataset for Automatic Shellcode Generation
Pietro Liguori
Erfan Al-Hossami
Domenico Cotroneo
R. Natella
B. Cukic
Samira Shaikh
32
27
0
27 Apr 2021
Code Generation from Natural Language with Less Prior and More
  Monolingual Data
Code Generation from Natural Language with Less Prior and More Monolingual Data
Sajad Norouzi
Keyi Tang
Yanshuai Cao
6
19
0
01 Jan 2021
Towards Full-line Code Completion with Neural Language Models
Towards Full-line Code Completion with Neural Language Models
Wenhan Wang
Sijie Shen
Ge Li
Zhi Jin
11
16
0
18 Sep 2020
A Systematic Literature Review on the Use of Deep Learning in Software
  Engineering Research
A Systematic Literature Review on the Use of Deep Learning in Software Engineering Research
Cody Watson
Nathan Cooper
David Nader-Palacio
Kevin Moran
Denys Poshyvanyk
26
111
0
14 Sep 2020
A Multi-Perspective Architecture for Semantic Code Search
A Multi-Perspective Architecture for Semantic Code Search
Rajarshi Haldar
Lingfei Wu
Jinjun Xiong
J. Hockenmaier
15
55
0
06 May 2020
Learning to Update Natural Language Comments Based on Code Changes
Learning to Update Natural Language Comments Based on Code Changes
Sheena Panthaplackel
Pengyu Nie
Miloš Gligorić
Junyi Jessy Li
Raymond J. Mooney
27
63
0
25 Apr 2020
Learning based Methods for Code Runtime Complexity Prediction
Learning based Methods for Code Runtime Complexity Prediction
Jagriti Sikka
K. Satya
Yaman Kumar Singla
Shagun Uppal
R. Shah
Roger Zimmermann
14
14
0
04 Nov 2019
JuICe: A Large Scale Distantly Supervised Dataset for Open Domain
  Context-based Code Generation
JuICe: A Large Scale Distantly Supervised Dataset for Open Domain Context-based Code Generation
R. Agashe
R. Campello
Arthur Zimek
28
82
0
05 Oct 2019
Program Synthesis and Semantic Parsing with Learned Code Idioms
Program Synthesis and Semantic Parsing with Learned Code Idioms
Richard Shin
Miltiadis Allamanis
Marc Brockschmidt
Oleksandr Polozov
11
87
0
26 Jun 2019
Effective Approaches to Attention-based Neural Machine Translation
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
218
7,923
0
17 Aug 2015
1