Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2006.03511
Cited By
Unsupervised Translation of Programming Languages
5 June 2020
Marie-Anne Lachaux
Baptiste Roziere
L. Chanussot
Guillaume Lample
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Unsupervised Translation of Programming Languages"
47 / 47 papers shown
Title
QiMeng-Xpiler: Transcompiling Tensor Programs for Deep Learning Systems with a Neural-Symbolic Approach
Shouyang Dong
Yuanbo Wen
Jun Bi
Di Huang
Jiaming Guo
...
Yifan Hao
Xuehai Zhou
Tianshi Chen
Qi Guo
Yunji Chen
22
0
0
04 May 2025
CSR-Bench: Benchmarking LLM Agents in Deployment of Computer Science Research Repositories
Yijia Xiao
Runhui Wang
Luyang Kong
Davor Golac
Wei Wang
LLMAG
109
0
0
10 Feb 2025
Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation
Xing Zhang
Jiaheng Wen
Fangkai Yang
Pu Zhao
Yu Kang
...
Qingwei Lin
Yingnong Dang
Saravan Rajmohan
Dongmei Zhang
Qi Zhang
53
2
0
28 Jan 2025
Revisiting Reliability in Large-Scale Machine Learning Research Clusters
Apostolos Kokolis
Michael Kuchnik
John Hoffman
Adithya Kumar
Parth Malani
Faye Ma
Zachary DeVito
S.
Kalyan Saladi
Carole-Jean Wu
107
7
0
29 Oct 2024
RepoGraph: Enhancing AI Software Engineering with Repository-level Code Graph
Siru Ouyang
W. Yu
Kaixin Ma
Zilin Xiao
Z. Zhang
Mengzhao Jia
J. Han
H. Zhang
Dong Yu
49
12
0
03 Oct 2024
How Does Code Pretraining Affect Language Model Task Performance?
Jackson Petty
Sjoerd van Steenkiste
Tal Linzen
60
8
0
06 Sep 2024
Agentless: Demystifying LLM-based Software Engineering Agents
Chunqiu Steven Xia
Yinlin Deng
Soren Dunn
Lingming Zhang
LLMAG
32
80
0
01 Jul 2024
Exploring and Evaluating Hallucinations in LLM-Powered Code Generation
Fang Liu
Yang Liu
Lin Shi
Houkun Huang
Ruifeng Wang
Zhen Yang
Li Zhang
Zhongqi Li
Yuchi Ma
46
107
0
01 Apr 2024
Deep Learning for Code Intelligence: Survey, Benchmark and Toolkit
Yao Wan
Yang He
Zhangqian Bi
Jianguo Zhang
Hongyu Zhang
Yulei Sui
Guandong Xu
Hai Jin
Philip S. Yu
25
20
0
30 Dec 2023
Can LLMs Patch Security Issues?
Kamel Alrashedy
Abdullah Aljasser
Pradyumna Tambwekar
Matthew Gombolay
AAML
16
6
0
13 Nov 2023
CompCodeVet: A Compiler-guided Validation and Enhancement Approach for Code Dataset
Le Chen
Arijit Bhattacharjee
Nesreen K. Ahmed
N. Hasabnis
Gal Oren
Bin Lei
Ali Jannesari
LRM
24
3
0
11 Nov 2023
Configuration Validation with Large Language Models
Xinyu Lian
Yinfang Chen
Runxiang Cheng
Jie Huang
Parth Thakkar
Minjia Zhang
Tianyin Xu
11
10
0
15 Oct 2023
Bias Testing and Mitigation in LLM-based Code Generation
Dong Huang
Qingwen Bu
Jie M. Zhang
Xiaofei Xie
Junjie Chen
Heming Cui
36
20
0
03 Sep 2023
CodeBPE: Investigating Subtokenization Options for Large Language Model Pretraining on Source Code
Nadezhda Chirkova
Sergey Troshin
21
8
0
01 Aug 2023
Multilingual Code Co-Evolution Using Large Language Models
Jiyang Zhang
Pengyu Nie
Junyi Jessy Li
Miloš Gligorić
19
20
0
27 Jul 2023
Exploring Continual Learning for Code Generation Models
Prateek Yadav
Q. Sun
Hantian Ding
Xiaopeng Li
Dejiao Zhang
...
Parminder Bhatia
Ramesh Nallapati
M. K. Ramanathan
Mohit Bansal
Bing Xiang
CLL
30
29
0
05 Jul 2023
Natural Language Generation and Understanding of Big Code for AI-Assisted Programming: A Review
M. Wong
Shangxin Guo
Ching Nam Hang
Siu-Wai Ho
C. Tan
33
78
0
04 Jul 2023
SelfEvolve: A Code Evolution Framework via Large Language Models
Shuyang Jiang
Yuhao Wang
Yu Wang
11
32
0
05 Jun 2023
Neural Machine Translation for Code Generation
K. Dharma
Clayton T. Morrison
30
4
0
22 May 2023
SLaDe: A Portable Small Language Model Decompiler for Optimized Assembly
Jordi Armengol-Estapé
Jackson Woodruff
Chris Cummins
Michael F. P. O'Boyle
35
15
0
21 May 2023
The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation
Dũng Nguyễn Mạnh
Nam Le Hai
An Dau
A. Nguyen
Khanh N. Nghiem
Jingnan Guo
Nghi D. Q. Bui
26
13
0
09 May 2023
A Study of Variable-Role-based Feature Enrichment in Neural Models of Code
Aftab Hussain
Md Rafiqul Islam Rabin
Bowen Xu
David Lo
Mohammad Amin Alipour
34
3
0
08 Mar 2023
ADELT: Transpilation Between Deep Learning Frameworks
Linyuan Gong
Jiayi Wang
Alvin Cheung
30
3
0
07 Mar 2023
On ML-Based Program Translation: Perils and Promises
Aniketh Malyala
K. Zhou
Baishakhi Ray
Saikat Chakraborty
24
5
0
21 Feb 2023
Exploring Data Augmentation for Code Generation Tasks
Pinzhen Chen
Gerasimos Lampouras
29
9
0
05 Feb 2023
What are the Machine Learning best practices reported by practitioners on Stack Exchange?
Anamaria Mojica-Hanke
A. Bayona
Mario Linares-Vásquez
Steffen Herbold
Fabio A. González
HAI
19
6
0
25 Jan 2023
A Survey on Natural Language Processing for Programming
Qingfu Zhu
Xianzhen Luo
Fang Liu
Cuiyun Gao
Wanxiang Che
23
1
0
12 Dec 2022
Execution-based Evaluation for Data Science Code Generation Models
Junjie Huang
Chenglong Wang
Jipeng Zhang
Cong Yan
Haotian Cui
J. Inala
Colin B. Clement
Nan Duan
Jianfeng Gao
ELM
25
34
0
17 Nov 2022
Revealing the Semantics of Data Wrangling Scripts With COMANTICS
Kai Xiong
Zhongsu Luo
Siwei Fu
Yongheng Wang
Mingliang Xu
Yingnian Wu
25
11
0
28 Sep 2022
Malicious Source Code Detection Using Transformer
Chen Tsfaty
Michael Fire
29
4
0
16 Sep 2022
PanGu-Coder: Program Synthesis with Function-Level Language Modeling
Fenia Christopoulou
Gerasimos Lampouras
Milan Gritta
Guchun Zhang
Yinpeng Guo
...
Guangtai Liang
Jia Wei
Xin Jiang
Qianxiang Wang
Qun Liu
ELM
SyDa
ALM
27
74
0
22 Jul 2022
StructCoder: Structure-Aware Transformer for Code Generation
Sindhu Tipirneni
Ming Zhu
Chandan K. Reddy
28
55
0
10 Jun 2022
HyperTree Proof Search for Neural Theorem Proving
Guillaume Lample
Marie-Anne Lachaux
Thibaut Lavril
Xavier Martinet
Amaury Hayat
Gabriel Ebner
Aurelien Rodriguez
Timothée Lacroix
AIMat
23
131
0
23 May 2022
GitRank: A Framework to Rank GitHub Repositories
N. Hasabnis
8
3
0
04 May 2022
Probing Pretrained Models of Source Code
Sergey Troshin
Nadezhda Chirkova
ELM
25
38
0
16 Feb 2022
Better Together? An Evaluation of AI-Supported Code Translation
Justin D. Weisz
Michael J. Muller
Steven I. Ross
Fernando Martinez
Stephanie Houde
Mayank Agarwal
Kartik Talamadupula
John T. Richards
29
67
0
15 Feb 2022
Federated Data Science to Break Down Silos [Vision]
Essam Mansour
Kavitha Srinivas
K. Hose
FedML
AI4CE
17
8
0
25 Nov 2021
Text Classification for Task-based Source Code Related Questions
Sairamvinay Vijayaraghavan
Jinxiao Song
David A. Tomassi
Siddhartha Punj
Jailan Sabet
11
0
0
31 Oct 2021
CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation
Yue Wang
Weishi Wang
Shafiq R. Joty
S. Hoi
210
1,489
0
02 Sep 2021
AVATAR: A Parallel Corpus for Java-Python Program Translation
W. Ahmad
Md Golam Rahman Tushar
Saikat Chakraborty
Kai-Wei Chang
30
78
0
26 Aug 2021
What do pre-trained code models know about code?
Anjan Karmakar
Romain Robbes
ELM
16
86
0
25 Aug 2021
DeepMutants: Training neural bug detectors with contextual mutations
Cedric Richter
Heike Wehrheim
8
3
0
14 Jul 2021
How could Neural Networks understand Programs?
Dinglan Peng
Shuxin Zheng
Yatao Li
Guolin Ke
Di He
Tie-Yan Liu
NAI
13
61
0
10 May 2021
DOBF: A Deobfuscation Pre-Training Objective for Programming Languages
Baptiste Roziere
Marie-Anne Lachaux
Marc Szafraniec
Guillaume Lample
AI4CE
44
136
0
15 Feb 2021
Neural Software Analysis
Michael Pradel
S. Chandra
NAI
21
31
0
16 Nov 2020
Automated Source Code Generation and Auto-completion Using Deep Learning: Comparing and Discussing Current Language-Model-Related Approaches
Juan Cruz-Benito
Sanjay Vishwakarma
Francisco Martín-Fernández
Ismael Faro Ibm Quantum
22
30
0
16 Sep 2020
Word Translation Without Parallel Data
Alexis Conneau
Guillaume Lample
MarcÁurelio Ranzato
Ludovic Denoyer
Hervé Jégou
165
1,634
0
11 Oct 2017
1