ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2107.03374
  4. Cited By
Evaluating Large Language Models Trained on Code

Evaluating Large Language Models Trained on Code

7 July 2021
Mark Chen
Jerry Tworek
Heewoo Jun
Qiming Yuan
Henrique Pondé
Jared Kaplan
Harrison Edwards
Yura Burda
Nicholas Joseph
Greg Brockman
Alex Ray
Raul Puri
Gretchen Krueger
Michael Petrov
Heidy Khlaaf
Girish Sastry
Pamela Mishkin
Brooke Chan
Scott Gray
Nick Ryder
Mikhail Pavlov
Alethea Power
Lukasz Kaiser
Mohammad Bavarian
Clemens Winter
Philippe Tillet
F. Such
D. Cummings
Matthias Plappert
Fotios Chantzis
Elizabeth Barnes
Ariel Herbert-Voss
William H. Guss
Alex Nichol
Alex Paino
Nikolas Tezak
Jie Tang
Igor Babuschkin
S. Balaji
Shantanu Jain
William Saunders
Christopher Hesse
A. Carr
Jan Leike
Joshua Achiam
Vedant Misra
Evan Morikawa
Alec Radford
Matthew Knight
Miles Brundage
Mira Murati
Katie Mayer
Peter Welinder
Bob McGrew
Dario Amodei
Sam McCandlish
Ilya Sutskever
Wojciech Zaremba
    ELM
    ALM
ArXivPDFHTML

Papers citing "Evaluating Large Language Models Trained on Code"

50 / 856 papers shown
Title
SELFIES and the future of molecular string representations
SELFIES and the future of molecular string representations
Mario Krenn
Qianxiang Ai
Senja Barthel
Nessa Carson
Angelo Frei
...
Andrew Wang
Andrew D. White
A. Young
Rose Yu
A. Aspuru‐Guzik
32
147
0
31 Mar 2022
CodeGen: An Open Large Language Model for Code with Multi-Turn Program
  Synthesis
CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis
Erik Nijkamp
Bo Pang
Hiroaki Hayashi
Lifu Tu
Haiquan Wang
Yingbo Zhou
Silvio Savarese
Caiming Xiong
ELM
54
968
0
25 Mar 2022
Linearizing Transformer with Key-Value Memory
Linearizing Transformer with Key-Value Memory
Yizhe Zhang
Deng Cai
20
5
0
23 Mar 2022
Automating Code Review Activities by Large-Scale Pre-training
Automating Code Review Activities by Large-Scale Pre-training
Zhiyu Li
Shuai Lu
Daya Guo
Nan Duan
Shailesh Jannu
...
Deep Majumder
Jared Green
Alexey Svyatkovskiy
Shengyu Fu
Neel Sundaresan
VLM
20
139
0
17 Mar 2022
Memorizing Transformers
Memorizing Transformers
Yuhuai Wu
M. Rabe
DeLesley S. Hutchins
Christian Szegedy
RALM
16
171
0
16 Mar 2022
Less is More: Summary of Long Instructions is Better for Program
  Synthesis
Less is More: Summary of Long Instructions is Better for Program Synthesis
Kirby Kuznia
Swaroop Mishra
Mihir Parmar
Chitta Baral
AIMat
28
22
0
16 Mar 2022
MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languages
MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languages
Zhiruo Wang
Grace Cuenca
Shuyan Zhou
Frank F. Xu
Graham Neubig
19
49
0
16 Mar 2022
Evaluating the Text-to-SQL Capabilities of Large Language Models
Evaluating the Text-to-SQL Capabilities of Large Language Models
Nitarshan Rajkumar
Raymond Li
Dzmitry Bahdanau
LMTD
ELM
21
98
0
15 Mar 2022
Contrastive Visual Semantic Pretraining Magnifies the Semantics of
  Natural Language Representations
Contrastive Visual Semantic Pretraining Magnifies the Semantics of Natural Language Representations
Robert Wolfe
Aylin Caliskan
VLM
21
13
0
14 Mar 2022
A Survey on Deep Graph Generation: Methods and Applications
A Survey on Deep Graph Generation: Methods and Applications
Yanqiao Zhu
Yuanqi Du
Yinkai Wang
Yichen Xu
Jieyu Zhang
Qiang Liu
Shu Wu
3DV
GNN
31
67
0
13 Mar 2022
Static Prediction of Runtime Errors by Learning to Execute Programs with
  External Resource Descriptions
Static Prediction of Runtime Errors by Learning to Execute Programs with External Resource Descriptions
David Bieber
Rishab Goel
Daniel Zheng
Hugo Larochelle
Daniel Tarlow
16
15
0
07 Mar 2022
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
311
11,915
0
04 Mar 2022
From Natural Language to Simulations: Applying GPT-3 Codex to Automate
  Simulation Modeling of Logistics Systems
From Natural Language to Simulations: Applying GPT-3 Codex to Automate Simulation Modeling of Logistics Systems
I. Jackson
M. J. Sáenz
16
8
0
24 Feb 2022
COLD Decoding: Energy-based Constrained Text Generation with Langevin
  Dynamics
COLD Decoding: Energy-based Constrained Text Generation with Langevin Dynamics
Lianhui Qin
Sean Welleck
Daniel Khashabi
Yejin Choi
AI4CE
44
144
0
23 Feb 2022
GPT-based Open-Ended Knowledge Tracing
GPT-based Open-Ended Knowledge Tracing
Naiming Liu
Zichao Wang
Richard G. Baraniuk
Andrew S. Lan
AI4Ed
27
3
0
21 Feb 2022
Probing Pretrained Models of Source Code
Probing Pretrained Models of Source Code
Sergey Troshin
Nadezhda Chirkova
ELM
27
38
0
16 Feb 2022
Better Together? An Evaluation of AI-Supported Code Translation
Better Together? An Evaluation of AI-Supported Code Translation
Justin D. Weisz
Michael J. Muller
Steven I. Ross
Fernando Martinez
Stephanie Houde
Mayank Agarwal
Kartik Talamadupula
John T. Richards
29
67
0
15 Feb 2022
A Survey on Artificial Intelligence for Source Code: A Dialogue Systems
  Perspective
A Survey on Artificial Intelligence for Source Code: A Dialogue Systems Perspective
Erfan Al-Hossami
Samira Shaikh
26
6
0
10 Feb 2022
Competition-Level Code Generation with AlphaCode
Competition-Level Code Generation with AlphaCode
Yujia Li
David Choi
Junyoung Chung
Nate Kushman
Julian Schrittwieser
...
Esme Sutherland Robson
Pushmeet Kohli
Nando de
Koray Kavukcuoglu
Oriol Vinyals
19
1,292
0
08 Feb 2022
Exploring Transformer Backbones for Heterogeneous Treatment Effect
  Estimation
Exploring Transformer Backbones for Heterogeneous Treatment Effect Estimation
Yi-Fan Zhang
Hanlin Zhang
Zachary Chase Lipton
Li Erran Li
Eric P. Xing
OODD
24
29
0
02 Feb 2022
DeepRNG: Towards Deep Reinforcement Learning-Assisted Generative Testing
  of Software
DeepRNG: Towards Deep Reinforcement Learning-Assisted Generative Testing of Software
Chuan-Yung Tsai
Graham W. Taylor
6
2
0
29 Jan 2022
Semantic Code Classification for Automated Machine Learning
Semantic Code Classification for Automated Machine Learning
P. Guseva
Anastasia Drozdova
N. Denisenko
Daria Sapozhnikova
Ivan Pyaternev
Anna Scherbakova
A.E. Ustuzhanin
21
0
0
25 Jan 2022
Text and Code Embeddings by Contrastive Pre-Training
Text and Code Embeddings by Contrastive Pre-Training
Arvind Neelakantan
Tao Xu
Raul Puri
Alec Radford
Jesse Michael Han
...
Tabarak Khan
Toki Sherbakov
Joanne Jang
Peter Welinder
Lilian Weng
SSL
AI4TS
218
421
0
24 Jan 2022
Unveiling Project-Specific Bias in Neural Code Models
Unveiling Project-Specific Bias in Neural Code Models
Zhiming Li
Yanzhou Li
Tianlin Li
Mengnan Du
Bozhi Wu
Yushi Cao
Yi Li
Yang Liu
25
5
0
19 Jan 2022
Language Models as Zero-Shot Planners: Extracting Actionable Knowledge
  for Embodied Agents
Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents
Wenlong Huang
Pieter Abbeel
Deepak Pathak
Igor Mordatch
LM&Ro
26
1,053
0
18 Jan 2022
UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding
  with Text-to-Text Language Models
UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models
Tianbao Xie
Chen Henry Wu
Peng Shi
Ruiqi Zhong
Torsten Scholak
...
Lingpeng Kong
Rui Zhang
Noah A. Smith
Luke Zettlemoyer
Tao Yu
LMTD
26
296
0
16 Jan 2022
Assemble Foundation Models for Automatic Code Summarization
Assemble Foundation Models for Automatic Code Summarization
Jian Gu
P. Salza
H. Gall
28
34
0
13 Jan 2022
User-Driven Support for Visualization Prototyping in D3
User-Driven Support for Visualization Prototyping in D3
Hannah K. Bako
Alisha Varma
Anuoluwapo Faboro
Mahreen Haider
Favour Nerrise
B. Kenah
John P. Dickerson
Leilani Battle
38
5
0
06 Dec 2021
Show Your Work: Scratchpads for Intermediate Computation with Language
  Models
Show Your Work: Scratchpads for Intermediate Computation with Language Models
Maxwell Nye
Anders Andreassen
Guy Gur-Ari
Henryk Michalewski
Jacob Austin
...
Aitor Lewkowycz
Maarten Bosma
D. Luan
Charles Sutton
Augustus Odena
ReLM
LRM
57
697
0
30 Nov 2021
How much do language models copy from their training data? Evaluating
  linguistic novelty in text generation using RAVEN
How much do language models copy from their training data? Evaluating linguistic novelty in text generation using RAVEN
R. Thomas McCoy
P. Smolensky
Tal Linzen
Jianfeng Gao
Asli Celikyilmaz
SyDa
17
119
0
18 Nov 2021
Solving Linear Algebra by Program Synthesis
Solving Linear Algebra by Program Synthesis
Iddo Drori
Nakul Verma
13
21
0
16 Nov 2021
Recent Advances in Natural Language Processing via Large Pre-Trained
  Language Models: A Survey
Recent Advances in Natural Language Processing via Large Pre-Trained Language Models: A Survey
Bonan Min
Hayley L Ross
Elior Sulem
Amir Pouran Ben Veyseh
Thien Huu Nguyen
Oscar Sainz
Eneko Agirre
Ilana Heinz
Dan Roth
LM&MA
VLM
AI4CE
71
1,029
0
01 Nov 2021
The 5th Recognizing Families in the Wild Data Challenge: Predicting
  Kinship from Faces
The 5th Recognizing Families in the Wild Data Challenge: Predicting Kinship from Faces
Joseph P. Robinson
Can Qin
Ming Shao
Matthew A. Turk
Rama Chellappa
Y. Fu
CVBM
28
6
0
31 Oct 2021
Neural Program Generation Modulo Static Analysis
Neural Program Generation Modulo Static Analysis
Rohan Mukherjee
Yeming Wen
Dipak Chaudhari
Thomas W. Reps
Swarat Chaudhuri
C. Jermaine
28
24
0
26 Oct 2021
Automated Support for Unit Test Generation: A Tutorial Book Chapter
Automated Support for Unit Test Generation: A Tutorial Book Chapter
Afonso Fontes
Gregory Gay
F. D. O. Neto
R. Feldt
14
3
0
26 Oct 2021
Inductive Biases and Variable Creation in Self-Attention Mechanisms
Inductive Biases and Variable Creation in Self-Attention Mechanisms
Benjamin L. Edelman
Surbhi Goel
Sham Kakade
Cyril Zhang
27
115
0
19 Oct 2021
Coherence boosting: When your pretrained language model is not paying
  enough attention
Coherence boosting: When your pretrained language model is not paying enough attention
Nikolay Malkin
Zhen Wang
Nebojsa Jojic
RALM
19
35
0
15 Oct 2021
Cascaded Fast and Slow Models for Efficient Semantic Code Search
Cascaded Fast and Slow Models for Efficient Semantic Code Search
Akhilesh Deepak Gotmare
Junnan Li
Shafiq R. Joty
S. Hoi
22
10
0
15 Oct 2021
Capturing Structural Locality in Non-parametric Language Models
Capturing Structural Locality in Non-parametric Language Models
Frank F. Xu
Junxian He
Graham Neubig
Vincent J. Hellendoorn
19
14
0
06 Oct 2021
Finetuned Language Models Are Zero-Shot Learners
Finetuned Language Models Are Zero-Shot Learners
Jason W. Wei
Maarten Bosma
Vincent Zhao
Kelvin Guu
Adams Wei Yu
Brian Lester
Nan Du
Andrew M. Dai
Quoc V. Le
ALM
UQCV
33
3,561
0
03 Sep 2021
CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for
  Code Understanding and Generation
CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation
Yue Wang
Weishi Wang
Shafiq R. Joty
S. Hoi
235
1,489
0
02 Sep 2021
Learning to Synthesize Programs as Interpretable and Generalizable
  Policies
Learning to Synthesize Programs as Interpretable and Generalizable Policies
Dweep Trivedi
Jesse Zhang
Shao-Hua Sun
Joseph J. Lim
NAI
9
71
0
31 Aug 2021
What do pre-trained code models know about code?
What do pre-trained code models know about code?
Anjan Karmakar
Romain Robbes
ELM
24
87
0
25 Aug 2021
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods
  in Natural Language Processing
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing
Pengfei Liu
Weizhe Yuan
Jinlan Fu
Zhengbao Jiang
Hiroaki Hayashi
Graham Neubig
VLM
SyDa
28
3,828
0
28 Jul 2021
Latent Execution for Neural Program Synthesis
Latent Execution for Neural Program Synthesis
Xinyun Chen
D. Song
Yuandong Tian
NAI
21
52
0
29 Jun 2021
Learning to Complete Code with Sketches
Learning to Complete Code with Sketches
Daya Guo
Alexey Svyatkovskiy
Jian Yin
Nan Duan
Marc Brockschmidt
Miltiadis Allamanis
13
40
0
18 Jun 2021
Programming Puzzles
Programming Puzzles
Tal Schuster
A. Kalyan
Oleksandr Polozov
Adam Tauman Kalai
ELM
15
32
0
10 Jun 2021
Measuring Coding Challenge Competence With APPS
Measuring Coding Challenge Competence With APPS
Dan Hendrycks
Steven Basart
Saurav Kadavath
Mantas Mazeika
Akul Arora
...
Collin Burns
Samir Puranik
Horace He
D. Song
Jacob Steinhardt
ELM
AIMat
ALM
196
624
0
20 May 2021
Carbon Emissions and Large Neural Network Training
Carbon Emissions and Large Neural Network Training
David A. Patterson
Joseph E. Gonzalez
Quoc V. Le
Chen Liang
Lluís-Miquel Munguía
D. Rothchild
David R. So
Maud Texier
J. Dean
AI4CE
241
643
0
21 Apr 2021
Measuring Mathematical Problem Solving With the MATH Dataset
Measuring Mathematical Problem Solving With the MATH Dataset
Dan Hendrycks
Collin Burns
Saurav Kadavath
Akul Arora
Steven Basart
Eric Tang
D. Song
Jacob Steinhardt
ReLM
FaML
57
1,804
0
05 Mar 2021
Previous
123...161718
Next