ResearchTrend.AI
Evaluating Large Language Models Trained on Code

7 July 2021
Mark Chen
Jerry Tworek
Heewoo Jun
Qiming Yuan
Henrique Pondé
Jared Kaplan
Harrison Edwards
Yura Burda
Nicholas Joseph
Greg Brockman
Alex Ray
Raul Puri
Gretchen Krueger
Michael Petrov
Heidy Khlaaf
Girish Sastry
Pamela Mishkin
Brooke Chan
Scott Gray
Nick Ryder
Mikhail Pavlov
Alethea Power
Lukasz Kaiser
Mohammad Bavarian
Clemens Winter
Philippe Tillet
F. Such
D. Cummings
Matthias Plappert
Fotios Chantzis
Elizabeth Barnes
Ariel Herbert-Voss
William H. Guss
Alex Nichol
Alex Paino
Nikolas Tezak
Jie Tang
Igor Babuschkin
S. Balaji
Shantanu Jain
William Saunders
Christopher Hesse
A. Carr
Jan Leike
Joshua Achiam
Vedant Misra
Evan Morikawa
Alec Radford
Matthew Knight
Miles Brundage
Mira Murati
Katie Mayer
Peter Welinder
Bob McGrew
Dario Amodei
Sam McCandlish
Ilya Sutskever
Wojciech Zaremba
    ELM
    ALM

Papers citing "Evaluating Large Language Models Trained on Code"

50 / 856 papers shown
Learning to Learn with Generative Models of Neural Network Checkpoints
William S. Peebles
Ilija Radosavovic
Tim Brooks
Alexei A. Efros
Jitendra Malik
UQCV
73
64
0
26 Sep 2022
NL2INTERFACE: Interactive Visualization Interface Generation from Natural Language Queries
Yiru Chen
Ryan Li
Austin Mac
Tianbao Xie
Tao Yu
Eugene Wu
36
13
0
19 Sep 2022
Malicious Source Code Detection Using Transformer
Chen Tsfaty
Michael Fire
29
4
0
16 Sep 2022
Exploring Code Style Transfer with Neural Networks
Karl Munson
Anish Savla
Chih-Kai Ting
Serenity Wade
Kiran Kate
Kavitha Srinivas
CLIP
16
0
0
13 Sep 2022
Don't Complete It! Preventing Unhelpful Code Completion for Productive and Sustainable Neural Code Completion Systems
Zhensu Sun
Xiaoning Du
Fu Song
Shangwen Wang
Mingze Ni
Li Li
21
10
0
13 Sep 2022
AudioLM: a Language Modeling Approach to Audio Generation
Zalan Borsos
Raphaël Marinier
Damien Vincent
Eugene Kharitonov
Olivier Pietquin
...
Dominik Roblek
O. Teboul
David Grangier
Marco Tagliasacchi
Neil Zeghidour
AuLLM
28
566
0
07 Sep 2022
Efficient Methods for Natural Language Processing: A Survey
Marcos Vinícius Treviso
Ji-Ung Lee
Tianchu Ji
Betty van Aken
Qingqing Cao
...
Emma Strubell
Niranjan Balasubramanian
Leon Derczynski
Iryna Gurevych
Roy Schwartz
28
109
0
31 Aug 2022
Lost at C: A User Study on the Security Implications of Large Language Model Code Assistants
Gustavo Sandoval
Hammond Pearce
Teo Nys
Ramesh Karri
S. Garg
Brendan Dolan-Gavitt
ELM
22
90
0
20 Aug 2022
A Survey on Open Information Extraction from Rule-based Model to Large Language Model
Pai Liu
Wenya Gao
Wenjie Dong
Lin Ai
Wen Dong
Songfang Huang
Zongsheng Li
Ehsan Hoque
Julia Hirschberg
Yue Zhang
32
2
0
18 Aug 2022
MultiPL-E: A Scalable and Extensible Approach to Benchmarking Neural Code Generation
Federico Cassano
John Gouwar
Daniel Nguyen
S. Nguyen
Luna Phipps-Costin
...
Carolyn Jane Anderson
Molly Q. Feldman
Arjun Guha
Michael Greenberg
Abhinav Jangda
ELM
22
81
0
17 Aug 2022
Interactive and Visual Prompt Engineering for Ad-hoc Task Adaptation with Large Language Models
Hendrik Strobelt
Albert Webson
Victor Sanh
Benjamin Hoover
Johanna Beyer
Hanspeter Pfister
Alexander M. Rush
VLM
25
135
0
16 Aug 2022
Interactive Code Generation via Test-Driven User-Intent Formalization
Shuvendu K. Lahiri
Sarah Fakhoury
Aaditya Naik
Georgios Sakkas
Saikat Chakraborty
...
Piali Choudhury
Curtis von Veh
J. Inala
Chenglong Wang
Jianfeng Gao
16
63
0
11 Aug 2022
Few-shot Adaptation Works with UnpredicTable Data
Jun Shern Chan
Michael Pieler
Jonathan Jao
Jérémy Scheurer
Ethan Perez
28
5
0
01 Aug 2022
Learning from flowsheets: A generative transformer model for autocompletion of flowsheets
Gabriel Vogel
Lukas Schulze Balhorn
Artur M. Schweidtmann
AI4CE
35
33
0
01 Aug 2022
A Hazard Analysis Framework for Code Synthesis Large Language Models
Heidy Khlaaf
Pamela Mishkin
Joshua Achiam
Gretchen Krueger
Miles Brundage
ELM
17
28
0
25 Jul 2022
Neurosymbolic Repair for Low-Code Formula Languages
Rohan Bavishi
Harshit Joshi
José Pablo Cambronero Sánchez
Anna Fariha
Sumit Gulwani
Vu Le
Ivan Radicek
A. Tiwari
16
13
0
24 Jul 2022
PanGu-Coder: Program Synthesis with Function-Level Language Modeling
Fenia Christopoulou
Gerasimos Lampouras
Milan Gritta
Guchun Zhang
Yinpeng Guo
...
Guangtai Liang
Jia Wei
Xin Jiang
Qianxiang Wang
Qun Liu
ELM
SyDa
ALM
36
74
0
22 Jul 2022
CodeT: Code Generation with Generated Tests
Bei Chen
Fengji Zhang
A. Nguyen
Daoguang Zan
Zeqi Lin
Jian-Guang Lou
Weizhu Chen
36
316
0
21 Jul 2022
Mimetic Models: Ethical Implications of AI that Acts Like You
Reid McIlroy-Young
Jon M. Kleinberg
S. Sen
Solon Barocas
Ashton Anderson
11
16
0
19 Jul 2022
Can large language models reason about medical questions?
Valentin Liévin
C. Hother
Andreas Geert Motzfeldt
Ole Winther
ELM
LM&MA
AI4MH
LRM
19
299
0
17 Jul 2022
Language Models (Mostly) Know What They Know
Saurav Kadavath
Tom Conerly
Amanda Askell
T. Henighan
Dawn Drain
...
Nicholas Joseph
Benjamin Mann
Sam McCandlish
C. Olah
Jared Kaplan
ELM
44
712
0
11 Jul 2022
Exploring Length Generalization in Large Language Models
Cem Anil
Yuhuai Wu
Anders Andreassen
Aitor Lewkowycz
Vedant Misra
V. Ramasesh
Ambrose Slone
Guy Gur-Ari
Ethan Dyer
Behnam Neyshabur
ReLM
LRM
30
158
0
11 Jul 2022
LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action
Dhruv Shah
B. Osinski
Brian Ichter
Sergey Levine
LM&Ro
144
436
0
10 Jul 2022
Few-shot training LLMs for project-specific code-summarization
Toufique Ahmed
Prem Devanbu
179
213
0
09 Jul 2022
CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning
Hung Le
Yue Wang
Akhilesh Deepak Gotmare
Silvio Savarese
S. Hoi
SyDa
ALM
126
237
0
05 Jul 2022
Rationale-Augmented Ensembles in Language Models
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Denny Zhou
ReLM
LRM
35
124
0
02 Jul 2022
GitHub Copilot AI pair programmer: Asset or Liability?
Arghavan Moradi Dakhel
Vahid Majdinasab
Amin Nikanjam
Foutse Khomh
Michel C. Desmarais
Zhen Ming (Jack) Jiang
26
331
0
30 Jun 2022
Solving Quantitative Reasoning Problems with Language Models
Aitor Lewkowycz
Anders Andreassen
David Dohan
Ethan Dyer
Henryk Michalewski
...
Theo Gutman-Solo
Yuhuai Wu
Behnam Neyshabur
Guy Gur-Ari
Vedant Misra
ReLM
ELM
LRM
56
739
0
29 Jun 2022
Using cognitive psychology to understand GPT-3
Marcel Binz
Eric Schulz
ELM
LLMAG
250
440
0
21 Jun 2022
Insights into Pre-training via Simpler Synthetic Tasks
Yuhuai Wu
Felix Li
Percy Liang
AIMat
24
20
0
21 Jun 2022
XLCoST: A Benchmark Dataset for Cross-lingual Code Intelligence
Ming Zhu
Aneesh Jain
Karthik Suresh
Roshan Ravindran
Sindhu Tipirneni
Chandan K. Reddy
27
69
0
16 Jun 2022
CERT: Continual Pre-Training on Sketches for Library-Oriented Code Generation
Daoguang Zan
Bei Chen
Dejian Yang
Zeqi Lin
Minsu Kim
Bei Guan
Yongji Wang
Weizhu Chen
Jian-Guang Lou
14
120
0
14 Jun 2022
X-Risk Analysis for AI Research
Dan Hendrycks
Mantas Mazeika
27
67
0
13 Jun 2022
Attention Flows for General Transformers
Niklas Metzger
Christopher Hahn
Julian Siber
Frederik Schmitt
Bernd Finkbeiner
34
0
0
30 May 2022
Towards Learning Universal Hyperparameter Optimizers with Transformers
Yutian Chen
Xingyou Song
Chansoo Lee
Z. Wang
Qiuyi Zhang
...
Greg Kochanski
Arnaud Doucet
Marc'Aurelio Ranzato
Sagi Perel
Nando de Freitas
24
63
0
26 May 2022
Non-Programmers Can Label Programs Indirectly via Active Examples: A Case Study with Text-to-SQL
Ruiqi Zhong
Charles Burton Snell
Dan Klein
Jason Eisner
19
8
0
25 May 2022
Few-Shot Natural Language Inference Generation with PDD: Prompt and Dynamic Demonstration
Kaijian Li
Shansan Gong
Kenny Q. Zhu
19
0
0
21 May 2022
KERPLE: Kernelized Relative Positional Embedding for Length Extrapolation
Ta-Chung Chi
Ting-Han Fan
Peter J. Ramadge
Alexander I. Rudnicky
39
65
0
20 May 2022
A Precis of Language Models are not Models of Language
Csaba Veres
32
3
0
16 May 2022
A Generalist Agent
Scott E. Reed
Konrad Zolna
Emilio Parisotto
Sergio Gomez Colmenarejo
Alexander Novikov
...
Yutian Chen
R. Hadsell
Oriol Vinyals
Mahyar Bordbar
Nando de Freitas
LM&Ro
LLMAG
AI4CE
54
783
0
12 May 2022
Structured, flexible, and robust: benchmarking and improving large language models towards more human-like behavior in out-of-distribution reasoning tasks
K. M. Collins
Catherine Wong
Jiahai Feng
Megan Wei
J. Tenenbaum
LRM
20
56
0
11 May 2022
GitRank: A Framework to Rank GitHub Repositories
N. Hasabnis
13
3
0
04 May 2022
Learning to Parallelize in a Shared-Memory Environment with Transformers
Re'em Harel
Yuval Pinter
Gal Oren
45
17
0
27 Apr 2022
Natural Language to Code Translation with Execution
Freda Shi
Daniel Fried
Marjan Ghazvininejad
Luke Zettlemoyer
Sida I. Wang
30
123
0
25 Apr 2022
A Taxonomy of Prompt Modifiers for Text-To-Image Generation
J. Oppenlaender
15
102
0
20 Apr 2022
Rows from Many Sources: Enriching row completions from Wikidata with a pre-trained Language Model
Carina Negreanu
Alperen Karaoglu
Jack Williams
Shuang Chen
Daniel Fabian
Andrew D. Gordon
Chin-Yew Lin
RALM
AIMat
LMTD
19
2
0
14 Apr 2022
GPT-NeoX-20B: An Open-Source Autoregressive Language Model
Sid Black
Stella Biderman
Eric Hallahan
Quentin G. Anthony
Leo Gao
...
Shivanshu Purohit
Laria Reynolds
J. Tow
Benqi Wang
Samuel Weinbach
61
800
0
14 Apr 2022
InCoder: A Generative Model for Code Infilling and Synthesis
Daniel Fried
Armen Aghajanyan
Jessy Lin
Sida I. Wang
Eric Wallace
Freda Shi
Ruiqi Zhong
Wen-tau Yih
Luke Zettlemoyer
M. Lewis
SyDa
22
625
0
12 Apr 2022
Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
Yuntao Bai
Andy Jones
Kamal Ndousse
Amanda Askell
Anna Chen
...
Jack Clark
Sam McCandlish
C. Olah
Benjamin Mann
Jared Kaplan
69
2,308
0
12 Apr 2022
Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language
Andy Zeng
Maria Attarian
Brian Ichter
K. Choromanski
Adrian S. Wong
...
Michael S. Ryoo
Vikas Sindhwani
Johnny Lee
Vincent Vanhoucke
Peter R. Florence
ReLM
LRM
13
571
0
01 Apr 2022