Evaluating Large Language Models Trained on Code

7 July 2021
Mark Chen
Jerry Tworek
Heewoo Jun
Qiming Yuan
Henrique Pondé
Jared Kaplan
Harrison Edwards
Yura Burda
Nicholas Joseph
Greg Brockman
Alex Ray
Raul Puri
Gretchen Krueger
Michael Petrov
Heidy Khlaaf
Girish Sastry
Pamela Mishkin
Brooke Chan
Scott Gray
Nick Ryder
Mikhail Pavlov
Alethea Power
Lukasz Kaiser
Mohammad Bavarian
Clemens Winter
Philippe Tillet
F. Such
D. Cummings
Matthias Plappert
Fotios Chantzis
Elizabeth Barnes
Ariel Herbert-Voss
William H. Guss
Alex Nichol
Alex Paino
Nikolas Tezak
Jie Tang
Igor Babuschkin
S. Balaji
Shantanu Jain
William Saunders
Christopher Hesse
A. Carr
Jan Leike
Joshua Achiam
Vedant Misra
Evan Morikawa
Alec Radford
Matthew Knight
Miles Brundage
Mira Murati
Katie Mayer
Peter Welinder
Bob McGrew
Dario Amodei
Sam McCandlish
Ilya Sutskever
Wojciech Zaremba
    ELM
    ALM

Papers citing "Evaluating Large Language Models Trained on Code"

50 / 927 papers shown
The Landscape and Challenges of HPC Research and LLMs
Le Chen
Nesreen K. Ahmed
Akashnil Dutta
Arijit Bhattacharjee
Sixing Yu
...
Vy A. Vo
J. P. Muñoz
Ted Willke
Tim Mattson
Ali Jannesari
AI4CE
48
20
0
03 Feb 2024
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models
M. Pternea
Prerna Singh
Abir Chakraborty
Y. Oruganti
M. Milletarí
Sayli Bapat
Kebei Jiang
OffRL
21
7
0
02 Feb 2024
LLM-based NLG Evaluation: Current Status and Challenges
Mingqi Gao
Xinyu Hu
Jie Ruan
Xiao Pu
Xiaojun Wan
ELM
LM&MA
57
29
0
02 Feb 2024
On the Challenges of Fuzzing Techniques via Large Language Models
Linghan Huang
Peizhou Zhao
Huaming Chen
Lei Ma
16
14
0
01 Feb 2024
Learning Agent-based Modeling with LLM Companions: Experiences of Novices and Experts Using ChatGPT & NetLogo Chat
John Chen
Xi Lu
Michael Rejtig
Yuzhou Du
Ruth Bagley
Mike Horn
Uri Wilensky
26
29
0
30 Jan 2024
Credit Risk Meets Large Language Models: Building a Risk Indicator from Loan Descriptions in P2P Lending
Mario Sanz-Guerrero
Javier Arroyo
28
4
0
29 Jan 2024
OMPGPT: A Generative Pre-trained Transformer Model for OpenMP
Le Chen
Arijit Bhattacharjee
Nesreen Ahmed
N. Hasabnis
Gal Oren
Vy A. Vo
Ali Jannesari
VLM
31
11
0
28 Jan 2024
EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty
Yuhui Li
Fangyun Wei
Chao Zhang
Hongyang R. Zhang
39
122
0
26 Jan 2024
FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design
Haojun Xia
Zhen Zheng
Xiaoxia Wu
Shiyang Chen
Zhewei Yao
...
Donglin Zhuang
Zhongzhu Zhou
Olatunji Ruwase
Yuxiong He
Shuaiwen Leon Song
MQ
33
14
0
25 Jan 2024
Demystifying Chains, Trees, and Graphs of Thoughts
Maciej Besta
Florim Memedi
Zhenyu Zhang
Robert Gerstenberger
Guangyuan Piao
...
Aleš Kubíček
H. Niewiadomski
Aidan O'Mahony
Onur Mutlu
Torsten Hoefler
AI4CE
LRM
75
27
0
25 Jan 2024
Knowledge Fusion of Large Language Models
Fanqi Wan
Xinting Huang
Deng Cai
Xiaojun Quan
Wei Bi
Shuming Shi
MoMe
34
61
0
19 Jan 2024
LangProp: A code optimization framework using Large Language Models applied to driving
Shu Ishida
Gianluca Corrado
George Fedoseev
Hudson Yeo
Lloyd Russell
Jamie Shotton
João F. Henriques
Anthony Hu
59
11
0
18 Jan 2024
DebugBench: Evaluating Debugging Capability of Large Language Models
Runchu Tian
Yining Ye
Yujia Qin
Xin Cong
Yankai Lin
...
Yesai Wu
Haotian Hui
Weichuan Liu
Zhiyuan Liu
Maosong Sun
ELM
35
28
0
09 Jan 2024
LLM Augmented LLMs: Expanding Capabilities through Composition
Rachit Bansal
Bidisha Samanta
Siddharth Dalmia
Nitish Gupta
Shikhar Vashishth
Sriram Ganapathy
Abhishek Bapna
Prateek Jain
Partha P. Talukdar
CLL
21
34
0
04 Jan 2024
Using LLM to select the right SQL Query from candidates
Zhenwen Li
Tao Xie
LLMAG
38
9
0
04 Jan 2024
KernelGPT: Enhanced Kernel Fuzzing via Large Language Models
Chenyuan Yang
Zijie Zhao
Lingming Zhang
25
13
0
31 Dec 2023
Deep Learning for Code Intelligence: Survey, Benchmark and Toolkit
Yao Wan
Yang He
Zhangqian Bi
Jianguo Zhang
Hongyu Zhang
Yulei Sui
Guandong Xu
Hai Jin
Philip S. Yu
35
20
0
30 Dec 2023
Open-TI: Open Traffic Intelligence with Augmented Language Model
Longchao Da
Kuanru Liou
Tiejin Chen
Xuesong Zhou
Xiangyong Luo
Yezhou Yang
Hua Wei
43
22
0
30 Dec 2023
A Prompt Learning Framework for Source Code Summarization
Dongrui Liu
Chunrong Fang
Yudu You
Yuchen Chen
Yi Liu
...
Quanjun Zhang
Hanwei Qian
Wei-Ye Zhao
Yang Liu
Zhenyu Chen
LLMAG
43
13
0
26 Dec 2023
IQAGPT: Image Quality Assessment with Vision-language and ChatGPT Models
Zhihao Chen
Bin Hu
Chuang Niu
Tao Chen
Yuxin Li
Hongming Shan
Ge Wang
LM&MA
MLLM
21
4
0
25 Dec 2023
Sparse is Enough in Fine-tuning Pre-trained Large Language Models
Weixi Song
Z. Li
Lefei Zhang
Hai Zhao
Bo Du
VLM
19
7
0
19 Dec 2023
Demystifying Instruction Mixing for Fine-tuning Large Language Models
Renxi Wang
Haonan Li
Minghao Wu
Yuxia Wang
Xudong Han
Chiyu Zhang
Timothy Baldwin
28
0
0
17 Dec 2023
One-Shot Learning as Instruction Data Prospector for Large Language Models
Yunshui Li
Binyuan Hui
Xiaobo Xia
Jiaxi Yang
Min Yang
...
Ling-Hao Chen
Junhao Liu
Tongliang Liu
Fei Huang
Yongbin Li
38
31
0
16 Dec 2023
LLM-SQL-Solver: Can LLMs Determine SQL Equivalence?
Fuheng Zhao
Lawrence Lim
Ishtiyaque Ahmad
D. Agrawal
Amr El Abbadi
65
9
0
16 Dec 2023
Holistic chemical evaluation reveals pitfalls in reaction prediction models
Victor Sabanza Gil
Andres M Bran
Malte Franke
Remi Schlama
J. Luterbacher
Philippe Schwaller
ELM
25
1
0
14 Dec 2023
Designing with Language: Wireframing UI Design Intent with Generative Large Language Models
Sidong Feng
Mingyue Yuan
Jieshan Chen
Zhenchang Xing
Chunyang Chen
AI4CE
3DV
19
7
0
12 Dec 2023
Exploring Large Language Models to Facilitate Variable Autonomy for Human-Robot Teaming
Younes Lakhnati
Max Pascher
Jens Gerken
LLMAG
LM&Ro
32
3
0
12 Dec 2023
Rethinking the Instruction Quality: LIFT is What You Need
Yang Xu
Yongqiang Yao
Yufan Huang
Mengnan Qi
Maoquan Wang
Bin Gu
Neel Sundaresan
ALM
19
35
0
12 Dec 2023
"I Want It That Way": Enabling Interactive Decision Support Using Large Language Models and Constraint Programming
Connor Lawless
Jakob Schoeffer
Lindy Le
Kael Rowan
Shilad Sen
Cristina St. Hill
Jina Suh
Bahar Sarrafzadeh
41
8
0
12 Dec 2023
Large Scale Foundation Models for Intelligent Manufacturing Applications: A Survey
Haotian Zhang
S. D. Semujju
Zhicheng Wang
Xianwei Lv
Kang Xu
...
Jing Wu
Zhuo Long
Wensheng Liang
Xiaoguang Ma
Ruiyan Zhuang
UQCV
AI4TS
AI4CE
29
4
0
11 Dec 2023
SmoothQuant+: Accurate and Efficient 4-bit Post-Training WeightQuantization for LLM
Jiayi Pan
Chengcan Wang
Kaifu Zheng
Yangguang Li
Zhenyu Wang
Bin Feng
MQ
35
7
0
06 Dec 2023
Inherent limitations of LLMs regarding spatial information
He Yan
Xinyao Hu
Xiangpeng Wan
Chengyu Huang
Kai Zou
Shiqi Xu
LRM
28
2
0
05 Dec 2023
Large Language Models Are Zero-Shot Text Classifiers
Zhiqiang Wang
Yiran Pang
Yanbin Lin
13
29
0
02 Dec 2023
ArcMMLU: A Library and Information Science Benchmark for Large Language Models
Shitou Zhang
Zuchao Li
Xingshen Liu
Liming Yang
Ping Wang
ELM
11
0
0
30 Nov 2023
Universal Self-Consistency for Large Language Model Generation
Xinyun Chen
Renat Aksitov
Uri Alon
Jie Jessie Ren
Kefan Xiao
Pengcheng Yin
Sushant Prakash
Charles Sutton
Xuezhi Wang
Denny Zhou
LRM
26
66
0
29 Nov 2023
Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding
Zhihao Yuan
Jinke Ren
Chun-Mei Feng
Hengshuang Zhao
Shuguang Cui
Zhen Li
34
26
0
26 Nov 2023
PaSS: Parallel Speculative Sampling
Giovanni Monea
Armand Joulin
Edouard Grave
MoE
16
31
0
22 Nov 2023
Transfer Attacks and Defenses for Large Language Models on Coding Tasks
Chi Zhang
Zifan Wang
Ravi Mangal
Matt Fredrikson
Limin Jia
Corina S. Pasareanu
AAML
SILM
27
1
0
22 Nov 2023
Compositional Capabilities of Autoregressive Transformers: A Study on Synthetic, Interpretable Tasks
Rahul Ramesh
Ekdeep Singh Lubana
Mikail Khona
Robert P. Dick
Hidenori Tanaka
CoGe
33
6
0
21 Nov 2023
A Safer Vision-based Autonomous Planning System for Quadrotor UAVs with Dynamic Obstacle Trajectory Prediction and Its Application with LLMs
J. Zhong
Ming Li
Yinliang Chen
Zihang Wei
Fan Yang
Haoran Shen
27
14
0
21 Nov 2023
AtomXR: Streamlined XR Prototyping with Natural Language and Immersive Physical Interaction
Alice Cai
Caine Ardayfio
AnhPhu Nguyen
Tica Lin
Elena L. Glassman
15
1
0
19 Nov 2023
Can We Utilize Pre-trained Language Models within Causal Discovery Algorithms?
Chanhui Lee
Juhyeon Kim
Yongjun Jeong
Juhyun Lyu
Junghee Kim
...
Hyeokjun Choe
Soyeon Park
Woohyung Lim
Sungbin Lim
Snu Astronomy Research Center
20
0
0
19 Nov 2023
CAMRA: Copilot for AMR Annotation
Jon Z. Cai
Shafiuddin Rehan Ahmed
Julia Bonn
Kristin Wright-Bettner
Martha Palmer
James H. Martin
VLM
9
0
0
18 Nov 2023
MELA: Multilingual Evaluation of Linguistic Acceptability
Ziyin Zhang
Yikang Liu
Wei Huang
Junyu Mao
Rui Wang
Hai Hu
22
3
0
15 Nov 2023
Routing to the Expert: Efficient Reward-guided Ensemble of Large Language Models
Keming Lu
Hongyi Yuan
Runji Lin
Junyang Lin
Zheng Yuan
Chang Zhou
Jingren Zhou
MoE
LRM
40
52
0
15 Nov 2023
Finding Inductive Loop Invariants using Large Language Models
Adharsh Kamath
Aditya Senthilnathan
Saikat Chakraborty
Pantazis Deligiannis
Shuvendu K. Lahiri
Akash Lal
Aseem Rastogi
Subhajit Roy
Rahul Sharma
14
20
0
14 Nov 2023
Bring Your Own KG: Self-Supervised Program Synthesis for Zero-Shot KGQA
Dhruv Agarwal
Rajarshi Das
Sopan Khosla
Rashmi Gangadharaiah
OffRL
18
7
0
14 Nov 2023
Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations
Zilu Tang
Mayank Agarwal
Alex Shypula
Bailin Wang
Derry Wijaya
Jie Chen
Yoon Kim
LRM
37
15
0
13 Nov 2023
CompCodeVet: A Compiler-guided Validation and Enhancement Approach for Code Dataset
Le Chen
Arijit Bhattacharjee
Nesreen K. Ahmed
N. Hasabnis
Gal Oren
Bin Lei
Ali Jannesari
LRM
29
3
0
11 Nov 2023
AI-native Interconnect Framework for Integration of Large Language Model Technologies in 6G Systems
Sasu Tarkoma
Roberto Morabito
Jaakko Sauvola
23
19
0
10 Nov 2023