Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2310.06680
Cited By
Benchmarking and Explaining Large Language Model-based Code Generation: A Causality-Centric Approach
10 October 2023
Zhenlan Ji
Pingchuan Ma
Zongjie Li
Shuai Wang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Benchmarking and Explaining Large Language Model-based Code Generation: A Causality-Centric Approach"
15 / 15 papers shown
Title
Directed Greybox Fuzzing via Large Language Model
HanXiang Xu
Yanjie Zhao
Haoyu Wang
32
0
0
06 May 2025
Testing and Understanding Erroneous Planning in LLM Agents through Synthesized User Inputs
Zhenlan Ji
Daoyuan Wu
Pingchuan Ma
Zongjie Li
Shuai Wang
LLMAG
40
3
0
27 Apr 2024
Cause and Effect: Can Large Language Models Truly Understand Causality?
Swagata Ashwani
Kshiteesh Hegde
Nishith Reddy Mannuru
Mayank Jindal
Dushyant Singh Sengar
Krishna Chaitanya Rao Kathala
Dishant Banga
Vinija Jain
Aman Chadha
LRM
30
17
0
28 Feb 2024
Emerging Synergies in Causality and Deep Generative Models: A Survey
Guanglin Zhou
Shaoan Xie
Guang-Yuan Hao
Shiming Chen
Biwei Huang
Xiwei Xu
Chen Wang
Liming Zhu
Lina Yao
Kun Zhang
AI4CE
39
11
0
29 Jan 2023
Uncertainty Quantification with Pre-trained Language Models: A Large-Scale Empirical Analysis
Yuxin Xiao
Paul Pu Liang
Umang Bhatt
W. Neiswanger
Ruslan Salakhutdinov
Louis-Philippe Morency
170
86
0
10 Oct 2022
Membership Inference Attacks and Generalization: A Causal Perspective
Teodora Baluta
Shiqi Shen
S. Hitarth
Shruti Tople
Prateek Saxena
OOD
MIACV
23
18
0
18 Sep 2022
No More Fine-Tuning? An Experimental Evaluation of Prompt Tuning in Code Intelligence
Chaozheng Wang
Yuanhang Yang
Cuiyun Gao
Yun Peng
Hongyu Zhang
Michael R. Lyu
AAML
49
129
0
24 Jul 2022
CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning
Hung Le
Yue Wang
Akhilesh Deepak Gotmare
Silvio Savarese
S. Hoi
SyDa
ALM
124
232
0
05 Jul 2022
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLM
BDL
LRM
AI4CE
297
3,163
0
21 Mar 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
315
8,261
0
28 Jan 2022
Multitask Prompted Training Enables Zero-Shot Task Generalization
Victor Sanh
Albert Webson
Colin Raffel
Stephen H. Bach
Lintang Sutawika
...
T. Bers
Stella Biderman
Leo Gao
Thomas Wolf
Alexander M. Rush
LRM
203
1,651
0
15 Oct 2021
CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation
Yue Wang
Weishi Wang
Shafiq R. Joty
S. Hoi
204
1,451
0
02 Sep 2021
Measuring Coding Challenge Competence With APPS
Dan Hendrycks
Steven Basart
Saurav Kadavath
Mantas Mazeika
Akul Arora
...
Collin Burns
Samir Puranik
Horace He
D. Song
Jacob Steinhardt
ELM
AIMat
ALM
194
614
0
20 May 2021
CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation
Shuai Lu
Daya Guo
Shuo Ren
Junjie Huang
Alexey Svyatkovskiy
...
Nan Duan
Neel Sundaresan
Shao Kun Deng
Shengyu Fu
Shujie Liu
ELM
190
853
0
09 Feb 2021
Fine-Tuning Language Models from Human Preferences
Daniel M. Ziegler
Nisan Stiennon
Jeff Wu
Tom B. Brown
Alec Radford
Dario Amodei
Paul Christiano
G. Irving
ALM
275
1,561
0
18 Sep 2019
1