arXiv:2210.09261
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
17 October 2022
Mirac Suzgun
Nathan Scales
Nathanael Scharli
Sebastian Gehrmann
Yi Tay
Hyung Won Chung
Aakanksha Chowdhery
Quoc V. Le
Ed H. Chi
Denny Zhou
Jason W. Wei
ALM
ELM
LRM
ReLM
Papers citing
"Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them"
50 / 788 papers shown
Exchange-of-Thought: Enhancing Large Language Model Capabilities through Cross-Model Communication
Zhangyue Yin
Qiushi Sun
Cheng Chang
Qipeng Guo
Junqi Dai
Xuanjing Huang
Xipeng Qiu
LRM
48
46
0
04 Dec 2023
CLAMP: Contrastive LAnguage Model Prompt-tuning
Piotr Teterwak
Ximeng Sun
Bryan A. Plummer
Kate Saenko
Ser-Nam Lim
MLLM
VLM
17
1
0
04 Dec 2023
Hyperparameter Optimization for Large Language Model Instruction-Tuning
C. Tribes
Sacha Benarroch-Lelong
Peng Lu
I. Kobyzev
21
12
0
01 Dec 2023
Instruction-tuning Aligns LLMs to the Human Brain
Khai Loong Aw
Syrielle Montariol
Badr AlKhamissi
Martin Schrimpf
Antoine Bosselut
15
18
0
01 Dec 2023
CoLLiE: Collaborative Training of Large Language Models in an Efficient Way
Kai Lv
Shuo Zhang
Tianle Gu
Shuhao Xing
Jiawei Hong
...
Tengxiao Liu
Yu Sun
Penousal Machado
Hang Yan
Xipeng Qiu
35
6
0
01 Dec 2023
CLOMO: Counterfactual Logical Modification with Large Language Models
Yinya Huang
Ruixin Hong
Hongming Zhang
Wei Shao
Zhicheng YANG
Dong Yu
Changshui Zhang
Xiaodan Liang
Linqi Song
LRM
18
7
0
29 Nov 2023
Training Chain-of-Thought via Latent-Variable Inference
Du Phan
Matthew D. Hoffman
David Dohan
Sholto Douglas
Tuan Anh Le
Aaron T Parisi
Pavel Sountsov
Charles Sutton
Sharad Vikram
Rif A. Saurous
BDL
ReLM
LRM
8
22
0
28 Nov 2023
AlignedCoT: Prompting Large Language Models via Native-Speaking Demonstrations
Zhicheng YANG
Yinya Huang
Jing Xiong
Liang Feng
Xiaodan Liang
Yiwei Wang
Jing Tang
LRM
18
1
0
22 Nov 2023
ComPEFT: Compression for Communicating Parameter Efficient Updates via Sparsification and Quantization
Prateek Yadav
Leshem Choshen
Colin Raffel
Mohit Bansal
19
12
0
22 Nov 2023
Conditions for Length Generalization in Learning Reasoning Skills
Changnan Xiao
Bing Liu
LRM
32
7
0
22 Nov 2023
Compositional Capabilities of Autoregressive Transformers: A Study on Synthetic, Interpretable Tasks
Rahul Ramesh
Ekdeep Singh Lubana
Mikail Khona
Robert P. Dick
Hidenori Tanaka
CoGe
22
6
0
21 Nov 2023
Data Diversity Matters for Robust Instruction Tuning
Alexander Bukharin
Tuo Zhao
72
35
0
21 Nov 2023
Can We Utilize Pre-trained Language Models within Causal Discovery Algorithms?
Chanhui Lee
Juhyeon Kim
Yongjun Jeong
Juhyun Lyu
Junghee Kim
...
Hyeokjun Choe
Soyeon Park
Woohyung Lim
Sungbin Lim
Snu Astronomy Research Center
12
0
0
19 Nov 2023
Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2
Hamish Ivison
Yizhong Wang
Valentina Pyatkin
Nathan Lambert
Matthew E. Peters
...
Joel Jang
David Wadden
Noah A. Smith
Iz Beltagy
Hanna Hajishirzi
ALM
ELM
22
178
0
17 Nov 2023
OVM, Outcome-supervised Value Models for Planning in Mathematical Reasoning
Fei Yu
Anningzhe Gao
Benyou Wang
OffRL
LRM
15
39
0
16 Nov 2023
Whispers of Doubt Amidst Echoes of Triumph in NLP Robustness
Ashim Gupta
Rishanth Rajendhran
Nathan Stringham
Vivek Srikumar
Ana Marasović
AAML
21
3
0
16 Nov 2023
Automatic Engineering of Long Prompts
Cho-Jui Hsieh
Si Si
Felix X. Yu
Inderjit S. Dhillon
VLM
11
8
0
16 Nov 2023
Program-Aided Reasoners (better) Know What They Know
Anubha Kabra
Sanketh Rangreji
Yash Mathur
Aman Madaan
Emmy Liu
Graham Neubig
LRM
19
0
0
16 Nov 2023
Symbol-LLM: Towards Foundational Symbol-centric Interface For Large Language Models
Fangzhi Xu
Zhiyong Wu
Qiushi Sun
Siyu Ren
Fei Yuan
Shuai Yuan
Qika Lin
Yu Qiao
Jun Liu
LLMAG
14
32
0
15 Nov 2023
StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving
Chang Gao
Haiyun Jiang
Deng Cai
Shuming Shi
Wai Lam
LRM
29
3
0
15 Nov 2023
Towards Reasoning in Large Language Models via Multi-Agent Peer Review Collaboration
Zhenran Xu
Senbao Shi
Baotian Hu
Jindi Yu
Dongfang Li
Min Zhang
Yuxiang Wu
LRM
LLMAG
ALM
58
19
0
14 Nov 2023
On Measuring Faithfulness or Self-consistency of Natural Language Explanations
Letitia Parcalabescu
Anette Frank
LRM
64
20
0
13 Nov 2023
Do large language models and humans have similar behaviors in causal inference with script knowledge?
Xudong Hong
Margarita Ryzhova
Daniel Adrian Biondi
Ram Sarkar
27
5
0
13 Nov 2023
Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations
Zilu Tang
Mayank Agarwal
Alex Shypula
Bailin Wang
Derry Wijaya
Jie Chen
Yoon Kim
LRM
25
15
0
13 Nov 2023
Towards the Law of Capacity Gap in Distilling Language Models
Chen Zhang
Dawei Song
Zheyu Ye
Yan Gao
ELM
15
20
0
13 Nov 2023
BizBench: A Quantitative Reasoning Benchmark for Business and Finance
Rik Koncel-Kedziorski
Michael Krumdick
Viet Dac Lai
Varshini Reddy
Charles Lovering
Chris Tanner
AIMat
19
4
0
11 Nov 2023
Prompt Engineering a Prompt Engineer
Qinyuan Ye
Maxamed Axmed
Reid Pryzant
Fereshte Khani
VLM
LLMAG
LRM
19
28
0
09 Nov 2023
Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs
Shashank Gupta
Vaishnavi Shrivastava
A. Deshpande
A. Kalyan
Peter Clark
Ashish Sabharwal
Tushar Khot
120
100
0
08 Nov 2023
Black-Box Prompt Optimization: Aligning Large Language Models without Model Training
Jiale Cheng
Xiao Liu
Kehan Zheng
Pei Ke
Hongning Wang
Yuxiao Dong
Jie Tang
Minlie Huang
21
76
0
07 Nov 2023
Implicit Chain of Thought Reasoning via Knowledge Distillation
Yuntian Deng
Kiran Prasad
Roland Fernandez
P. Smolensky
Vishrav Chaudhary
Stuart M. Shieber
ReLM
LRM
14
42
0
02 Nov 2023
FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models
Yuxin Jiang
Yufei Wang
Xingshan Zeng
Wanjun Zhong
Liangyou Li
Fei Mi
Lifeng Shang
Xin Jiang
Qun Liu
Wei Wang
ALM
13
25
0
31 Oct 2023
M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models
Wai-Chung Kwan
Xingshan Zeng
Yufei Wang
Yusen Sun
Liangyou Li
Lifeng Shang
Qun Liu
Kam-Fai Wong
ELM
89
10
0
30 Oct 2023
TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language Modeling Likewise
Nan He
Hanyu Lai
Chenyang Zhao
Zirui Cheng
Junting Pan
...
Zhaohui Hou
Zhiyuan Huang
Shaoqing Lu
Ding Liang
Mingjie Zhan
LRM
12
12
0
29 Oct 2023
DetermLR: Augmenting LLM-based Logical Reasoning from Indeterminacy to Determinacy
Hongda Sun
Weikai Xu
Wei Liu
Jian Luan
Bin Wang
Shuo Shang
Ji-Rong Wen
Rui Yan
LRM
29
24
0
28 Oct 2023
TarGEN: Targeted Data Generation with Large Language Models
Himanshu Gupta
Kevin Scaria
Ujjwala Anantheswaran
Shreyas Verma
Mihir Parmar
Saurabh Arjun Sawant
Chitta Baral
Swaroop Mishra
SyDa
22
4
0
27 Oct 2023
OccuQuest: Mitigating Occupational Bias for Inclusive Large Language Models
Mingfeng Xue
Dayiheng Liu
Kexin Yang
Guanting Dong
Wenqiang Lei
Zheng Yuan
Chang Zhou
Jingren Zhou
LLMAG
6
2
0
25 Oct 2023
PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization
Xinyuan Wang
Chenxi Li
Zhen Wang
Fan Bai
Haotian Luo
Jiayou Zhang
Nebojsa Jojic
Eric P. Xing
Zhiting Hu
23
98
0
25 Oct 2023
Can You Follow Me? Testing Situational Understanding in ChatGPT
Chenghao Yang
Allyson Ettinger
LRM
LLMAG
ELM
107
4
0
24 Oct 2023
Failures Pave the Way: Enhancing Large Language Models through Tuning-free Rule Accumulation
Zeyuan Yang
Peng Li
Yang Janet Liu
LRM
25
20
0
24 Oct 2023
POE: Process of Elimination for Multiple Choice Reasoning
Chenkai Ma
Xinya Du
LRM
17
5
0
24 Oct 2023
S3Eval: A Synthetic, Scalable, Systematic Evaluation Suite for Large Language Models
Fangyu Lei
Qian Liu
Yiming Huang
Shizhu He
Jun Zhao
Kang Liu
ELM
LRM
20
12
0
23 Oct 2023
LLM-in-the-loop: Leveraging Large Language Model for Thematic Analysis
Shih-Chieh Dai
Aiping Xiong
Lun-Wei Ku
13
61
0
23 Oct 2023
Reasoning about Ambiguous Definite Descriptions
Stefan F. Schouten
Peter Bloem
Ilia Markov
Piek Vossen
LRM
UQLM
13
0
0
23 Oct 2023
Language Model Unalignment: Parametric Red-Teaming to Expose Hidden Harms and Biases
Rishabh Bhardwaj
Soujanya Poria
ALM
49
14
0
22 Oct 2023
Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing
Xinyu Hu
Pengfei Tang
Simiao Zuo
Zihan Wang
Bowen Song
Qiang Lou
Jian Jiao
Denis Xavier Charles
LRM
31
7
0
20 Oct 2023
Teaching Language Models to Self-Improve through Interactive Demonstrations
Xiao Yu
Baolin Peng
Michel Galley
Jianfeng Gao
Zhou Yu
LRM
ReLM
31
19
0
20 Oct 2023
Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models
Zhihan Zhang
Shuohang Wang
W. Yu
Yichong Xu
Dan Iter
Qingkai Zeng
Yang Liu
Chenguang Zhu
Meng-Long Jiang
SyDa
ALM
17
20
0
19 Oct 2023
Multi-stage Large Language Model Correction for Speech Recognition
Jie Pu
Thai-Son Nguyen
Sebastian Stüker
LRM
22
6
0
17 Oct 2023
AdaLomo: Low-memory Optimization with Adaptive Learning Rate
Kai Lv
Hang Yan
Qipeng Guo
Haijun Lv
Xipeng Qiu
ODL
19
20
0
16 Oct 2023
GLoRE: Evaluating Logical Reasoning of Large Language Models
Hanmeng Liu
Zhiyang Teng
Ruoxi Ning
Jian Liu
Qiji Zhou
Yuexin Zhang
Yue Zhang
ReLM
ELM
LRM
57
6
0
13 Oct 2023