Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2311.11829
Cited By
System 2 Attention (is something you might need too)
20 November 2023
Jason Weston
Sainbayar Sukhbaatar
RALM
OffRL
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"System 2 Attention (is something you might need too)"
41 / 41 papers shown
Title
Exploring and Controlling Diversity in LLM-Agent Conversation
Kuanchao Chu
Yi-Pei Chen
Hideki Nakayama
LLMAG
42
1
0
24 Feb 2025
An Overview and Discussion on Using Large Language Models for Implementation Generation of Solutions to Open-Ended Problems
Hashmath Shaik
Alex Doboli
OffRL
ELM
92
0
0
31 Dec 2024
System-2 Mathematical Reasoning via Enriched Instruction Tuning
Huanqia Cai
Yijun Yang
Zhifeng Li
LRM
74
0
0
22 Dec 2024
Can We Afford The Perfect Prompt? Balancing Cost and Accuracy with the Economical Prompting Index
Tyler McDonald
Anthony Colosimo
Yifeng Li
Ali Emami
65
1
0
02 Dec 2024
Sycophancy in Large Language Models: Causes and Mitigations
Lars Malmqvist
69
7
0
22 Nov 2024
Explaining GPT-4's Schema of Depression Using Machine Behavior Analysis
Adithya V Ganesan
Vasudha Varadarajan
Yash Kumar Lal
Veerle C. Eijsbroek
Katarina Kjell
...
Elizabeth C. Stade
J. Eichstaedt
Ryan L. Boyd
H. A. Schwartz
Lucie Flek
AI4MH
67
0
0
21 Nov 2024
Thinking LLMs: General Instruction Following with Thought Generation
Tianhao Wu
Janice Lan
Weizhe Yuan
Jiantao Jiao
Jason Weston
Sainbayar Sukhbaatar
LRM
16
15
0
14 Oct 2024
LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for Enhanced Following of Instructions with Multiple Constraints
Thomas Palmeira Ferraz
Kartik Mehta
Yu-Hsiang Lin
Haw-Shiuan Chang
Shereen Oraby
Sijia Liu
Vivek Subramanian
Tagyoung Chung
Mohit Bansal
Nanyun Peng
48
7
0
09 Oct 2024
ALR
2
^2
2
: A Retrieve-then-Reason Framework for Long-context Question Answering
Huayang Li
Pat Verga
Priyanka Sen
Bowen Yang
Vijay Viswanathan
Patrick Lewis
Taro Watanabe
Yixuan Su
RALM
LRM
40
7
0
04 Oct 2024
Verbalized Graph Representation Learning: A Fully Interpretable Graph Model Based on Large Language Models Throughout the Entire Process
Xingyu Ji
Jiale Liu
Lu Li
Maojun Wang
Zeyu Zhang
23
1
0
02 Oct 2024
Distilling System 2 into System 1
Ping Yu
Jing Xu
Jason Weston
Ilia Kulikov
OffRL
LRM
38
56
0
08 Jul 2024
Perceptions to Beliefs: Exploring Precursory Inferences for Theory of Mind in Large Language Models
Chani Jung
Dongkwan Kim
Jiho Jin
Jiseon Kim
Yeon Seonwoo
Yejin Choi
Alice H. Oh
Hyunwoo Kim
LRM
53
5
0
08 Jul 2024
Re-Tuning: Overcoming the Compositionality Limits of Large Language Models with Recursive Tuning
Eric Pasewark
Kyle Montgomery
Kefei Duan
Dawn Song
Chenguang Wang
LRM
CLL
ReLM
15
1
0
05 Jul 2024
CItruS: Chunked Instruction-aware State Eviction for Long Sequence Modeling
Yu Bai
Xiyuan Zou
Heyan Huang
Sanxing Chen
Marc-Antoine Rondeau
Yang Gao
Jackie Chi Kit Cheung
29
3
0
17 Jun 2024
MultiMax: Sparse and Multi-Modal Attention Learning
Yuxuan Zhou
Mario Fritz
M. Keuper
35
1
0
03 Jun 2024
LOGIN: A Large Language Model Consulted Graph Neural Network Training Framework
Yiran Qiao
Xiang Ao
Yang Liu
Jiarong Xu
Xiaoqian Sun
Qing He
27
3
0
22 May 2024
Pixels and Predictions: Potential of GPT-4V in Meteorological Imagery Analysis and Forecast Communication
John R. Lawson
Montgomery Flora
Kevin H. Goebbert
Seth N. Lyman
Corey K. Potvin
David M. Schultz
Adam J. Stepanek
Joseph E. Trujillo-Falcón
MLLM
39
1
0
22 Apr 2024
Social Skill Training with Large Language Models
Diyi Yang
Caleb Ziems
William B. Held
Omar Shaikh
Michael S. Bernstein
John C. Mitchell
LLMAG
43
8
0
05 Apr 2024
Conceptual and Unbiased Reasoning in Language Models
Ben Zhou
Hongming Zhang
Sihao Chen
Dian Yu
Hongwei Wang
Baolin Peng
Dan Roth
Dong Yu
ReLM
LRM
ELM
18
12
0
30 Mar 2024
RAFT: Adapting Language Model to Domain Specific RAG
Tianjun Zhang
Shishir G. Patil
Naman Jain
Sheng Shen
Matei A. Zaharia
Ion Stoica
Joseph E. Gonzalez
RALM
32
176
0
15 Mar 2024
Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought
James Chua
Edward Rees
Hunar Batra
Samuel R. Bowman
Julian Michael
Ethan Perez
Miles Turpin
LRM
39
13
0
08 Mar 2024
Can't Remember Details in Long Documents? You Need Some R&R
Devanshu Agrawal
Shang Gao
Martin Gajek
RALM
67
6
0
08 Mar 2024
CoGenesis: A Framework Collaborating Large and Small Language Models for Secure Context-Aware Instruction Following
Kaiyan Zhang
Jianyu Wang
Ermo Hua
Biqing Qi
Ning Ding
Bowen Zhou
SyDa
30
20
0
05 Mar 2024
LLM Task Interference: An Initial Study on the Impact of Task-Switch in Conversational History
Akash Gupta
Ivaxi Sheth
Vyas Raina
Mark J. F. Gales
Mario Fritz
30
4
0
28 Feb 2024
Are LLMs Rational Investors? A Study on Detecting and Reducing the Financial Bias in LLMs
Yuhang Zhou
Yuchen Ni
Yu Gan
Zhangyue Yin
Xiang Liu
Jian Zhang
Sen Liu
Xipeng Qiu
Guangnan Ye
Hongfeng Chai
AIFin
41
4
0
20 Feb 2024
Rethinking Human-like Translation Strategy: Integrating Drift-Diffusion Model with Large Language Models for Machine Translation
Hongbin Na
Zimu Wang
M. Maimaiti
Tong Chen
Wei Wang
Tao Shen
Ling Chen
LRM
20
5
0
16 Feb 2024
Chain of Logic: Rule-Based Reasoning with Large Language Models
Sergio Servantez
Joe Barrow
Kristian J. Hammond
R. Jain
ReLM
ELM
AILaw
LRM
AI4CE
32
1
0
16 Feb 2024
A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts
Kuang-Huei Lee
Xinyun Chen
Hiroki Furuta
John F. Canny
Ian S. Fischer
RALM
53
29
0
15 Feb 2024
Measuring and Controlling Instruction (In)Stability in Language Model Dialogs
Kenneth Li
Tianle Liu
Naomi Bashkansky
David Bau
Fernanda Viégas
Hanspeter Pfister
Martin Wattenberg
16
6
0
13 Feb 2024
Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs
Víctor Gallego
SyDa
25
6
0
12 Feb 2024
A Systematic Survey of Prompt Engineering in Large Language Models: Techniques and Applications
Pranab Sahoo
Ayush Kumar Singh
Sriparna Saha
Vinija Jain
S. Mondal
Aman Chadha
49
268
0
05 Feb 2024
Demystifying Chains, Trees, and Graphs of Thoughts
Maciej Besta
Florim Memedi
Zhenyu Zhang
Robert Gerstenberger
Guangyuan Piao
...
Aleš Kubíček
H. Niewiadomski
Aidan O'Mahony
Onur Mutlu
Torsten Hoefler
AI4CE
LRM
63
26
0
25 Jan 2024
DocFinQA: A Long-Context Financial Reasoning Dataset
Varshini Reddy
Rik Koncel-Kedziorski
Viet Dac Lai
Michael Krumdick
Charles Lovering
Chris Tanner
RALM
13
15
0
12 Jan 2024
Large Language Models for Social Networks: Applications, Challenges, and Solutions
Jingying Zeng
Richard Huang
Waleed Malik
Langxuan Yin
Bojan Babic
Danny Shacham
Xiao Yan
Jaewon Yang
Qi He
22
3
0
04 Jan 2024
A Survey of Reasoning with Foundation Models
Jiankai Sun
Chuanyang Zheng
E. Xie
Zhengying Liu
Ruihang Chu
...
Xipeng Qiu
Yi-Chen Guo
Hui Xiong
Qun Liu
Zhenguo Li
ReLM
LRM
AI4CE
22
75
0
17 Dec 2023
Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves
Yihe Deng
Weitong Zhang
Zixiang Chen
Quanquan Gu
LRM
22
72
0
07 Nov 2023
Towards Understanding Sycophancy in Language Models
Mrinank Sharma
Meg Tong
Tomasz Korbak
D. Duvenaud
Amanda Askell
...
Oliver Rausch
Nicholas Schiefer
Da Yan
Miranda Zhang
Ethan Perez
209
178
0
20 Oct 2023
Concise and Organized Perception Facilitates Reasoning in Large Language Models
Junjie Liu
Shaotian Yan
Chen Shen
Zhengdong Xiao
Wenxiao Wang
Jieping Ye
Jieping Ye
LRM
8
1
0
05 Oct 2023
Large Language Models are Zero-Shot Reasoners
Takeshi Kojima
S. Gu
Machel Reid
Yutaka Matsuo
Yusuke Iwasawa
ReLM
LRM
291
4,048
0
24 May 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
315
8,402
0
28 Jan 2022
UnNatural Language Inference
Koustuv Sinha
Prasanna Parthasarathi
Joelle Pineau
Adina Williams
211
94
0
30 Dec 2020
1