ResearchTrend.AI
Show Your Work: Scratchpads for Intermediate Computation with Language Models

30 November 2021
Maxwell Nye
Anders Andreassen
Guy Gur-Ari
Henryk Michalewski
Jacob Austin
David Bieber
David Dohan
Aitor Lewkowycz
Maarten Bosma
D. Luan
Charles Sutton
Augustus Odena
    ReLM
    LRM

Papers citing "Show Your Work: Scratchpads for Intermediate Computation with Language Models"

50 / 551 papers shown
Tailored Truths: Optimizing LLM Persuasion with Personalization and Fabricated Statistics
Jasper Timm
Chetan Talele
Jacob Haimes
33
0
0
28 Jan 2025
Are Transformers Able to Reason by Connecting Separated Knowledge in Training Data?
Yutong Yin
Zhaoran Wang
LRM
ReLM
95
0
0
27 Jan 2025
CodeMonkeys: Scaling Test-Time Compute for Software Engineering
Ryan Ehrlich
Bradley Brown
Jordan Juravsky
Ronald Clark
Christopher Ré
Azalia Mirhoseini
55
6
0
24 Jan 2025
Towards Multimodal Metaphor Understanding: A Chinese Dataset and Model for Metaphor Mapping Identification
Dongyu Zhang
Shengcheng Yin
J. Yu
Zhiyao Wu
Zhen Li
Chengpei Xu
X. Wang
Feng Xia
91
0
0
05 Jan 2025
Mathematical Language Models: A Survey
W. Liu
Hanglei Hu
Jie Zhou
Yuyang Ding
Junsong Li
...
Mengliang He
Qin Chen
Bo Jiang
Aimin Zhou
Liang He
LRM
79
12
0
03 Jan 2025
Verbosity-Aware Rationale Reduction: Effective Reduction of Redundant Rationale via Principled Criteria
Joonwon Jang
Jaehee Kim
Wonbin Kweon
Hwanjo Yu
LRM
31
1
0
03 Jan 2025
Enhancing Reasoning through Process Supervision with Monte Carlo Tree Search
Shuangtao Li
Shuaihao Dong
Kexin Luan
Xinhan Di
Chaofan Ding
LRM
43
1
0
02 Jan 2025
Lies, Damned Lies, and Distributional Language Statistics: Persuasion and Deception with Large Language Models
Cameron R. Jones
Benjamin Bergen
67
4
0
22 Dec 2024
Prompting Strategies for Enabling Large Language Models to Infer Causation from Correlation
Eleni Sgouritsa
Virginia Aglietti
Yee Whye Teh
Arnaud Doucet
A. Gretton
Silvia Chiappa
ReLM
LRM
74
0
0
18 Dec 2024
CoinMath: Harnessing the Power of Coding Instruction for Math LLMs
Chengwei Wei
Bin Wang
Jung-jae Kim
Guimei Liu
Nancy F. Chen
LRM
77
0
0
16 Dec 2024
C3oT: Generating Shorter Chain-of-Thought without Compromising Effectiveness
Yu Kang
Xianghui Sun
Liangyu Chen
Wei Zou
LRM
72
18
0
16 Dec 2024
LAW: Legal Agentic Workflows for Custody and Fund Services Contracts
William Watson
Nicole Cho
Nishan Srishankar
Zhen Zeng
Lucas Cecchi
Daniel Scott
S. Siddagangappa
Rachneet Kaur
T. Balch
Manuela Veloso
AILaw
69
0
0
15 Dec 2024
Unraveling Arithmetic in Large Language Models: The Role of Algebraic Structures
Fu-Chieh Chang
Pei-Yuan Wu
LRM
104
1
0
25 Nov 2024
Forecasting Future International Events: A Reliable Dataset for Text-Based Event Modeling
Daehoon Gwak
Junwoo Park
Minho Park
C. Park
Hyunchan Lee
E. Choi
Jaegul Choo
64
0
0
21 Nov 2024
Explaining GPT-4's Schema of Depression Using Machine Behavior Analysis
Adithya V Ganesan
Vasudha Varadarajan
Yash Kumar Lal
Veerle C. Eijsbroek
Katarina Kjell
...
Elizabeth C. Stade
J. Eichstaedt
Ryan L. Boyd
H. A. Schwartz
Lucie Flek
AI4MH
67
0
0
21 Nov 2024
Foundations and Recent Trends in Multimodal Mobile Agents: A Survey
Biao Wu
Yanda Li
Meng Fang
Zirui Song
Zhiwei Zhang
Yunchao Wei
L. Chen
LM&Ro
LLMAG
OffRL
AI4TS
39
4
0
04 Nov 2024
Prospective Learning: Learning for a Dynamic Future
Ashwin De Silva
Rahul Ramesh
Rubing Yang
Siyu Yu
Joshua T. Vogelstein
Pratik Chaudhari
AI4TS
58
0
0
31 Oct 2024
Leveraging LLMs for Hypothetical Deduction in Logical Inference: A Neuro-Symbolic Approach
Qingchuan Li
Jiatong Li
Tongxuan Liu
Yuting Zeng
Mingyue Cheng
Weizhe Huang
Qi Liu
LRM
AI4CE
47
2
0
29 Oct 2024
Mind Your Step (by Step): Chain-of-Thought can Reduce Performance on Tasks where Thinking Makes Humans Worse
Ryan Liu
Jiayi Geng
Addison J. Wu
Ilia Sucholutsky
Tania Lombrozo
Thomas L. Griffiths
ReLM
LRM
60
19
0
27 Oct 2024
Delving into the Reversal Curse: How Far Can Large Language Models Generalize?
Zhengkai Lin
Z. Fu
Kai Liu
Liang Xie
Binbin Lin
Wenxiao Wang
D. Cai
Yue Wu
Jieping Ye
LRM
25
3
0
24 Oct 2024
Understanding When Tree of Thoughts Succeeds: Larger Models Excel in Generation, Not Discrimination
Qiqi Chen
Xinpeng Wang
Philipp Mondorf
Michael A. Hedderich
Barbara Plank
LRM
AI4CE
21
1
0
23 Oct 2024
In Context Learning and Reasoning for Symbolic Regression with Large Language Models
Samiha Sharlin
Tyler R. Josephson
ReLM
LLMAG
LRM
40
1
0
22 Oct 2024
The Best Defense is a Good Offense: Countering LLM-Powered Cyberattacks
Daniel Ayzenshteyn
Roy Weiss
Yisroel Mirsky
AAML
26
0
0
20 Oct 2024
From Solitary Directives to Interactive Encouragement! LLM Secure Code Generation by Natural Language Prompting
Shigang Liu
Bushra Sabir
Seung Ick Jang
Yuval Kansal
Yansong Gao
Kristen Moore
A. Abuadbba
Surya Nepal
30
2
0
18 Oct 2024
Step Guided Reasoning: Improving Mathematical Reasoning using Guidance Generation and Step Reasoning
Lang Cao
Chao Peng
Renhong Chen
Wu Ning
Yingtian Zou
Yitong Li
LRM
21
0
0
18 Oct 2024
Retrieval-Enhanced Named Entity Recognition
Enzo Shiraishi
Raphael Y. de Camargo
Henrique L. P. Silva
Ronaldo C. Prati
RALM
18
0
0
17 Oct 2024
Open Domain Question Answering with Conflicting Contexts
Siyi Liu
Qiang Ning
Kishaloy Halder
Wei Xiao
Zheng Qi
...
Yi Zhang
Neha Anna John
Bonan Min
Yassine Benajiba
Dan Roth
LLMAG
63
2
0
16 Oct 2024
DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing
Shreya Shankar
Tristan Chambers
Aditya G. Parameswaran
Eugene Wu
LLMAG
53
6
0
16 Oct 2024
On the Training Convergence of Transformers for In-Context Classification
Wei Shen
Ruida Zhou
Jing Yang
Cong Shen
26
3
0
15 Oct 2024
Bypassing the Exponential Dependency: Looped Transformers Efficiently Learn In-context by Multi-step Gradient Descent
Bo Chen
Xiaoyu Li
Yingyu Liang
Zhenmei Shi
Zhao-quan Song
83
19
0
15 Oct 2024
Thinking LLMs: General Instruction Following with Thought Generation
Tianhao Wu
Janice Lan
Weizhe Yuan
Jiantao Jiao
Jason Weston
Sainbayar Sukhbaatar
LRM
16
15
0
14 Oct 2024
Neural networks that overcome classic challenges through practice
Kazuki Irie
Brenden M. Lake
29
4
0
14 Oct 2024
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
Jun Wang
Meng Fang
Ziyu Wan
Muning Wen
Jiachen Zhu
...
Lei Chen
Lionel M. Ni
Linyi Yang
Ying Wen
W. Zhang
LRM
21
30
0
12 Oct 2024
Visual Scratchpads: Enabling Global Reasoning in Vision
Aryo Lotfi
Enrico Fini
Samy Bengio
Moin Nabi
Emmanuel Abbe
LRM
37
0
0
10 Oct 2024
Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines
Junyu Lai
Jiahe Xu
Yao Yang
Yunpeng Huang
Chun Cao
Jingwei Xu
LRM
29
2
0
10 Oct 2024
MLissard: Multilingual Long and Simple Sequential Reasoning Benchmarks
M. Bueno
R. Lotufo
Rodrigo Nogueira
LRM
26
0
0
08 Oct 2024
Reasoning Paths Optimization: Learning to Reason and Explore From Diverse Paths
Yew Ken Chia
Guizhen Chen
Weiwen Xu
Luu Anh Tuan
Soujanya Poria
Lidong Bing
LRM
23
0
0
07 Oct 2024
Learning How Hard to Think: Input-Adaptive Allocation of LM Computation
Mehul Damani
Idan Shenfeld
Andi Peng
Andreea Bobu
Jacob Andreas
39
14
0
07 Oct 2024
From Sparse Dependence to Sparse Attention: Unveiling How Chain-of-Thought Enhances Transformer Sample Efficiency
Kaiyue Wen
Huaqing Zhang
Hongzhou Lin
Jingzhao Zhang
MoE
LRM
61
2
0
07 Oct 2024
Understanding Reasoning in Chain-of-Thought from the Hopfieldian View
Lijie Hu
Liang Liu
Shu Yang
Xin Chen
Zhen Tan
Muhammad Asif Ali
Mengdi Li
Di Wang
LRM
41
1
0
04 Oct 2024
When a language model is optimized for reasoning, does it still show embers of autoregression? An analysis of OpenAI o1
R. Thomas McCoy
Shunyu Yao
Dan Friedman
Mathew D. Hardy
Thomas L. Griffiths
LRM
31
7
0
02 Oct 2024
OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data
Shubham Toshniwal
Wei Du
Ivan Moshkov
Branislav Kisacanin
Alexan Ayrapetyan
Igor Gitman
LRM
18
49
0
02 Oct 2024
ForecastBench: A Dynamic Benchmark of AI Forecasting Capabilities
Ezra Karger
Houtan Bastani
Chen Yueh-Han
Zachary Jacobs
Danny Halawi
Fred Zhang
P. Tetlock
33
6
0
30 Sep 2024
Infer Human's Intentions Before Following Natural Language Instructions
Yanming Wan
Yue Wu
Yiping Wang
Jiayuan Mao
Natasha Jaques
LM&Ro
28
3
0
26 Sep 2024
Logic-of-Thought: Injecting Logic into Contexts for Full Reasoning in Large Language Models
Tongxuan Liu
Wenjiang Xu
Weizhe Huang
Yuting Zeng
Jiaxing Wang
Hailong Yang
Jing Li
LRM
ReLM
41
5
0
26 Sep 2024
Scaling Behavior for Large Language Models regarding Numeral Systems: An Example using Pythia
Zhejian Zhou
Jiayu Wang
Dahua Lin
Kai Chen
LRM
29
2
0
25 Sep 2024
GroupDebate: Enhancing the Efficiency of Multi-Agent Debate Using Group Discussion
Tongxuan Liu
Xingyu Wang
Weizhe Huang
Wenjiang Xu
Yuting Zeng
Lei Jiang
Hailong Yang
Jing Li
LLMAG
29
8
0
21 Sep 2024
Uncovering Latent Chain of Thought Vectors in Language Models
Jason Zhang
Scott Viteri
LLMSV
LRM
36
1
0
21 Sep 2024
Contextual Compression in Retrieval-Augmented Generation for Large Language Models: A Survey
Sourav Verma
RALM
3DV
24
2
0
20 Sep 2024
Retrieve, Annotate, Evaluate, Repeat: Leveraging Multimodal LLMs for Large-Scale Product Retrieval Evaluation
Kasra Hosseini
Thomas Kober
Josip Krapac
Roland Vollgraf
Weiwei Cheng
Ana Peleteiro Ramallo
19
1
0
18 Sep 2024