Emergent World Representations: Exploring a Sequence Model Trained on a Synthetic Task

24 October 2022

Papers citing "Emergent World Representations: Exploring a Sequence Model Trained on a Synthetic Task"

50 / 200 papers shown

Scaling Laws for State Dynamics in Large Language Models Jacob X Li Shreyas S Raman Jessica Wan Fahad Samman Jazlyn Lin 14 0 0 20 May 2025
ViPlan: A Benchmark for Visual Planning with Symbolic Predicates and Vision-Language Models Matteo Merler Nicola Dainese Minttu Alakuijala Giovanni Bonetta Pietro Ferrazzi Yu Tian Bernardo Magnini Pekka Marttinen LM&Ro VLM 27 0 0 19 May 2025
Reward Inside the Model: A Lightweight Hidden-State Reward Model for LLM's Best-of-N sampling Jizhou Guo Zhaomin Wu Philip S. Yu 12 0 0 18 May 2025
Questioning Representational Optimism in Deep Learning: The Fractured Entangled Representation Hypothesis Akarsh Kumar Jeff Clune Joel Lehman Kenneth O. Stanley OOD 26 0 0 16 May 2025
Evaluating Large Language Models for Real-World Engineering Tasks Rene Heesch Sebastian Eilermann Alexander Windmann Alexander Diedrich Philipp Rosenthal Oliver Niggemann ELM 13 0 0 12 May 2025
Improving World Models using Deep Supervision with Linear Probes Andrii Zahorodnii 28 0 0 04 Apr 2025
Is the Reversal Curse a Binding Problem? Uncovering Limitations of Transformers from a Basic Generalization Failure Boshi Wang Huan Sun 49 2 0 02 Apr 2025
Improving Preference Extraction In LLMs By Identifying Latent Knowledge Through Classifying Probes Sharan Maiya Yinhong Liu Ramit Debnath Anna Korhonen 46 0 0 22 Mar 2025
MedAgent-Pro: Towards Evidence-based Multi-modal Medical Diagnosis via Reasoning Agentic Workflow Ziyue Wang Junde Wu Linghan Cai Chang Han Low Xihong Yang Qiaxuan Li Yueming Jin LRM 70 2 0 21 Mar 2025
Revisiting the Othello World Model Hypothesis Yifei Yuan Anders Søgaard LRM 60 0 0 06 Mar 2025
The MASK Benchmark: Disentangling Honesty From Accuracy in AI Systems Richard Ren Arunim Agarwal Mantas Mazeika Cristina Menghini Robert Vacareanu ... Matias Geralnik Adam Khoja Dean Lee Summer Yue Dan Hendrycks HILM ALM 90 0 0 05 Mar 2025
(How) Do Language Models Track State? Belinda Z. Li Zifan Carl Guo Jacob Andreas LRM 57 2 0 04 Mar 2025
Implicit Search via Discrete Diffusion: A Study on Chess Jiacheng Ye Zhenyu Wu Jiahui Gao Zhiyong Wu Xin Jiang Zhiyu Li Lingpeng Kong DiffM 55 3 0 27 Feb 2025
Grandes modelos de lenguaje: de la predicción de palabras a la comprensión? Carlos Gómez-Rodríguez SyDa AILaw ELM VLM 115 0 0 25 Feb 2025
Representation Engineering for Large-Language Models: Survey and Research Challenges Lukasz Bartoszcze Sarthak Munshi Bryan Sukidi Jennifer Yen Zejia Yang David Williams-King Linh Le Kosi Asuzu Carsten Maple 102 0 0 24 Feb 2025
Text2World: Benchmarking Large Language Models for Symbolic World Model Generation Mengkang Hu Tianxing Chen Yude Zou Yuheng Lei Qiguang Chen Ming Li Yao Mu H. Zhang Wenqi Shao Ping Luo LLMAG 52 2 0 18 Feb 2025
The Representation and Recall of Interwoven Structured Knowledge in LLMs: A Geometric and Layered Analysis Ge Lei Samuel J. Cooper KELM 53 0 0 15 Feb 2025
MET-Bench: Multimodal Entity Tracking for Evaluating the Limitations of Vision-Language and Reasoning Models Vanya Cohen Raymond J. Mooney 52 0 0 15 Feb 2025
Emergent Stack Representations in Modeling Counter Languages Using Transformers Utkarsh Tiwari Aviral Gupta Michael Hahn 259 0 0 03 Feb 2025
It's Not Just a Phase: On Investigating Phase Transitions in Deep Learning-based Side-channel Analysis Sengim Karayalçin Marina Krček Stjepan Picek AAML 82 0 0 01 Feb 2025
An Attempt to Unraveling Token Prediction Refinement and Identifying Essential Layers of Large Language Models Jaturong Kongmanee 44 1 0 28 Jan 2025
Revisiting Rogers' Paradox in the Context of Human-AI Interaction Katherine M. Collins Umang Bhatt Ilia Sucholutsky 61 1 0 16 Jan 2025
Representation in large language models Cameron C. Yetman 46 1 0 03 Jan 2025
ICLR: In-Context Learning of Representations Core Francisco Park Andrew Lee Ekdeep Singh Lubana Yongyi Yang Maya Okawa Kento Nishi Martin Wattenberg Hidenori Tanaka AIFin 125 4 0 29 Dec 2024
A Review of Multimodal Explainable Artificial Intelligence: Past, Present and Future Shilin Sun Wenbin An Feng Tian Fang Nan Qidong Liu Jing Liu N. Shah Ping Chen 104 3 0 18 Dec 2024
Transformers Use Causal World Models in Maze-Solving Tasks Alex F Spies William Edwards Michael Ivanitskiy Adrians Skapars Tilman Rauker Katsumi Inoue A. Russo Murray Shanahan 224 1 0 16 Dec 2024
PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos Meng Cao Haoran Tang Haoze Zhao Hangyu Guo Jing Liu Ge Zhang Ruyang Liu Qiang Sun Ian Reid Xiaodan Liang 106 2 0 02 Dec 2024
Think-to-Talk or Talk-to-Think? When LLMs Come Up with an Answer in Multi-Step Arithmetic Reasoning Keito Kudo Yoichi Aoki Tatsuki Kuribayashi Shusaku Sone Masaya Taniguchi Ana Brassard Keisuke Sakaguchi Kentaro Inui ReLM LRM 79 0 0 02 Dec 2024
COLD: Causal reasOning in cLosed Daily activities Abhinav Joshi A. Ahmad Ashutosh Modi LRM ReLM 74 1 0 29 Nov 2024
Probing for Consciousness in Machines Mathis Immertreu A. Schilling Andreas K. Maier P. Krauss AI4CE 77 1 0 25 Nov 2024
Towards Unifying Interpretability and Control: Evaluation via Intervention Usha Bhalla Suraj Srinivas Asma Ghandeharioun Himabindu Lakkaraju 47 5 0 07 Nov 2024
LLM Generated Distribution-Based Prediction of US Electoral Results, Part I Caleb Bradshaw Caelen Miller Sean Warnick 46 0 0 05 Nov 2024
The Future of Intelligent Healthcare: A Systematic Analysis and Discussion on the Integration and Impact of Robots Using Large Language Models for Healthcare Souren Pashangpour Goldie Nejat LM&MA 58 7 0 05 Nov 2024
All or None: Identifiable Linear Properties of Next-token Predictors in Language Modeling Emanuele Marconato Sébastien Lachapelle Sebastian Weichwald Luigi Gresele 69 3 0 30 Oct 2024
Arithmetic Without Algorithms: Language Models Solve Math With a Bag of Heuristics Yaniv Nikankin Anja Reusch Aaron Mueller Yonatan Belinkov AIFin LRM 46 25 0 28 Oct 2024
Delving into the Reversal Curse: How Far Can Large Language Models Generalize? Zhengkai Lin Z. Fu Kai Liu Liang Xie Binbin Lin Wenxiao Wang D. Cai Yue Wu Jieping Ye LRM 30 3 0 24 Oct 2024
Are Large Language Models Ready for Travel Planning? Ruiping Ren Xing Yao Shu Cole Haining Wang 33 0 0 22 Oct 2024
Chatting with Bots: AI, Speech Acts, and the Edge of Assertion Iwan Williams Tim Bayne 41 1 0 22 Oct 2024
Do LLMs "know" internally when they follow instructions? Juyeon Heo Christina Heinze-Deml Oussama Elachqar Shirley Ren Udhay Nallasamy Andy Miller Kwan Ho Ryan Chan Jaya Narain 54 6 0 18 Oct 2024
Automatic Mapping of Anatomical Landmarks from Free-Text Using Large Language Models: Insights from Llama-2 Mohamad Abdi Gerardo Hermosillo Valadez H. Yerebakan MedIm 32 0 0 16 Oct 2024
Systems with Switching Causal Relations: A Meta-Causal Perspective Moritz Willig Tim Nelson Tobiasch Florian Peter Busch Jonas Seng Devendra Singh Dhami Kristian Kersting CML 48 0 0 16 Oct 2024
Analyzing (In)Abilities of SAEs via Formal Languages Abhinav Menon Manish Shrivastava David M. Krueger Ekdeep Singh Lubana 50 7 0 15 Oct 2024
The Geometry of Concepts: Sparse Autoencoder Feature Structure Yuxiao Li Eric J. Michaud David D. Baek Joshua Engels Xiaoqing Sun Max Tegmark 58 9 0 10 Oct 2024
Locate-then-edit for Multi-hop Factual Recall under Knowledge Editing Zhuoran Zhang Yongqian Li Zijian Kan Keyuan Cheng Lijie Hu Di Wang KELM 31 5 0 08 Oct 2024
Chain and Causal Attention for Efficient Entity Tracking Erwan Fagnou Paul Caillon Blaise Delattre Alexandre Allauzen 33 3 0 07 Oct 2024
Organizing Unstructured Image Collections using Natural Language Mingxuan Liu Zhun Zhong Jun Li Gianni Franchi Subhankar Roy Elisa Ricci VLM 52 3 0 07 Oct 2024
Latent Abstractions in Generative Diffusion Models Giulio Franzese Mattia Martini Giulio Corallo Paolo Papotti Pietro Michiardi DiffM 43 0 0 04 Oct 2024
On Logical Extrapolation for Mazes with Recurrent and Implicit Networks Brandon Knutson Amandin Chyba Rabeendran Michael Ivanitskiy Jordan Pettyjohn Cecilia G. Diniz Behn Samy Wu Fung Daniel McKenzie LRM 47 2 0 03 Oct 2024
Meta-Models: An Architecture for Decoding LLM Behaviors Through Interpreted Embeddings and Natural Language Anthony Costarelli Mat Allen Severin Field 27 1 0 03 Oct 2024
Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations Nick Jiang Anish Kachinthaya Suzie Petryk Yossi Gandelsman VLM 39 17 0 03 Oct 2024