ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2309.10668
  4. Cited By
Language Modeling Is Compression

Language Modeling Is Compression

19 September 2023
Grégoire Delétang
Anian Ruoss
Paul-Ambroise Duquenne
Elliot Catt
Tim Genewein
Christopher Mattern
Jordi Grau-Moya
Wenliang Kevin Li
Matthew Aitchison
Laurent Orseau
Marcus Hutter
J. Veness
    AI4CE
ArXivPDFHTML

Papers citing "Language Modeling Is Compression"

50 / 101 papers shown
Title
Easy Problems That LLMs Get Wrong
Easy Problems That LLMs Get Wrong
Sean Williams
James Huckle
LRM
19
10
0
30 May 2024
gzip Predicts Data-dependent Scaling Laws
gzip Predicts Data-dependent Scaling Laws
Rohan Pandey
14
9
0
26 May 2024
Emergence of a High-Dimensional Abstraction Phase in Language Transformers
Emergence of a High-Dimensional Abstraction Phase in Language Transformers
Emily Cheng
Diego Doimo
Corentin Kervadec
Iuri Macocco
Jade Yu
A. Laio
Marco Baroni
104
11
0
24 May 2024
SPO: Multi-Dimensional Preference Sequential Alignment With Implicit
  Reward Modeling
SPO: Multi-Dimensional Preference Sequential Alignment With Implicit Reward Modeling
Xingzhou Lou
Junge Zhang
Jian Xie
Lifeng Liu
Dong Yan
Kaiqi Huang
29
11
0
21 May 2024
Hallucination of Multimodal Large Language Models: A Survey
Hallucination of Multimodal Large Language Models: A Survey
Zechen Bai
Pichao Wang
Tianjun Xiao
Tong He
Zongbo Han
Zheng Zhang
Mike Zheng Shou
VLM
LRM
71
136
0
29 Apr 2024
Tele-FLM Technical Report
Tele-FLM Technical Report
Xiang Li
Yiqun Yao
Xin Jiang
Xuezhi Fang
Chao Wang
...
Yequan Wang
Zhongjiang He
Zhongyuan Wang
Xuelong Li
Tiejun Huang
30
3
0
25 Apr 2024
Rethinking LLM Memorization through the Lens of Adversarial Compression
Rethinking LLM Memorization through the Lens of Adversarial Compression
Avi Schwarzschild
Zhili Feng
Pratyush Maini
Zachary Chase Lipton
J. Zico Kolter
39
38
0
23 Apr 2024
Compression Represents Intelligence Linearly
Compression Represents Intelligence Linearly
Yuzhen Huang
Jinghan Zhang
Zifei Shan
Junxian He
39
24
0
15 Apr 2024
PNeRV: Enhancing Spatial Consistency via Pyramidal Neural Representation
  for Videos
PNeRV: Enhancing Spatial Consistency via Pyramidal Neural Representation for Videos
Qi Zhao
M. Salman Asif
Zhan Ma
19
3
0
13 Apr 2024
Training LLMs over Neurally Compressed Text
Training LLMs over Neurally Compressed Text
Brian Lester
Jaehoon Lee
A. Alemi
Jeffrey Pennington
Adam Roberts
Jascha Narain Sohl-Dickstein
Noah Constant
22
6
0
04 Apr 2024
Generative AI for Immersive Communication: The Next Frontier in
  Internet-of-Senses Through 6G
Generative AI for Immersive Communication: The Next Frontier in Internet-of-Senses Through 6G
Nassim Sehad
Lina Bariah
W. Hamidouche
Hamed Hellaoui
Riku Jäntti
Mérouane Debbah
13
14
0
02 Apr 2024
A Survey on Large Language Model-Based Game Agents
A Survey on Large Language Model-Based Game Agents
Sihao Hu
Tiansheng Huang
Gaowen Liu
Ramana Rao Kompella
Gaowen Liu
Selim Furkan Tekin
Yichang Xu
Zachary Yahn
Ling Liu
LLMAG
LM&Ro
AI4CE
LM&MA
62
49
0
02 Apr 2024
Is Complexity an Illusion?
Is Complexity an Illusion?
Michael Timothy Bennett
21
3
0
31 Mar 2024
Reverse Training to Nurse the Reversal Curse
Reverse Training to Nurse the Reversal Curse
O. Yu. Golovneva
Zeyuan Allen-Zhu
Jason Weston
Sainbayar Sukhbaatar
18
32
0
20 Mar 2024
Unifying Generation and Compression: Ultra-low bitrate Image Coding Via
  Multi-stage Transformer
Unifying Generation and Compression: Ultra-low bitrate Image Coding Via Multi-stage Transformer
Naifu Xue
Qi Mao
Zijian Wang
Yuan Zhang
Siwei Ma
20
5
0
06 Mar 2024
On the Compressibility of Quantized Large Language Models
On the Compressibility of Quantized Large Language Models
Yu Mao
Weilan Wang
Hongchao Du
Nan Guan
Chun Jason Xue
MQ
23
6
0
03 Mar 2024
Rethinking Tokenization: Crafting Better Tokenizers for Large Language
  Models
Rethinking Tokenization: Crafting Better Tokenizers for Large Language Models
Jinbiao Yang
LLMAG
38
9
0
01 Mar 2024
Beyond Language Models: Byte Models are Digital World Simulators
Beyond Language Models: Byte Models are Digital World Simulators
Shangda Wu
Xu Tan
Zili Wang
Rui Wang
Xiaobing Li
Maosong Sun
25
12
0
29 Feb 2024
Towards Optimal Learning of Language Models
Towards Optimal Learning of Language Models
Yuxian Gu
Li Dong
Y. Hao
Qingxiu Dong
Minlie Huang
Furu Wei
36
7
0
27 Feb 2024
HypoTermQA: Hypothetical Terms Dataset for Benchmarking Hallucination
  Tendency of LLMs
HypoTermQA: Hypothetical Terms Dataset for Benchmarking Hallucination Tendency of LLMs
Cem Uluoglakci
T. Taşkaya-Temizel
HILM
27
2
0
25 Feb 2024
Subobject-level Image Tokenization
Subobject-level Image Tokenization
Delong Chen
Samuel Cahyawijaya
Jianfeng Liu
Baoyuan Wang
Pascale Fung
VLM
OCL
46
6
0
22 Feb 2024
Integrating Pre-Trained Language Model with Physical Layer
  Communications
Integrating Pre-Trained Language Model with Physical Layer Communications
Ju-Hyung Lee
Dong-Ho Lee
Joohan Lee
Jay Pujara
26
3
0
18 Feb 2024
Efficient and Scalable Fine-Tune of Language Models for Genome
  Understanding
Efficient and Scalable Fine-Tune of Language Models for Genome Understanding
Huixin Zhan
Ying Nian Wu
Zijun Zhang
ALM
19
1
0
12 Feb 2024
Retrieval-Augmented Thought Process as Sequential Decision Making
Retrieval-Augmented Thought Process as Sequential Decision Making
T. Pouplin
Hao Sun
Samuel Holt
M. Schaar
KELM
RALM
LRM
6
2
0
12 Feb 2024
Dimensionality reduction can be used as a surrogate model for
  high-dimensional forward uncertainty quantification
Dimensionality reduction can be used as a surrogate model for high-dimensional forward uncertainty quantification
Jungho Kim
Sang-ri Yi
Ziqi Wang
12
5
0
07 Feb 2024
Fine-Tuned Language Models Generate Stable Inorganic Materials as Text
Fine-Tuned Language Models Generate Stable Inorganic Materials as Text
Nate Gruver
Anuroop Sriram
Andrea Madotto
A. Wilson
C. L. Zitnick
Zachary W. Ulissi
20
53
0
06 Feb 2024
Large Language Models are Geographically Biased
Large Language Models are Geographically Biased
Rohin Manvi
Samar Khanna
Marshall Burke
David B. Lobell
Stefano Ermon
20
39
0
05 Feb 2024
Spiking Music: Audio Compression with Event Based Auto-encoders
Spiking Music: Audio Compression with Event Based Auto-encoders
Martim Lisboa
Guillaume Bellec
19
2
0
02 Feb 2024
Evaluating Large Language Models for Generalization and Robustness via
  Data Compression
Evaluating Large Language Models for Generalization and Robustness via Data Compression
Yucheng Li
Yunhao Guo
Frank Guerin
Chenghua Lin
ELM
20
5
0
01 Feb 2024
The Information of Large Language Model Geometry
The Information of Large Language Model Geometry
Zhiquan Tan
Chenghai Li
Weiran Huang
13
1
0
01 Feb 2024
Large Language Model Evaluation via Matrix Entropy
Large Language Model Evaluation via Matrix Entropy
Lai Wei
Zhiquan Tan
Chenghai Li
Jindong Wang
Weiran Huang
23
5
0
30 Jan 2024
A Comprehensive Study of Knowledge Editing for Large Language Models
A Comprehensive Study of Knowledge Editing for Large Language Models
Ningyu Zhang
Yunzhi Yao
Bo Tian
Peng Wang
Shumin Deng
...
Lei Liang
Zhiqiang Zhang
Xiao-Jun Zhu
Jun Zhou
Huajun Chen
KELM
26
76
0
02 Jan 2024
Non-Vacuous Generalization Bounds for Large Language Models
Non-Vacuous Generalization Bounds for Large Language Models
Sanae Lotfi
Marc Finzi
Yilun Kuang
Tim G. J. Rudner
Micah Goldblum
Andrew Gordon Wilson
15
20
0
28 Dec 2023
One Fits All: Universal Time Series Analysis by Pretrained LM and
  Specially Designed Adaptors
One Fits All: Universal Time Series Analysis by Pretrained LM and Specially Designed Adaptors
Tian Zhou
Peisong Niu
Xue Wang
Liang Sun
Rong Jin
AI4TS
65
2
0
24 Nov 2023
LLMs may Dominate Information Access: Neural Retrievers are Biased
  Towards LLM-Generated Texts
LLMs may Dominate Information Access: Neural Retrievers are Biased Towards LLM-Generated Texts
Sunhao Dai
Yuqi Zhou
Liang Pang
Weihao Liu
Xiaolin Hu
Yong Liu
Xiao Zhang
Gang Wang
Jun Xu
39
26
0
31 Oct 2023
In-Context Learning Dynamics with Random Binary Sequences
In-Context Learning Dynamics with Random Binary Sequences
Eric J. Bigelow
Ekdeep Singh Lubana
Robert P. Dick
Hidenori Tanaka
T. Ullman
16
4
0
26 Oct 2023
Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from
  a Parametric Perspective
Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective
Ming Zhong
Chenxin An
Weizhu Chen
Jiawei Han
Pengcheng He
21
8
0
17 Oct 2023
Tailored Visions: Enhancing Text-to-Image Generation with Personalized
  Prompt Rewriting
Tailored Visions: Enhancing Text-to-Image Generation with Personalized Prompt Rewriting
Zijie Chen
Lichao Zhang
Fangsheng Weng
Lili Pan
Zhenzhong Lan
17
9
0
12 Oct 2023
Large Language Models Are Zero-Shot Time Series Forecasters
Large Language Models Are Zero-Shot Time Series Forecasters
Nate Gruver
Marc Finzi
Shikai Qiu
Andrew Gordon Wilson
AI4TS
27
313
0
11 Oct 2023
GeoLLM: Extracting Geospatial Knowledge from Large Language Models
GeoLLM: Extracting Geospatial Knowledge from Large Language Models
Rohin Manvi
Samar Khanna
Gengchen Mai
Marshall Burke
David B. Lobell
Stefano Ermon
8
41
0
10 Oct 2023
Grokking as Compression: A Nonlinear Complexity Perspective
Grokking as Compression: A Nonlinear Complexity Perspective
Ziming Liu
Ziqian Zhong
Max Tegmark
12
9
0
09 Oct 2023
In-context Autoencoder for Context Compression in a Large Language Model
In-context Autoencoder for Context Compression in a Large Language Model
Tao Ge
Jing Hu
Lei Wang
Xun Wang
Si-Qing Chen
Furu Wei
RALM
24
66
0
13 Jul 2023
Training Transitive and Commutative Multimodal Transformers with LoReTTa
Training Transitive and Commutative Multimodal Transformers with LoReTTa
Manuel Tran
Yashin Dicente Cid
Amal Lahiani
Fabian J. Theis
Tingying Peng
Eldad Klaiman
13
2
0
23 May 2023
Sparks of Artificial General Intelligence: Early experiments with GPT-4
Sparks of Artificial General Intelligence: Early experiments with GPT-4
Sébastien Bubeck
Varun Chandrasekaran
Ronen Eldan
J. Gehrke
Eric Horvitz
...
Scott M. Lundberg
Harsha Nori
Hamid Palangi
Marco Tulio Ribeiro
Yi Zhang
ELM
AI4MH
AI4CE
ALM
203
2,232
0
22 Mar 2023
Accelerated Deep Lossless Image Coding with Unified Paralleleized GPU
  Coding Architecture
Accelerated Deep Lossless Image Coding with Unified Paralleleized GPU Coding Architecture
Benjamin Lukas Cajus Barzen
Fedor Glazov
Jonas Geistert
T. Sikora
8
3
0
11 Jul 2022
Neural Networks and the Chomsky Hierarchy
Neural Networks and the Chomsky Hierarchy
Grégoire Delétang
Anian Ruoss
Jordi Grau-Moya
Tim Genewein
L. Wenliang
...
Chris Cundy
Marcus Hutter
Shane Legg
Joel Veness
Pedro A. Ortega
UQCV
94
129
0
05 Jul 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
315
8,261
0
28 Jan 2022
Big Bird: Transformers for Longer Sequences
Big Bird: Transformers for Longer Sequences
Manzil Zaheer
Guru Guruganesh
Kumar Avinava Dubey
Joshua Ainslie
Chris Alberti
...
Philip Pham
Anirudh Ravula
Qifan Wang
Li Yang
Amr Ahmed
VLM
249
1,982
0
28 Jul 2020
Learning Better Lossless Compression Using Lossy Compression
Learning Better Lossless Compression Using Lossy Compression
Fabian Mentzer
Luc Van Gool
Michael Tschannen
58
67
0
23 Mar 2020
Scaling Laws for Neural Language Models
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
220
4,424
0
23 Jan 2020
Previous
123
Next