ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2309.10668
  4. Cited By
Language Modeling Is Compression

Language Modeling Is Compression

19 September 2023
Grégoire Delétang
Anian Ruoss
Paul-Ambroise Duquenne
Elliot Catt
Tim Genewein
Christopher Mattern
Jordi Grau-Moya
Wenliang Kevin Li
Matthew Aitchison
Laurent Orseau
Marcus Hutter
J. Veness
    AI4CE
ArXivPDFHTML

Papers citing "Language Modeling Is Compression"

50 / 101 papers shown
Title
Memorization-Compression Cycles Improve Generalization
Memorization-Compression Cycles Improve Generalization
Fangyuan Yu
16
0
0
13 May 2025
A Mathematical Philosophy of Explanations in Mechanistic Interpretability -- The Strange Science Part I.i
A Mathematical Philosophy of Explanations in Mechanistic Interpretability -- The Strange Science Part I.i
Kola Ayonrinde
Louis Jaburi
MILM
68
1
0
01 May 2025
Revisiting Transformers through the Lens of Low Entropy and Dynamic Sparsity
Revisiting Transformers through the Lens of Low Entropy and Dynamic Sparsity
Ruifeng Ren
Yong Liu
33
0
0
26 Apr 2025
GIViC: Generative Implicit Video Compression
GIViC: Generative Implicit Video Compression
Ge Gao
Siyue Teng
Tianhao Peng
Fan Zhang
David Bull
DiffM
VGen
36
0
0
25 Mar 2025
LZMidi: Compression-Based Symbolic Music Generation
LZMidi: Compression-Based Symbolic Music Generation
Connor Ding
Abhiram Gorle
Sagnik Bhattacharya
Divija Hasteer
Naomi Sagan
Tsachy Weissman
45
0
0
22 Mar 2025
The KoLMogorov Test: Compression by Code Generation
The KoLMogorov Test: Compression by Code Generation
Ori Yoran
Kunhao Zheng
Fabian Gloeckle
Jonas Gehring
Gabriel Synnaeve
Taco Cohen
62
1
0
18 Mar 2025
Measuring In-Context Computation Complexity via Hidden State Prediction
Measuring In-Context Computation Complexity via Hidden State Prediction
Vincent Herrmann
Róbert Csordás
Jürgen Schmidhuber
39
0
0
17 Mar 2025
TPC: Cross-Temporal Prediction Connection for Vision-Language Model Hallucination Reduction
Chao Wang
Weiwei Fu
Yang Zhou
MLLM
VLM
67
0
0
06 Mar 2025
Lossy Neural Compression for Geospatial Analytics: A Review
Carlos Gomes
Isabelle Wittmann
Damien Robert
Johannes Jakubik
Tim Reichelt
...
Romeo Kienzler
Rania Briq
Sabrina Benassou
Michele Lazzarini
C. Albrecht
79
2
0
03 Mar 2025
Predictive Data Selection: The Data That Predicts Is the Data That Teaches
Predictive Data Selection: The Data That Predicts Is the Data That Teaches
Kashun Shum
Y. Huang
Hongjian Zou
Qi Ding
Yixuan Liao
X. Chen
Qian Liu
Junxian He
55
2
0
02 Mar 2025
Towards Auto-Regressive Next-Token Prediction: In-Context Learning Emerges from Generalization
Towards Auto-Regressive Next-Token Prediction: In-Context Learning Emerges from Generalization
Zixuan Gong
Xiaolin Hu
Huayi Tang
Yong Liu
33
0
0
24 Feb 2025
Large Language Model for Lossless Image Compression with Visual Prompts
Large Language Model for Lossless Image Compression with Visual Prompts
Junhao Du
Chuqin Zhou
Ning Cao
Gang Chen
Yunuo Chen
Zhengxue Cheng
Li-Na Song
Guo Lu
Wenjun Zhang
VLM
47
1
0
22 Feb 2025
Mitigating Visual Knowledge Forgetting in MLLM Instruction-tuning via Modality-decoupled Gradient Descent
Mitigating Visual Knowledge Forgetting in MLLM Instruction-tuning via Modality-decoupled Gradient Descent
Junda Wu
Yuxin Xiong
Xintong Li
Yu Xia
Ruoyu Wang
...
Sungchul Kim
Ryan Rossi
Lina Yao
Jingbo Shang
Julian McAuley
CLL
VLM
49
0
0
17 Feb 2025
Token Communications: A Unified Framework for Cross-modal Context-aware Semantic Communications
Token Communications: A Unified Framework for Cross-modal Context-aware Semantic Communications
Li Qiao
Mahdi Boloursaz Mashhadi
Zhen Gao
Rahim Tafazolli
Mehdi Bennis
Dusit Niyato
79
2
0
17 Feb 2025
Large Language Diffusion Models
Large Language Diffusion Models
Shen Nie
Fengqi Zhu
Zebin You
Xiaolu Zhang
Jingyang Ou
Jun Hu
Jun Zhou
Yankai Lin
Ji-Rong Wen
Chongxuan Li
90
12
0
14 Feb 2025
SoundSpring: Loss-Resilient Audio Transceiver with Dual-Functional Masked Language Modeling
SoundSpring: Loss-Resilient Audio Transceiver with Dual-Functional Masked Language Modeling
Shengshi Yao
Jincheng Dai
Xiaoqi Qin
Sixian Wang
Siye Wang
K. Niu
Ping Zhang
31
0
0
22 Jan 2025
LMM-driven Semantic Image-Text Coding for Ultra Low-bitrate Learned
  Image Compression
LMM-driven Semantic Image-Text Coding for Ultra Low-bitrate Learned Image Compression
Shimon Murai
Heming Sun
J. Katto
VLM
64
0
0
20 Nov 2024
Training Compute-Optimal Protein Language Models
Training Compute-Optimal Protein Language Models
Xingyi Cheng
Bo Chen
Pan Li
Jing Gong
Jie Tang
Le Song
68
12
0
04 Nov 2024
MultiTok: Variable-Length Tokenization for Efficient LLMs Adapted from LZW Compression
MultiTok: Variable-Length Tokenization for Efficient LLMs Adapted from LZW Compression
Noel Elias
H. Esfahanizadeh
Kaan Kale
S. Vishwanath
Muriel Médard
24
0
0
28 Oct 2024
MatExpert: Decomposing Materials Discovery by Mimicking Human Experts
MatExpert: Decomposing Materials Discovery by Mimicking Human Experts
Qianggang Ding
Santiago Miret
Bang Liu
MoE
27
7
0
26 Oct 2024
In-context learning and Occam's razor
In-context learning and Occam's razor
Eric Elmoznino
Tom Marty
Tejas Kasetty
Léo Gagnon
Sarthak Mittal
Mahan Fathi
Dhanya Sridhar
Guillaume Lajoie
32
1
0
17 Oct 2024
MLP-SLAM: Multilayer Perceptron-Based Simultaneous Localization and
  Mapping With a Dynamic and Static Object Discriminator
MLP-SLAM: Multilayer Perceptron-Based Simultaneous Localization and Mapping With a Dynamic and Static Object Discriminator
Taozhe Li
Wei Sun
21
1
0
14 Oct 2024
Rethinking Data Selection at Scale: Random Selection is Almost All You
  Need
Rethinking Data Selection at Scale: Random Selection is Almost All You Need
Tingyu Xia
Bowen Yu
K. Dang
An Yang
Yuan Wu
Yuan Tian
Yi-Ju Chang
Junyang Lin
ALM
49
3
0
12 Oct 2024
Transformers learn variable-order Markov chains in-context
Transformers learn variable-order Markov chains in-context
Ruida Zhou
C. Tian
Suhas Diggavi
21
0
0
07 Oct 2024
Compression via Pre-trained Transformers: A Study on Byte-Level
  Multimodal Data
Compression via Pre-trained Transformers: A Study on Byte-Level Multimodal Data
David Heurtel-Depeiges
Anian Ruoss
Joel Veness
Tim Genewein
17
1
0
07 Oct 2024
Geometry is All You Need: A Unified Taxonomy of Matrix and Tensor
  Factorization for Compression of Generative Language Models
Geometry is All You Need: A Unified Taxonomy of Matrix and Tensor Factorization for Compression of Generative Language Models
Mingxue Xu
Sadia Sharmin
Danilo P. Mandic
14
2
0
03 Oct 2024
How Much Can RAG Help the Reasoning of LLM?
How Much Can RAG Help the Reasoning of LLM?
Jingyu Liu
Jiaen Lin
Yong Liu
LRM
18
9
0
03 Oct 2024
Language Models as Zero-shot Lossless Gradient Compressors: Towards General Neural Parameter Prior Models
Language Models as Zero-shot Lossless Gradient Compressors: Towards General Neural Parameter Prior Models
Hui-Po Wang
Mario Fritz
26
3
0
26 Sep 2024
Teaching Tailored to Talent: Adverse Weather Restoration via Prompt Pool
  and Depth-Anything Constraint
Teaching Tailored to Talent: Adverse Weather Restoration via Prompt Pool and Depth-Anything Constraint
Sixiang Chen
Tian-Chun Ye
K. Zhang
Zhaohu Xing
Yunlong Lin
Lei Zhu
DiffM
33
9
0
24 Sep 2024
Conversational Complexity for Assessing Risk in Large Language Models
Conversational Complexity for Assessing Risk in Large Language Models
John Burden
Manuel Cebrian
José Hernández Orallo
37
0
0
02 Sep 2024
Iterative Graph Alignment
Iterative Graph Alignment
Fangyuan Yu
H. S. Arora
Matt Johnson
26
1
0
29 Aug 2024
BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and
  Deduplication by Introducing a Competitive Large Language Model Baseline
BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline
Guosheng Dong
Da Pan
Yiding Sun
Shusen Zhang
Zheng Liang
...
Bingning Wang
Wentao Zhang
Jiaxin Mao
Zenan Zhou
Weipeng Chen
ALM
27
2
0
27 Aug 2024
Effective Demonstration Annotation for In-Context Learning via Language
  Model-Based Determinantal Point Process
Effective Demonstration Annotation for In-Context Learning via Language Model-Based Determinantal Point Process
Peng Wang
Xiaobin Wang
Chao Lou
Shengyu Mao
Pengjun Xie
Yong-jia Jiang
49
0
0
04 Aug 2024
Finch: Prompt-guided Key-Value Cache Compression
Finch: Prompt-guided Key-Value Cache Compression
Giulio Corallo
Paolo Papotti
33
3
0
31 Jul 2024
Training Foundation Models as Data Compression: On Information, Model Weights and Copyright Law
Training Foundation Models as Data Compression: On Information, Model Weights and Copyright Law
Giorgio Franceschelli
Claudia Cevenini
Mirco Musolesi
39
0
0
18 Jul 2024
Is Your Model Really A Good Math Reasoner? Evaluating Mathematical
  Reasoning with Checklist
Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist
Zihao Zhou
Shudong Liu
Maizhen Ning
Wei Liu
Jindong Wang
Derek F. Wong
Xiaowei Huang
Qiufeng Wang
Kaizhu Huang
ELM
LRM
61
23
0
11 Jul 2024
Coding for Intelligence from the Perspective of Category
Coding for Intelligence from the Perspective of Category
Wenhan Yang
Zixuan Hu
Lilang Lin
Jiaying Liu
Ling-Yu Duan
AI4CE
30
1
0
01 Jul 2024
Scaling Synthetic Data Creation with 1,000,000,000 Personas
Scaling Synthetic Data Creation with 1,000,000,000 Personas
Tao Ge
Xin Chan
Dian Yu
Haitao Mi
Dong Yu
Dong Yu
SyDa
106
89
0
28 Jun 2024
Ranking LLMs by compression
Ranking LLMs by compression
Peijia Guo
Ziguang Li
Haibo Hu
Chao Huang
Ming Li
Rui Zhang
23
1
0
20 Jun 2024
Measuring Sample Importance in Data Pruning for Training LLMs from a
  Data Compression Perspective
Measuring Sample Importance in Data Pruning for Training LLMs from a Data Compression Perspective
Minsang Kim
Seungjun Baek
29
0
0
20 Jun 2024
Security of AI Agents
Security of AI Agents
Yifeng He
Ethan Wang
Yuyang Rong
Zifei Cheng
Hao Chen
LLMAG
29
7
0
12 Jun 2024
Self-attention-based non-linear basis transformations for compact latent
  space modelling of dynamic optical fibre transmission matrices
Self-attention-based non-linear basis transformations for compact latent space modelling of dynamic optical fibre transmission matrices
Yijie Zheng
Robert J. Kilpatrick
David B. Phillips
G. Gordon
MedIm
18
1
0
11 Jun 2024
Image and Video Tokenization with Binary Spherical Quantization
Image and Video Tokenization with Binary Spherical Quantization
Yue Zhao
Yuanjun Xiong
Philipp Krahenbuhl
20
17
0
11 Jun 2024
Vision Model Pre-training on Interleaved Image-Text Data via Latent
  Compression Learning
Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning
Chenyu Yang
Xizhou Zhu
Jinguo Zhu
Weijie Su
Junjie Wang
...
Lewei Lu
Bin Li
Jie Zhou
Yu Qiao
Jifeng Dai
VLM
CLIP
31
4
0
11 Jun 2024
Deep Generative Modeling Reshapes Compression and Transmission: From
  Efficiency to Resiliency
Deep Generative Modeling Reshapes Compression and Transmission: From Efficiency to Resiliency
Jincheng Dai
Xiaoqi Qin
Sixian Wang
Lexi Xu
Kai Niu
Ping Zhang
23
4
0
10 Jun 2024
Language Models Resist Alignment
Language Models Resist Alignment
Jiaming Ji
Kaile Wang
Tianyi Qiu
Boyuan Chen
Jiayi Zhou
Changye Li
Hantao Lou
Yaodong Yang
31
1
0
10 Jun 2024
Synth-SBDH: A Synthetic Dataset of Social and Behavioral Determinants of
  Health for Clinical Text
Synth-SBDH: A Synthetic Dataset of Social and Behavioral Determinants of Health for Clinical Text
Avijit Mitra
Emily Druhl
Raelene Goodwin
Hong Yu
24
2
0
10 Jun 2024
Open-Endedness is Essential for Artificial Superhuman Intelligence
Open-Endedness is Essential for Artificial Superhuman Intelligence
Edward Hughes
Michael Dennis
Jack Parker-Holder
Feryal M. P. Behbahani
Aditi Mavalankar
Yuge Shi
Tom Schaul
Tim Rocktaschel
LRM
32
18
0
06 Jun 2024
Guiding ChatGPT to Generate Salient Domain Summaries
Guiding ChatGPT to Generate Salient Domain Summaries
Jun Gao
Ziqiang Cao
Shaoyao Huang
Luozheng Qin
Chunhui Ai
28
0
0
03 Jun 2024
DORY: Deliberative Prompt Recovery for LLM
DORY: Deliberative Prompt Recovery for LLM
Lirong Gao
Ru Peng
Yiming Zhang
Junbo Zhao
27
3
0
31 May 2024
123
Next