Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2405.14213
Cited By
From Text to Pixel: Advancing Long-Context Understanding in MLLMs
23 May 2024
Yujie Lu
Xiujun Li
Tsu-Jui Fu
Miguel P. Eckstein
William Y. Wang
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Github (11★)
Papers citing
"From Text to Pixel: Advancing Long-Context Understanding in MLLMs"
3 / 3 papers shown
Text or Pixels? It Takes Half: On the Token Efficiency of Visual Text Inputs in Multimodal LLMs
Yanhong Li
Zixuan Lan
Jiawei Zhou
VLM
255
3
0
21 Oct 2025
Exploring a Unified Vision-Centric Contrastive Alternatives on Multi-Modal Web Documents
Yiqi Lin
Alex Jinpeng Wang
Linjie Li
Zhengyuan Yang
Mike Zheng Shou
178
1
0
21 Oct 2025
A Review of Multimodal Explainable Artificial Intelligence: Past, Present and Future
Shilin Sun
Wenbin An
Feng Tian
Fang Nan
Qidong Liu
Jing Liu
N. Shah
Ping Chen
520
22
0
18 Dec 2024
1
Page 1 of 1