ResearchTrend.AI
From Text to Pixel: Advancing Long-Context Understanding in MLLMs

23 May 2024
Yujie Lu
Xiujun Li
Tsu-Jui Fu
Miguel P. Eckstein
William Y. Wang
    VLM
ArXiv (abs) | PDF | HTML | GitHub (11★)

Papers citing "From Text to Pixel: Advancing Long-Context Understanding in MLLMs"

3 / 3 papers shown
Text or Pixels? It Takes Half: On the Token Efficiency of Visual Text Inputs in Multimodal LLMs
Yanhong Li
Zixuan Lan
Jiawei Zhou
VLM
255
3
0
21 Oct 2025
Exploring a Unified Vision-Centric Contrastive Alternatives on Multi-Modal Web Documents
Yiqi Lin
Alex Jinpeng Wang
Linjie Li
Zhengyuan Yang
Mike Zheng Shou
178
1
0
21 Oct 2025
A Review of Multimodal Explainable Artificial Intelligence: Past, Present and Future
Shilin Sun
Wenbin An
Feng Tian
Fang Nan
Qidong Liu
Jing Liu
N. Shah
Ping Chen
520
22
0
18 Dec 2024