Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2402.07116
Cited By

A Benchmark for Multi-modal Foundation Models on Low-level Vision: from
Single Images to Pairs

A Benchmark for Multi-modal Foundation Models on Low-level Vision: from Single Images to Pairs

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024

11 February 2024

Zicheng Zhang

Haoning Wu

Guangtao Zhai

Weisi Lin

ArXiv (abs)PDF HTML HuggingFace (2 upvotes)Github (268★)

Papers citing "A Benchmark for Multi-modal Foundation Models on Low-level Vision: from Single Images to Pairs"

7 / 7 papers shown

MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning

MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning

Haotian Zhang

Mingfei Gao

...

Zirui Wang

Yinfei Yang

307

67

1

30 Sep 2024

A Survey on Multimodal Benchmarks: In the Era of Large AI Models

A Survey on Multimodal Benchmarks: In the Era of Large AI Models

Jun Xiao

Long Chen

346

23

0

21 Sep 2024

A Survey on Evaluation of Multimodal Large Language Models

A Survey on Evaluation of Multimodal Large Language Models

Jiaxing Huang

Jingyi Zhang

309

44

0

28 Aug 2024

LMM-VQA: Advancing Video Quality Assessment with Large Multimodal Models

LMM-VQA: Advancing Video Quality Assessment with Large Multimodal Models

Wei Sun

Yunhao Li

Fengyu Sun

Shangling Jui

Xiongkuo Min

Guangtao Zhai

203

24

0

26 Aug 2024

mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal
Large Language Models

mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language ModelsInternational Conference on Learning Representations (ICLR), 2024

Ming Yan

Fei Huang

Jingren Zhou

317

240

0

09 Aug 2024

Enhancing Descriptive Image Quality Assessment with A Large-scale Multi-modal Dataset

Enhancing Descriptive Image Quality Assessment with A Large-scale Multi-modal DatasetIEEE Transactions on Image Processing (TIP), 2024

490

38

0

29 May 2024

Depicting Beyond Scores: Advancing Image Quality Assessment through
Multi-modal Language Models

Depicting Beyond Scores: Advancing Image Quality Assessment through Multi-modal Language ModelsEuropean Conference on Computer Vision (ECCV), 2023

Jinjin Gu

401

92

0

14 Dec 2023

Page 1 of 1