ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2303.14302
  4. Cited By
VILA: Learning Image Aesthetics from User Comments with Vision-Language
  Pretraining

VILA: Learning Image Aesthetics from User Comments with Vision-Language Pretraining

24 March 2023
Junjie Ke
Keren Ye
Jiahui Yu
Yonghui Wu
P. Milanfar
Feng Yang
    VLM
ArXivPDFHTML

Papers citing "VILA: Learning Image Aesthetics from User Comments with Vision-Language Pretraining"

10 / 10 papers shown
Title
Charm: The Missing Piece in ViT fine-tuning for Image Aesthetic Assessment
Charm: The Missing Piece in ViT fine-tuning for Image Aesthetic Assessment
Fatemeh Behrad
Tinne Tuytelaars
Johan Wagemans
ViT
38
0
0
03 Apr 2025
KnobGen: Controlling the Sophistication of Artwork in Sketch-Based Diffusion Models
KnobGen: Controlling the Sophistication of Artwork in Sketch-Based Diffusion Models
Pouyan Navard
Amin Karimi Monsefi
Mengxi Zhou
Wei-Lun Chao
Alper Yilmaz
R. Ramnath
DiffM
51
2
0
02 Oct 2024
Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs
Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs
Zicheng Zhang
Ziheng Jia
H. Wu
Chunyi Li
Zijian Chen
...
Wei Sun
Xiaohong Liu
Xiongkuo Min
Weisi Lin
Guangtao Zhai
32
7
0
30 Sep 2024
Cropper: Vision-Language Model for Image Cropping through In-Context Learning
Cropper: Vision-Language Model for Image Cropping through In-Context Learning
Seung Hyun Lee
Junjie Ke
Yinxiao Li
Junfeng He
Steven Hickson
...
Irfan Essa
Sangpil Kim
Ming-Hsuan Yang
Irfan Essa
Feng Yang
VLM
49
0
0
14 Aug 2024
Second Place Solution of WSDM2023 Toloka Visual Question Answering
  Challenge
Second Place Solution of WSDM2023 Toloka Visual Question Answering Challenge
Xiangyu Wu
Zhouyang Chi
Yang Yang
Jianfeng Lu
42
0
0
05 Jul 2024
AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in
  Text-to-Image Generation
AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in Text-to-Image Generation
Jingkun An
Yinghao Zhu
Zongjian Li
Haoran Feng
Bohua Chen
Yemin Shi
Chengwei Pan
43
2
0
20 Mar 2024
Advancing Text-Driven Chest X-Ray Generation with Policy-Based
  Reinforcement Learning
Advancing Text-Driven Chest X-Ray Generation with Policy-Based Reinforcement Learning
Woojung Han
Chanyoung Kim
Dayun Ju
Yumin Shim
Seong Jae Hwang
MedIm
37
8
0
11 Mar 2024
SPIRE: Semantic Prompt-Driven Image Restoration
SPIRE: Semantic Prompt-Driven Image Restoration
Chenyang Qi
Zhengzhong Tu
Keren Ye
M. Delbracio
P. Milanfar
Qifeng Chen
Hossein Talebi
DiffM
31
11
0
18 Dec 2023
MUSIQ: Multi-scale Image Quality Transformer
MUSIQ: Multi-scale Image Quality Transformer
Junjie Ke
Qifei Wang
Yilin Wang
P. Milanfar
Feng Yang
174
628
0
12 Aug 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy
  Text Supervision
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
322
3,708
0
11 Feb 2021
1