Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2207.08677
Cited By
Label2Label: A Language Modeling Framework for Multi-Attribute Learning
18 July 2022
Wanhua Li
Zhexuan Cao
Jianjiang Feng
Jie Zhou
Jiwen Lu
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Label2Label: A Language Modeling Framework for Multi-Attribute Learning"
8 / 8 papers shown
Title
FaceInsight: A Multimodal Large Language Model for Face Perception
Jingzhi Li
Changjiang Luo
Ruoyu Chen
Hua Zhang
Wenqi Ren
Jianhou Gan
Xiaochun Cao
CVBM
LRM
57
0
0
22 Apr 2025
Learning Transferable Pedestrian Representation from Multimodal Information Supervision
Li-Na Bao
Longhui Wei
Xiaoyu Qiu
Wen-gang Zhou
Houqiang Li
Qi Tian
SSL
18
5
0
12 Apr 2023
POAR: Towards Open Vocabulary Pedestrian Attribute Recognition
Yue Zhang
Suchen Wang
Shichao Kan
Zhenyu Weng
Yigang Cen
Yap-Peng Tan
ViT
29
3
0
26 Mar 2023
Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning
Antoine Yang
Arsha Nagrani
Paul Hongsuck Seo
Antoine Miech
Jordi Pont-Tuset
Ivan Laptev
Josef Sivic
Cordelia Schmid
AI4TS
VLM
23
220
0
27 Feb 2023
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
305
7,434
0
11 Nov 2021
Pix2seq: A Language Modeling Framework for Object Detection
Ting-Li Chen
Saurabh Saxena
Lala Li
David J. Fleet
Geoffrey E. Hinton
MLLM
ViT
VLM
235
344
0
22 Sep 2021
CrossTransformers: spatially-aware few-shot transfer
Carl Doersch
Ankush Gupta
Andrew Zisserman
ViT
201
330
0
22 Jul 2020
Multi-task Learning of Cascaded CNN for Facial Attribute Classification
Ni Zhuang
Y. Yan
Si Chen
Hanzi Wang
CVBM
34
34
0
03 May 2018
1