ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2207.08677
  4. Cited By
Label2Label: A Language Modeling Framework for Multi-Attribute Learning

Label2Label: A Language Modeling Framework for Multi-Attribute Learning

18 July 2022
Wanhua Li
Zhexuan Cao
Jianjiang Feng
Jie Zhou
Jiwen Lu
    VLM
ArXivPDFHTML

Papers citing "Label2Label: A Language Modeling Framework for Multi-Attribute Learning"

8 / 8 papers shown
Title
FaceInsight: A Multimodal Large Language Model for Face Perception
FaceInsight: A Multimodal Large Language Model for Face Perception
Jingzhi Li
Changjiang Luo
Ruoyu Chen
Hua Zhang
Wenqi Ren
Jianhou Gan
Xiaochun Cao
CVBM
LRM
59
0
0
22 Apr 2025
Learning Transferable Pedestrian Representation from Multimodal
  Information Supervision
Learning Transferable Pedestrian Representation from Multimodal Information Supervision
Li-Na Bao
Longhui Wei
Xiaoyu Qiu
Wen-gang Zhou
Houqiang Li
Qi Tian
SSL
18
5
0
12 Apr 2023
POAR: Towards Open Vocabulary Pedestrian Attribute Recognition
POAR: Towards Open Vocabulary Pedestrian Attribute Recognition
Yue Zhang
Suchen Wang
Shichao Kan
Zhenyu Weng
Yigang Cen
Yap-Peng Tan
ViT
29
3
0
26 Mar 2023
Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense
  Video Captioning
Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning
Antoine Yang
Arsha Nagrani
Paul Hongsuck Seo
Antoine Miech
Jordi Pont-Tuset
Ivan Laptev
Josef Sivic
Cordelia Schmid
AI4TS
VLM
28
220
0
27 Feb 2023
Masked Autoencoders Are Scalable Vision Learners
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
305
7,434
0
11 Nov 2021
Pix2seq: A Language Modeling Framework for Object Detection
Pix2seq: A Language Modeling Framework for Object Detection
Ting-Li Chen
Saurabh Saxena
Lala Li
David J. Fleet
Geoffrey E. Hinton
MLLM
ViT
VLM
238
344
0
22 Sep 2021
CrossTransformers: spatially-aware few-shot transfer
CrossTransformers: spatially-aware few-shot transfer
Carl Doersch
Ankush Gupta
Andrew Zisserman
ViT
203
330
0
22 Jul 2020
Multi-task Learning of Cascaded CNN for Facial Attribute Classification
Multi-task Learning of Cascaded CNN for Facial Attribute Classification
Ni Zhuang
Y. Yan
Si Chen
Hanzi Wang
CVBM
34
34
0
03 May 2018
1