ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2301.03344
  4. Cited By
Universal Multimodal Representation for Language Understanding

Universal Multimodal Representation for Language Understanding

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
9 January 2023
Zhuosheng Zhang
Kehai Chen
Rui Wang
Masao Utiyama
Eiichiro Sumita
Z. Li
Hai Zhao
    SSL
ArXiv (abs)PDFHTMLGithub (171★)

Papers citing "Universal Multimodal Representation for Language Understanding"

10 / 10 papers shown
A Multimodal-Multitask Framework with Cross-modal Relation and Hierarchical Interactive Attention for Semantic Comprehension
A Multimodal-Multitask Framework with Cross-modal Relation and Hierarchical Interactive Attention for Semantic ComprehensionInformation Fusion (Inf. Fusion), 2025
Mohammad Zia Ur Rehman
Devraj Raghuvanshi
Umang Jain
Shubhi Bansal
Nagendra Kumar
154
5
0
22 Aug 2025
ADAT: Time-Series-Aware Adaptive Transformer Architecture for Sign Language Translation
ADAT: Time-Series-Aware Adaptive Transformer Architecture for Sign Language Translation
Nada Shahin
Leila Ismail
SLR
261
1
0
16 Apr 2025
A Survey: Spatiotemporal Consistency in Video Generation
A Survey: Spatiotemporal Consistency in Video Generation
Zhiyu Yin
Kehai Chen
Xuefeng Bai
Ruili Jiang
Junlin Li
Hongdong Li
Jin Liu
Yang Xiang
Jun Yu
Min Zhang
EGVMVGenAI4TS
445
0
0
25 Feb 2025
SAFE-MEME: Structured Reasoning Framework for Robust Hate Speech Detection in Memes
SAFE-MEME: Structured Reasoning Framework for Robust Hate Speech Detection in Memes
Palash Nandi
Shivam Sharma
Tanmoy Chakraborty
279
6
0
31 Dec 2024
Energy-Latency Manipulation of Multi-modal Large Language Models via
  Verbose Samples
Energy-Latency Manipulation of Multi-modal Large Language Models via Verbose Samples
Kuofeng Gao
Jindong Gu
Yang Bai
Shu-Tao Xia
Juil Sock
Wei Liu
Zhifeng Li
374
18
0
25 Apr 2024
GeReA: Question-Aware Prompt Captions for Knowledge-based Visual
  Question Answering
GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering
Ziyu Ma
Shutao Li
Bin Sun
Jianfei Cai
Zuxiang Long
Fuyan Ma
316
8
0
04 Feb 2024
Multi-modal Latent Space Learning for Chain-of-Thought Reasoning in
  Language Models
Multi-modal Latent Space Learning for Chain-of-Thought Reasoning in Language ModelsAAAI Conference on Artificial Intelligence (AAAI), 2023
Liqi He
Zuchao Li
Xiantao Cai
Ping Wang
LRM
257
38
0
14 Dec 2023
Multimodal Prompt Learning for Product Title Generation with Extremely
  Limited Labels
Multimodal Prompt Learning for Product Title Generation with Extremely Limited LabelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Bang-ju Yang
Fenglin Liu
Zheng Li
Qingyu Yin
Chenyu You
Bing Yin
Yuexian Zou
VLM
294
7
0
05 Jul 2023
VITR: Augmenting Vision Transformers with Relation-Focused Learning for
  Cross-Modal Information Retrieval
VITR: Augmenting Vision Transformers with Relation-Focused Learning for Cross-Modal Information RetrievalACM Transactions on Knowledge Discovery from Data (TKDD), 2023
Yansong Gong
Georgina Cosma
Axel Finke
ViT
358
4
0
13 Feb 2023
Multimodal Chain-of-Thought Reasoning in Language Models
Multimodal Chain-of-Thought Reasoning in Language Models
Zhuosheng Zhang
Aston Zhang
Mu Li
Hai Zhao
George Karypis
Alexander J. Smola
LRM
689
805
0
02 Feb 2023
1
Page 1 of 1