ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.01177
  4. Cited By
MutualFormer: Multi-Modality Representation Learning via Cross-Diffusion
  Attention

MutualFormer: Multi-Modality Representation Learning via Cross-Diffusion Attention

2 December 2021
Xixi Wang
Xiao Wang
Bo Jiang
Jin Tang
Bin Luo
    ViT
ArXivPDFHTML

Papers citing "MutualFormer: Multi-Modality Representation Learning via Cross-Diffusion Attention"

8 / 8 papers shown
Title
CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video
  Representations
CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations
Mohammadreza Zolfaghari
Yi Zhu
Peter V. Gehler
Thomas Brox
111
122
0
30 Sep 2021
Dyadformer: A Multi-modal Transformer for Long-Range Modeling of Dyadic
  Interactions
Dyadformer: A Multi-modal Transformer for Long-Range Modeling of Dyadic Interactions
D. Curto
Albert Clapés
Javier Selva
Sorina Smeureanu
Julio C. S. Jacques Junior
...
G. Guilera
D. Leiva
T. Moeslund
Sergio Escalera
Cristina Palmero
28
29
0
20 Sep 2021
The Right to Talk: An Audio-Visual Transformer Approach
The Right to Talk: An Audio-Visual Transformer Approach
Thanh-Dat Truong
C. Duong
T. D. Vu
H. Pham
Bhiksha Raj
Ngan Le
Khoa Luu
44
36
0
06 Aug 2021
MECT: Multi-Metadata Embedding based Cross-Transformer for Chinese Named
  Entity Recognition
MECT: Multi-Metadata Embedding based Cross-Transformer for Chinese Named Entity Recognition
Shuang Wu
Xiaoning Song
Zhenhua Feng
19
85
0
12 Jul 2021
Visual Saliency Transformer
Visual Saliency Transformer
Nian Liu
Ni Zhang
Kaiyuan Wan
Ling Shao
Junwei Han
ViT
246
281
0
25 Apr 2021
TransReID: Transformer-based Object Re-Identification
TransReID: Transformer-based Object Re-Identification
Shuting He
Haowen Luo
Pichao Wang
F. Wang
Hao Li
Wei Jiang
ViT
210
769
0
08 Feb 2021
RGB-D Salient Object Detection via 3D Convolutional Neural Networks
RGB-D Salient Object Detection via 3D Convolutional Neural Networks
Qian Chen
Ze Liu
Y. Zhang
Keren Fu
Qijun Zhao
H. Du
3DPC
24
148
0
25 Jan 2021
Multi-modal Transformer for Video Retrieval
Multi-modal Transformer for Video Retrieval
Valentin Gabeur
Chen Sun
Alahari Karteek
Cordelia Schmid
ViT
398
532
0
21 Jul 2020
1