2108.02417
Structured Multi-modal Feature Embedding and Alignment for Image-Sentence Retrieval
5 August 2021
Xuri Ge
Fuhai Chen
J. Jose
Zhilong Ji
Zhongqin Wu
Xiao-Chang Liu
Papers citing "Structured Multi-modal Feature Embedding and Alignment for Image-Sentence Retrieval"
14 / 14 papers shown
Towards End-to-End Explainable Facial Action Unit Recognition via Vision-Language Joint Learning
Yaming Yang
Zhe Wang
Fuhai Chen
Wei Zhao
Weigang Lu
Joemon M. Jose
CVBM
28
1
0
01 Aug 2024
A Unified Graph Transformer for Overcoming Isolations in Multi-modal Recommendation
Zixuan Yi
I. Ounis
23
2
0
29 Jul 2024
Beat: Bi-directional One-to-Many Embedding Alignment for Text-based Person Retrieval
Yiwei Ma
Xiaoshuai Sun
Jiayi Ji
Guannan Jiang
Weilin Zhuang
Rongrong Ji
31
15
0
09 Jun 2024
Hire: Hybrid-modal Interaction with Multiple Relational Enhancements for Image-Text Matching
Xuri Ge
Fuhai Chen
Songpei Xu
Fuxiang Tao
Jie Wang
Joemon M. Jose
34
0
0
05 Jun 2024
3SHNet: Boosting Image-Sentence Retrieval via Visual Semantic-Spatial Self-Highlighting
Xuri Ge
Songpei Xu
Fuhai Chen
Jie Wang
Guoxin Wang
Shan An
Joemon M. Jose
3DPC
27
10
0
26 Apr 2024
IISAN: Efficiently Adapting Multimodal Representation for Sequential Recommendation with Decoupled PEFT
Junchen Fu
Xuri Ge
Xin Xin
Alexandros Karatzoglou
Ioannis Arapakis
Jie Wang
Joemon M. Jose
35
17
0
02 Apr 2024
CLCE: An Approach to Refining Cross-Entropy and Contrastive Learning for Optimized Learning Fusion
Zijun Long
George Killick
Lipeng Zhuang
Gerardo Aragon Camarasa
Zaiqiao Meng
R. McCreadie
VLM
37
2
0
22 Feb 2024
A New Fine-grained Alignment Method for Image-text Matching
Yang Zhang
13
1
0
03 Nov 2023
RoboLLM: Robotic Vision Tasks Grounded on Multimodal Large Language Models
Zijun Long
George Killick
R. McCreadie
Gerardo Aragon Camarasa
VLM
22
11
0
16 Oct 2023
HGAN: Hierarchical Graph Alignment Network for Image-Text Retrieval
Jie Guo
Meiting Wang
Yan Zhou
Bin Song
Yuhao Chi
Wei-liang Fan
Jianglong Chang
37
15
0
16 Dec 2022
Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval
Xuri Ge
Fuhai Chen
Songpei Xu
Fuxiang Tao
J. Jose
25
26
0
17 Oct 2022
CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval
Haoran Wang
Dongliang He
Wenhao Wu
Boyang Xia
Min Yang
Fu Li
Yunlong Yu
Zhong Ji
Errui Ding
Jingdong Wang
22
22
0
21 Aug 2022
Chunk-aware Alignment and Lexical Constraint for Visual Entailment with Natural Language Explanations
Qian Yang
Yunxin Li
Baotian Hu
Lin Ma
Yuxin Ding
Min Zhang
22
10
0
23 Jul 2022
Automatic Facial Paralysis Estimation with Facial Action Units
Xuri Ge
J. Jose
Pengcheng Wang
Arunachalam Iyer
Xiao-Chang Liu
Hu Han
CVBM
15
6
0
03 Mar 2022