Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2110.10949
Cited By
Multimodal Learning using Optimal Transport for Sarcasm and Humor Detection
21 October 2021
Shraman Pramanick
A. Roy
Vishal M. Patel
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Multimodal Learning using Optimal Transport for Sarcasm and Humor Detection"
28 / 28 papers shown
Title
Can Large Language Models Help Multimodal Language Analysis? MMLA: A Comprehensive Benchmark
Hanlei Zhang
Zhuohang Li
Yeshuang Zhu
Hua Xu
Peiwu Wang
Haige Zhu
Jie Zhou
Jinchao Zhang
130
0
0
23 Apr 2025
A Survey of Multimodal Sarcasm Detection
Shafkat Farabi
Tharindu Ranasinghe
Diptesh Kanojia
Yu Kong
Marcos Zampieri
51
4
0
24 Oct 2024
Multi-View Multi-Task Modeling with Speech Foundation Models for Speech Forensic Tasks
Orchid Chetia Phukan
Devyani Koshal
Swarup Ranjan Behera
Arun Balaji Buduru
Rajesh Sharma
53
1
0
16 Oct 2024
An Eye for an Ear: Zero-shot Audio Description Leveraging an Image Captioner using Audiovisual Distribution Alignment
Hugo Malard
Michel Olvera
Stéphane Lathuilière
S. Essid
VLM
66
0
0
08 Oct 2024
Strong Alone, Stronger Together: Synergizing Modality-Binding Foundation Models with Optimal Transport for Non-Verbal Emotion Recognition
Orchid Chetia Phukan
Mohd Mujtaba Akhtar
Girish
Swarup Ranjan Behera
Sishir Kalita
Arun Balaji Buduru
Rajesh Sharma
S. R Mahadeva Prasanna
EgoV
93
0
0
21 Sep 2024
NYK-MS: A Well-annotated Multi-modal Metaphor and Sarcasm Understanding Benchmark on Cartoon-Caption Dataset
Ke Chang
Hao Li
Junzhao Zhang
Yunfang Wu
78
0
0
02 Sep 2024
End-to-end Semantic-centric Video-based Multimodal Affective Computing
Ronghao Lin
Ying Zeng
Sijie Mai
Haifeng Hu
VGen
118
0
0
14 Aug 2024
Modelling Visual Semantics via Image Captioning to extract Enhanced Multi-Level Cross-Modal Semantic Incongruity Representation with Attention for Multimodal Sarcasm Detection
Sajal Aggarwal
Ananya Pandey
Dinesh Kumar Vishwakarma
78
2
0
05 Aug 2024
Multimodal Prototyping for cancer survival prediction
Andrew H. Song
Richard J. Chen
Guillaume Jaume
Anurag J. Vaidya
Alexander S. Baras
Faisal Mahmood
95
17
0
28 Jun 2024
COT: A Generative Approach for Hate Speech Counter-Narratives via Contrastive Optimal Transport
Linhao Zhang
Li Jin
Guangluan Xu
Xiaoyu Li
Xian Sun
65
0
0
18 Jun 2024
The MuSe 2024 Multimodal Sentiment Analysis Challenge: Social Perception and Humor Recognition
Shahin Amiriparian
Lukas Christ
Alexander Kathan
Maurice Gerczuk
Niklas Muller
...
Lukas Stappen
Andreas Konig
Min Zhang
Björn Schuller
Simone Eulitz
123
11
0
11 Jun 2024
Encoding and Controlling Global Semantics for Long-form Video Question Answering
Thong Nguyen
Zhiyuan Hu
Xiaobao Wu
Cong-Duy Nguyen
See-Kiong Ng
Anh Tuan Luu
98
3
0
30 May 2024
Alignment Helps Make the Most of Multimodal Data
Christian Arnold
Andreas Küpfer
129
2
0
14 May 2024
Mixture-of-Prompt-Experts for Multi-modal Semantic Understanding
Zichen Wu
Hsiu-Yuan Huang
Fanyi Qu
Hao Sun
VLM
MoE
83
5
0
17 Mar 2024
SMILE: Multimodal Dataset for Understanding Laughter in Video with Language Models
Lee Hyun
Kim Sung-Bin
Seungju Han
Youngjae Yu
Tae-Hyun Oh
100
15
0
15 Dec 2023
Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation
Shan Zhong
Zhongzhan Huang
Shanghua Gao
Wushao Wen
Liang Lin
Marinka Zitnik
Pan Zhou
LLMAG
LRM
101
40
0
05 Dec 2023
DialogueLLM: Context and Emotion Knowledge-Tuned Large Language Models for Emotion Recognition in Conversations
Yazhou Zhang
Mengyao Wang
Youxi Wu
Prayag Tiwari
Qiuchi Li
Benyou Wang
Jing Qin
146
24
0
17 Oct 2023
LICO: Explainable Models with Language-Image Consistency
Yiming Lei
Zilong Li
Yangyang Li
Junping Zhang
Hongming Shan
VLM
FAtt
46
7
0
15 Oct 2023
EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone
Shraman Pramanick
Yale Song
Sayan Nag
Kevin Qinghong Lin
Hardik Shah
Mike Zheng Shou
Ramalingam Chellappa
Pengchuan Zhang
VLM
118
100
0
11 Jul 2023
Multimodal Learning Without Labeled Multimodal Data: Guarantees and Applications
Paul Pu Liang
Chun Kai Ling
Yun Cheng
A. Obolenskiy
Yudong Liu
Rohan Pandey
Alex Wilf
Louis-Philippe Morency
Ruslan Salakhutdinov
OffRL
81
12
0
07 Jun 2023
Context-aware attention layers coupled with optimal transport domain adaptation and multimodal fusion methods for recognizing dementia from spontaneous speech
Loukas Ilias
D. Askounis
68
10
0
25 May 2023
Cross-Attention is Not Enough: Incongruity-Aware Dynamic Hierarchical Fusion for Multimodal Affect Recognition
Yaoting Wang
Yuanchao Li
Paul Pu Liang
Louis-Philippe Morency
P. Bell
Catherine Lai
CVBM
72
8
0
23 May 2023
TOT: Topology-Aware Optimal Transport For Multimodal Hate Detection
Linhao Zhang
Li Jin
Xian Sun
Guangluan Xu
Zequn Zhang
Xiaoyu Li
Nayu Liu
Qing Liu
Shiyao Yan
81
8
0
27 Feb 2023
VoLTA: Vision-Language Transformer with Weakly-Supervised Local-Feature Alignment
Shraman Pramanick
Li Jing
Sayan Nag
Jiachen Zhu
Hardik Shah
Yann LeCun
Ramalingam Chellappa
82
22
0
09 Oct 2022
Towards Multimodal Prediction of Spontaneous Humour: A Novel Dataset and First Results
Lukas Christ
Shahin Amiriparian
Alexander Kathan
Niklas Muller
Andreas Konig
Björn W. Schuller
111
4
0
28 Sep 2022
Foundations and Trends in Multimodal Machine Learning: Principles, Challenges, and Open Questions
Paul Pu Liang
Amir Zadeh
Louis-Philippe Morency
114
88
0
07 Sep 2022
COLD Fusion: Calibrated and Ordinal Latent Distribution Fusion for Uncertainty-Aware Multimodal Emotion Recognition
M. Tellamekala
Shahin Amiriparian
Björn W. Schuller
Elisabeth André
T. Giesbrecht
Michel Valstar
126
26
0
12 Jun 2022
Where in the World is this Image? Transformer-based Geo-localization in the Wild
Shraman Pramanick
E. Nowara
Joshua Gleason
Carlos D. Castillo
Rama Chellappa
ViT
62
37
0
29 Apr 2022
1