Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2410.05608
Cited By
Multimodal Large Language Models and Tunings: Vision, Language, Sensors, Audio, and Beyond
ACM Multimedia (MM), 2024
8 October 2024
Soyeon Caren Han
Feiqi Cao
Josiah Poon
Roberto Navigli
MLLM
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Github (4★)
Papers citing
"Multimodal Large Language Models and Tunings: Vision, Language, Sensors, Audio, and Beyond"
3 / 3 papers shown
TerraMind: Large-Scale Generative Multimodality for Earth Observation
Johannes Jakubik
Felix Yang
Benedikt Blumenstiel
Erik Scheurer
Rocco Sedona
...
P. Fraccaro
Thomas Brunschwiler
Gabriele Cavallaro
Juan Bernabé-Moreno
Alessandra Feliciotti
MLLM
VLM
612
75
0
15 Apr 2025
COP-GEN-Beta: Unified Generative Modelling of COPernicus Imagery Thumbnails
Miguel Espinosa
V. Marsocci
Yuru Jia
Elliot J. Crowley
Mikolaj Czerkawski
DiffM
431
3
0
11 Apr 2025
Enhancing Collective Intelligence in Large Language Models Through Emotional Integration
Likith Kadiyala
Ramteja Sajja
Y. Sermet
Ibrahim Demir
939
4
0
05 Mar 2025
1
Page 1 of 1