Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2406.09781
Cited By
GPT-4o: Visual perception performance of multimodal large language models in piglet activity understanding
14 June 2024
Yiqi Wu
Xiaodan Hu
Ziming Fu
Siling Zhou
Jiangong Li
MLLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"GPT-4o: Visual perception performance of multimodal large language models in piglet activity understanding"
7 / 7 papers shown
Title
3DAxisPrompt: Promoting the 3D Grounding and Reasoning in GPT-4o
Dingning Liu
Cheng Wang
Peng Gao
Renrui Zhang
Xinzhu Ma
Yuan Meng
Zhihui Wang
LRM
39
0
0
17 Mar 2025
BOUQuET: dataset, Benchmark and Open initiative for Universal Quality Evaluation in Translation
Omnilingual MT Team
Pierre Yves Andrews
Mikel Artetxe
Mariano Coria Meglioli
Marta R. Costa-jussá
...
Eduardo Sánchez
Ioannis Tsiamas
Arina Turkatenko
Albert Ventayol-Boada
Shireen Yates
98
0
0
06 Feb 2025
Leveraging Large Language Models for Generating Labeled Mineral Site Record Linkage Data
Jiyoon Pyo
Yao-Yi Chiang
61
0
0
17 Nov 2024
UniGlyph: A Seven-Segment Script for Universal Language Representation
G. V. Bency Sherin
A. Abijesh Euphrine
A. Lenora Moreen
L. Arun Jose
21
0
0
11 Oct 2024
Just Say the Name: Online Continual Learning with Category Names Only via Data Generation
Minhyuk Seo
Diganta Misra
Seongwon Cho
Minjae Lee
Jonghyun Choi
CLL
30
6
0
16 Mar 2024
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning
Jun Chen
Deyao Zhu
Xiaoqian Shen
Xiang Li
Zechun Liu
Pengchuan Zhang
Raghuraman Krishnamoorthi
Vikas Chandra
Yunyang Xiong
Mohamed Elhoseiny
MLLM
154
280
0
14 Oct 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
244
4,186
0
30 Jan 2023
1