GPT-4o: Visual perception performance of multimodal large language models in piglet activity understanding

14 June 2024
Yiqi Wu
Xiaodan Hu
Ziming Fu
Siling Zhou
Jiangong Li
    MLLM
arXiv: 2406.09781

Papers citing "GPT-4o: Visual perception performance of multimodal large language models in piglet activity understanding"

7 / 7 papers shown
3DAxisPrompt: Promoting the 3D Grounding and Reasoning in GPT-4o
Dingning Liu
Cheng Wang
Peng Gao
Renrui Zhang
Xinzhu Ma
Yuan Meng
Zhihui Wang
LRM
39
0
0
17 Mar 2025
BOUQuET: dataset, Benchmark and Open initiative for Universal Quality Evaluation in Translation
Omnilingual MT Team
Pierre Yves Andrews
Mikel Artetxe
Mariano Coria Meglioli
Marta R. Costa-jussá
...
Eduardo Sánchez
Ioannis Tsiamas
Arina Turkatenko
Albert Ventayol-Boada
Shireen Yates
98
0
0
06 Feb 2025
Leveraging Large Language Models for Generating Labeled Mineral Site Record Linkage Data
Jiyoon Pyo
Yao-Yi Chiang
61
0
0
17 Nov 2024
UniGlyph: A Seven-Segment Script for Universal Language Representation
G. V. Bency Sherin
A. Abijesh Euphrine
A. Lenora Moreen
L. Arun Jose
21
0
0
11 Oct 2024
Just Say the Name: Online Continual Learning with Category Names Only via Data Generation
Minhyuk Seo
Diganta Misra
Seongwon Cho
Minjae Lee
Jonghyun Choi
CLL
27
6
0
16 Mar 2024
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning
Jun Chen
Deyao Zhu
Xiaoqian Shen
Xiang Li
Zechun Liu
Pengchuan Zhang
Raghuraman Krishnamoorthi
Vikas Chandra
Yunyang Xiong
Mohamed Elhoseiny
MLLM
154
280
0
14 Oct 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
244
4,186
0
30 Jan 2023