ResearchTrend.AI

PitVQA: Image-grounded Text Embedding LLM for Visual Question Answering in Pituitary Surgery

22 May 2024
Runlong He
Mengya Xu
Adrito Das
Danyal Z. Khan
Sophia Bano
Hani J. Marcus
Danail Stoyanov
Matthew J. Clarkson
Mobarakol Islam

Papers citing "PitVQA: Image-grounded Text Embedding LLM for Visual Question Answering in Pituitary Surgery"

6 / 6 papers shown
AGIR: Assessing 3D Gait Impairment with Reasoning based on LLMs
Diwei Wang
Cédric Bobenrieth
Hyewon Seo
LRM
40
0
0
23 Mar 2025
SurgicalVLM-Agent: Towards an Interactive AI Co-Pilot for Pituitary Surgery
Jiayuan Huang
Runlong He
Danyal Z. Khan
E. Mazomenos
Danail Stoyanov
Hani J. Marcus
Matthew J. Clarkson
Mobarakol Islam
LM&Ro
55
0
0
12 Mar 2025
FunBench: Benchmarking Fundus Reading Skills of MLLMs
Qijie Wei
Kaiheng Qian
Xirong Li
34
1
0
02 Mar 2025
VideoICL: Confidence-based Iterative In-context Learning for Out-of-Distribution Video Understanding
Kangsan Kim
G. Park
Youngwan Lee
Woongyeong Yeo
Sung Ju Hwang
89
3
0
03 Dec 2024
PitVis-2023 Challenge: Workflow Recognition in videos of Endoscopic Pituitary Surgery
Adrito Das
Danyal Z. Khan
Dimitrios Psychogyios
Yitong Zhang
John G. Hanrahan
...
Santiago Rodriguez
Pablo Arbelaez
Danail Stoyanov
Hani J. Marcus
Sophia Bano
16
5
0
02 Sep 2024
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
382
4,010
0
28 Jan 2022