ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2311.09048
  4. Cited By
GRASP: A novel benchmark for evaluating language GRounding And Situated
  Physics understanding in multimodal language models

GRASP: A novel benchmark for evaluating language GRounding And Situated Physics understanding in multimodal language models

15 November 2023
Serwan Jassim
Mario S. Holubar
Annika Richter
Cornelius Wolff
Xenia Ohmer
Elia Bruni
    ELM
ArXivPDFHTML

Papers citing "GRASP: A novel benchmark for evaluating language GRounding And Situated Physics understanding in multimodal language models"

10 / 10 papers shown
Title
MMSciBench: Benchmarking Language Models on Multimodal Scientific Problems
Xinwu Ye
Chengfan Li
Siming Chen
Xiangru Tang
Wei Wei
LRM
34
1
0
27 Feb 2025
CueTip: An Interactive and Explainable Physics-aware Pool Assistant
CueTip: An Interactive and Explainable Physics-aware Pool Assistant
Sean Memery
Kevin Denamganai
Jiaxin Zhang
Zehai Tu
Yiwen Guo
Kartic Subr
LRM
35
0
0
30 Jan 2025
Towards World Simulator: Crafting Physical Commonsense-Based Benchmark
  for Video Generation
Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation
Fanqing Meng
Jiaqi Liao
Xinyu Tan
Wenqi Shao
Quanfeng Lu
Kaipeng Zhang
Yu Cheng
Dianqi Li
Yu Qiao
Ping Luo
VGen
EGVM
32
24
0
07 Oct 2024
What is the Visual Cognition Gap between Humans and Multimodal LLMs?
What is the Visual Cognition Gap between Humans and Multimodal LLMs?
Xu Cao
Bolin Lai
Wenqian Ye
Yunsheng Ma
Joerg Heintz
Jintai Chen
Jianguo Cao
James M. Rehg
37
8
0
14 Jun 2024
Exploring Spatial Schema Intuitions in Large Language and Vision Models
Exploring Spatial Schema Intuitions in Large Language and Vision Models
Philipp Wicke
Lennart Wachowiak
LM&Ro
16
4
0
01 Feb 2024
SimLM: Can Language Models Infer Parameters of Physical Systems?
SimLM: Can Language Models Infer Parameters of Physical Systems?
Sean Memery
Mirella Lapata
Kartic Subr
LRM
AI4CE
16
2
0
21 Dec 2023
Visual cognition in multimodal large language models
Visual cognition in multimodal large language models
Luca M. Schulze Buschoff
Elif Akata
Matthias Bethge
Eric Schulz
LRM
49
12
0
27 Nov 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image
  Encoders and Large Language Models
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
262
4,223
0
30 Jan 2023
AVoE: A Synthetic 3D Dataset on Understanding Violation of Expectation
  for Artificial Cognition
AVoE: A Synthetic 3D Dataset on Understanding Violation of Expectation for Artificial Cognition
Arijit Dasgupta
Jiafei Duan
M. Ang
Cheston Tan
26
4
0
12 Oct 2021
Interaction Networks for Learning about Objects, Relations and Physics
Interaction Networks for Learning about Objects, Relations and Physics
Peter W. Battaglia
Razvan Pascanu
Matthew Lai
Danilo Jimenez Rezende
Koray Kavukcuoglu
AI4CE
OCL
PINN
GNN
258
1,398
0
01 Dec 2016
1