OmniFusion Technical Report

9 April 2024

Papers citing "OmniFusion Technical Report"


Title
Mind and Motion Aligned: A Joint Evaluation IsaacSim Benchmark for Task Planning and Low-Level Policies in Mobile Manipulation | Nikita Kachaev, Andrei Spiridonov, Andrey Gorodetsky, K. Muravyev, Nikita Oskolkov, ..., Vlad Shakhuro, Dmitry Makarov, Aleksandr Panov, Polina Fedotova, A. Kovalev | LM&Ro | 81 0 0 | 21 Aug 2025
Visually Interpretable Subtask Reasoning for Visual Question Answering | Yu Cheng, A. Goel, Hakan Bilen | LRM | 203 2 0 | 12 May 2025
VLsI: Verbalized Layers-to-Interactions from Large to Small Vision Language Models | Computer Vision and Pattern Recognition (CVPR), 2024 | Byung-Kwan Lee, Ryo Hachiuma, Yu-Chiang Frank Wang, Y. Ro, Yueh-Hua Wu | VLM | 361 4 0 | 02 Dec 2024
Phantom of Latent for Large Language and Vision Models | Byung-Kwan Lee, Sangyun Chung, Chae Won Kim, Beomchan Park, Yong Man Ro | VLM, LRM | 250 11 0 | 23 Sep 2024
Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models | Byung-Kwan Lee, Chae Won Kim, Beomchan Park, Yonghyun Ro | MLLM, LRM | 283 26 0 | 24 May 2024