Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2209.10326
Cited By
Toward 3D Spatial Reasoning for Human-like Text-based Visual Question Answering
21 September 2022
Hao Li
Jinfa Huang
Peng Jin
Guoli Song
Qi Wu
Jie Chen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Toward 3D Spatial Reasoning for Human-like Text-based Visual Question Answering"
13 / 13 papers shown
Title
VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search
Yiming Jia
J. Li
Xiang Yue
Bo Li
Ping Nie
Kai Zou
Wenhu Chen
LRM
72
2
0
13 Mar 2025
Hierarchical Banzhaf Interaction for General Video-Language Representation Learning
Peng Jin
H. Li
Li Yuan
Shuicheng Yan
Jie Chen
42
1
0
31 Dec 2024
VProChart: Answering Chart Question through Visual Perception Alignment Agent and Programmatic Solution Reasoning
Muye Huang
Lingling Zhang
Lai Han
Wenjun Wu
Xinyu Zhang
Jun Liu
14
0
0
03 Sep 2024
GraCo: Granularity-Controllable Interactive Segmentation
Yian Zhao
Kehan Li
Ze-Long Cheng
Pengchong Qiao
Xiawu Zheng
Rongrong Ji
Chang Liu
Li-ming Yuan
Jie Chen
29
1
0
01 May 2024
FreestyleRet: Retrieving Images from Style-Diversified Queries
Hao Li
Curise Jia
Peng Jin
Ze-Long Cheng
Kehan Li
Jialu Sui
Chang Liu
Li-ming Yuan
3DH
10
5
0
05 Dec 2023
Act As You Wish: Fine-Grained Control of Motion Diffusion Model with Hierarchical Semantic Graphs
Peng Jin
Yang Wu
Yanbo Fan
Zhongqian Sun
Yang Wei
Li-ming Yuan
DiffM
15
27
0
02 Nov 2023
Head-Tail Cooperative Learning Network for Unbiased Scene Graph Generation
Lei Wang
Zejian Yuan
Yao Lu
Badong Chen
14
0
0
23 Aug 2023
Improving Scene Graph Generation with Superpixel-Based Interaction Learning
Jingyi Wang
Can Zhang
Jinfa Huang
Bo Ren
Zhidong Deng
13
7
0
04 Aug 2023
TG-VQA: Ternary Game of Video Question Answering
Hao Li
Peng Jin
Ze-Long Cheng
Songyang Zhang
Kai-xiang Chen
Zhennan Wang
Chang-rui Liu
Jie Chen
13
10
0
17 May 2023
Adaptive loose optimization for robust question answering
Jie Ma
Pinghui Wang
Ze-you Wang
Dechen Kong
Min Hu
Tingxu Han
Jun Liu
OOD
19
4
0
06 May 2023
DiffusionRet: Generative Text-Video Retrieval with Diffusion Model
Peng Jin
Hao Li
Ze-Long Cheng
Kehan Li
Xiang Ji
Chang-rui Liu
Li-ming Yuan
Jie Chen
DiffM
VGen
9
52
0
17 Mar 2023
Parallel Vertex Diffusion for Unified Visual Grounding
Ze-Long Cheng
Kehan Li
Peng Jin
Xiang Ji
Li-ming Yuan
Chang-rui Liu
Jie Chen
DiffM
12
25
0
13 Mar 2023
Aggregated Residual Transformations for Deep Neural Networks
Saining Xie
Ross B. Girshick
Piotr Dollár
Z. Tu
Kaiming He
261
10,106
0
16 Nov 2016
1