ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2012.09988
  4. Cited By
Objectron: A Large Scale Dataset of Object-Centric Videos in the Wild
  with Pose Annotations

Objectron: A Large Scale Dataset of Object-Centric Videos in the Wild with Pose Annotations

Computer Vision and Pattern Recognition (CVPR), 2020
18 December 2020
Adel Ahmadyan
Liangkai Zhang
Jianing Wei
Artsiom Ablavatski
Matthias Grundmann
    3DPC
ArXiv (abs)PDFHTML

Papers citing "Objectron: A Large Scale Dataset of Object-Centric Videos in the Wild with Pose Annotations"

50 / 126 papers shown
4DLangVGGT: 4D Language-Visual Geometry Grounded Transformer
4DLangVGGT: 4D Language-Visual Geometry Grounded Transformer
Xianfeng Wu
Yajing Bai
Minghan Li
Xianzu Wu
Xueqi Zhao
Zhongyuan Lai
Wenyu Liu
Xinggang Wang
3DGS
292
1
0
04 Dec 2025
UAV-MM3D: A Large-Scale Synthetic Benchmark for 3D Perception of Unmanned Aerial Vehicles with Multi-Modal Data
UAV-MM3D: A Large-Scale Synthetic Benchmark for 3D Perception of Unmanned Aerial Vehicles with Multi-Modal Data
Longkun Zou
Jiale Wang
Rongqin Liang
Hai Wu
Ke Chen
Yaowei Wang
239
1
0
27 Nov 2025
LocateAnything3D: Vision-Language 3D Detection with Chain-of-Sight
LocateAnything3D: Vision-Language 3D Detection with Chain-of-Sight
Yunze Man
S. S. Wang
Guowen Zhang
Johan Bjorck
Zhiqi Li
Liang-Yan Gui
Jim Fan
Jan Kautz
Yu Wang
Zhiding Yu
180
4
0
25 Nov 2025
DetAny4D: Detect Anything 4D Temporally in a Streaming RGB Video
DetAny4D: Detect Anything 4D Temporally in a Streaming RGB Video
Jiawei Hou
Shenghao Zhang
Can Wang
Zheng Gu
Yonggen Ling
Taiping Zeng
Xiangyang Xue
Jingbo Zhang
3DPC
198
0
0
24 Nov 2025
Concept than Document: Context Compression via AMR-based Conceptual Entropy
Concept than Document: Context Compression via AMR-based Conceptual Entropy
Kaize Shi
Xueyao Sun
Xiaohui Tao
Lin Li
Qika Lin
Guandong Xu
263
0
0
24 Nov 2025
SceneDesigner: Controllable Multi-Object Image Generation with 9-DoF Pose Manipulation
SceneDesigner: Controllable Multi-Object Image Generation with 9-DoF Pose Manipulation
Zhenyuan Qin
Xincheng Shuai
Henghui Ding
DiffM
262
5
0
20 Nov 2025
Visual Spatial Tuning
Visual Spatial Tuning
Rui Yang
Ziyu Zhu
Yanwei Li
Jingjia Huang
Shen Yan
...
Xiangtai Li
S. Li
Wenqian Wang
Yi Lin
Hengshuang Zhao
VLM
421
29
0
07 Nov 2025
iFlyBot-VLM Technical Report
iFlyBot-VLM Technical Report
Xin Nie
Zhiyuan Cheng
Yuan Zhang
Chao Ji
Jiajia wu
Yuhan Zhang
Jia Pan
LM&Ro
376
0
0
07 Nov 2025
Neural USD: An object-centric framework for iterative editing and control
Neural USD: An object-centric framework for iterative editing and control
Alejandro Escontrela
Shrinu Kushagra
Sjoerd van Steenkiste
Yulia Rubanova
Aleksander Holynski
Kelsey R. Allen
Kevin Murphy
Thomas Kipf
DiffM
203
1
0
28 Oct 2025
Monocular Visual 8D Pose Estimation for Articulated Bicycles and Cyclists
Monocular Visual 8D Pose Estimation for Articulated Bicycles and Cyclists
Eduardo R. Corral-Soto
Yang Liu
Y. Ren
Bai Dongfeng
Liu Bingbing
162
2
0
23 Oct 2025
MultiCOIN: Multi-Modal COntrollable Video INbetweening
MultiCOIN: Multi-Modal COntrollable Video INbetweening
Maham Tanveer
Yang Zhou
Simon Niklaus
Ali Mahdavi-Amiri
Hao Zhang
Krishna Kumar Singh
Nanxuan Zhao
DiffMVGen
238
2
0
09 Oct 2025
Robotic Manipulation Framework Based on Semantic Keypoints for Packing Shoes of Different Sizes, Shapes, and Softness
Robotic Manipulation Framework Based on Semantic Keypoints for Packing Shoes of Different Sizes, Shapes, and Softness
Yi Dong
Y. Liu
Jinjun Duan
Yang Li
Zhendong Dai
215
2
0
07 Sep 2025
3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection
3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection
Yung-Hsu Yang
Luigi Piccinelli
Mattia Segu
Siyuan Li
Rui Huang
Yuqian Fu
Marc Pollefeys
Hermann Blum
Z. Bauer
3DPC
320
10
0
31 Jul 2025
Boosting Multi-View Indoor 3D Object Detection via Adaptive 3D Volume Construction
Boosting Multi-View Indoor 3D Object Detection via Adaptive 3D Volume Construction
Runmin Zhang
Zhu Yu
Si-Yuan Cao
Lingyu Zhu
Guangyi Zhang
Xiaokai Bai
Hui-Liang Shen
3DPC
237
2
0
24 Jul 2025
Unfolding Spatial Cognition: Evaluating Multimodal Models on Visual Simulations
Unfolding Spatial Cognition: Evaluating Multimodal Models on Visual Simulations
Linjie Li
Mahtab Bigverdi
Jiawei Gu
Zixian Ma
Yinuo Yang
Ziang Li
Yejin Choi
Ranjay Krishna
LRM
288
17
0
05 Jun 2025
Seeing Isn't Orienting: A Cognitively Grounded Benchmark Reveals Systematic Orientation Failures in MLLMs Supplementary
Seeing Isn't Orienting: A Cognitively Grounded Benchmark Reveals Systematic Orientation Failures in MLLMs Supplementary
Keanu Nichols
Nazia Tasnim
Yuting Yan
Nicholas Ikechukwu
Elva Zou
Deepti Ghadiyaram
Bryan A. Plummer
500
1
0
27 May 2025
Marginalized Generalized IoU (MGIoU): A Unified Objective Function for Optimizing Any Convex Parametric Shapes
Marginalized Generalized IoU (MGIoU): A Unified Objective Function for Optimizing Any Convex Parametric Shapes
Duy-Tho Le
Trung Pham
Jianfei Cai
H. Rezatofighi
442
2
0
23 Apr 2025
GATE3D: Generalized Attention-based Task-synergized Estimation in 3D*
GATE3D: Generalized Attention-based Task-synergized Estimation in 3D*
Eunsoo Im
Jung Kwon Lee
Changhyun Jee
659
3
0
15 Apr 2025
OpenLex3D: A Tiered Evaluation Benchmark for Open-Vocabulary 3D Scene Representations
OpenLex3D: A Tiered Evaluation Benchmark for Open-Vocabulary 3D Scene Representations
Christina Kassab
Sacha Morin
Martin Buchner
Matías Mattamala
Kumaraditya Gupta
Abhinav Valada
Liam Paull
Maurice F. Fallon
3DVELM
381
3
0
25 Mar 2025
AxisPose: Model-Free Matching-Free Single-Shot 6D Object Pose Estimation via Axis Generation
AxisPose: Model-Free Matching-Free Single-Shot 6D Object Pose Estimation via Axis Generation
Yang Zou
Zhaoshuai Qi
Yating Liu
Zihao Xu
Weipeng Sun
Weiyi Liu
Xingyuan Li
Jiaqi Yang
Yanning Zhang
273
0
0
09 Mar 2025
Enriching physical-virtual interaction in AR gaming by tracking identical objects via an egocentric partial observation frame
Enriching physical-virtual interaction in AR gaming by tracking identical objects via an egocentric partial observation frame
Liuchuan Yu
Ching-I Huang
Hsueh-Cheng Wang
L. Yu
244
0
0
24 Feb 2025
MVIP -- A Dataset and Methods for Application Oriented Multi-View and Multi-Modal Industrial Part Recognition
MVIP -- A Dataset and Methods for Application Oriented Multi-View and Multi-Modal Industrial Part Recognition
Paul Koch
Marian Schluter
Jörg Krüger
334
0
0
24 Feb 2025
Glissando-Net: Deep sinGLe vIew category level poSe eStimation ANd 3D recOnstruction
Glissando-Net: Deep sinGLe vIew category level poSe eStimation ANd 3D recOnstructionIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Bo Sun
Hao Kang
Li Guan
Haoxiang Li
Philippos Mordohai
Gang Hua
470
2
0
28 Jan 2025
GSOT3D: Towards Generic 3D Single Object Tracking in the Wild
GSOT3D: Towards Generic 3D Single Object Tracking in the Wild
Yifan Jiao
Yunhao Li
Junhua Ding
Q. Yang
Song Fu
Heng Fan
Libo Zhang
3DPC
300
0
0
03 Dec 2024
Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration
Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration
Junyuan Deng
Wei Yin
Xiaoyang Guo
Qian Zhang
Xiaotao Hu
Weiqiang Ren
Xiaoxiao Long
P. Tan
MDEDiffM
469
1
0
26 Nov 2024
Open Vocabulary Monocular 3D Object Detection
Open Vocabulary Monocular 3D Object Detection
Jin Yao
Hao Gu
Xuweiyi Chen
Jiayun Wang
Zezhou Cheng
ObjDVLM
638
17
0
25 Nov 2024
CameraHMR: Aligning People with Perspective
CameraHMR: Aligning People with PerspectiveInternational Conference on 3D Vision (3DV), 2024
Priyanka Patel
Michael J. Black
3DH3DGS
272
44
0
12 Nov 2024
MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps
MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane SweepsNeural Information Processing Systems (NeurIPS), 2024
Yating Xu
Chen Li
G. Lee
3DPC
351
8
0
28 Oct 2024
MetaFood3D: Large 3D Food Object Dataset with Nutrition Values
MetaFood3D: Large 3D Food Object Dataset with Nutrition Values
Yuhao Chen
Jiangpeng He
Chris Czarnecki
Gautham Vinod
Talha Ibn Mahmud
...
Saeejith Nair
Pengcheng Xi
Alexander Wong
Edward J. Delp
Fengqing Zhu
274
2
0
03 Sep 2024
CatFree3D: Category-agnostic 3D Object Detection with Diffusion
CatFree3D: Category-agnostic 3D Object Detection with DiffusionInternational Conference on 3D Vision (3DV), 2024
Wenjing Bian
Zirui Wang
Andrea Vedaldi
357
2
0
22 Aug 2024
ADen: Adaptive Density Representations for Sparse-view Camera Pose
  Estimation
ADen: Adaptive Density Representations for Sparse-view Camera Pose EstimationEuropean Conference on Computer Vision (ECCV), 2024
Hao Tang
Weiyao Wang
Pierre Gleize
Matt Feiszli
3DH
268
2
0
16 Aug 2024
LLMI3D: MLLM-based 3D Perception from a Single 2D Image
LLMI3D: MLLM-based 3D Perception from a Single 2D Image
Fan Yang
Sicheng Zhao
Yanhao Zhang
Haoxiang Chen
Hui Chen
Wenbo Tang
Guiguang Ding
310
1
0
14 Aug 2024
DEF-oriCORN: efficient 3D scene understanding for robust
  language-directed manipulation without demonstrations
DEF-oriCORN: efficient 3D scene understanding for robust language-directed manipulation without demonstrations
Dongwon Son
Sanghyeon Son
Jaehyung Kim
Beomjoon Kim
LM&Ro
336
1
0
31 Jul 2024
Floating No More: Object-Ground Reconstruction from a Single Image
Floating No More: Object-Ground Reconstruction from a Single Image
Yunze Man
Yichen Sheng
Jianming Zhang
Liangyan Gui
Yu-Xiong Wang
426
3
0
26 Jul 2024
OmniNOCS: A unified NOCS dataset and model for 3D lifting of 2D objects
OmniNOCS: A unified NOCS dataset and model for 3D lifting of 2D objects
Akshay Krishnan
Abhijit Kundu
Kevis-Kokitsi Maninis
James Hays
Matthew Brown
226
24
0
11 Jul 2024
RoCap: A Robotic Data Collection Pipeline for the Pose Estimation of
  Appearance-Changing Objects
RoCap: A Robotic Data Collection Pipeline for the Pose Estimation of Appearance-Changing Objects
Jiahao Nick Li
T. Chong
Zhongyi Zhou
Hironori Yoshida
Koji Yatani
Xiang Ánthony' Chen
Takeo Igarashi
251
1
0
10 Jul 2024
ImageNet3D: Towards General-Purpose Object-Level 3D Understanding
ImageNet3D: Towards General-Purpose Object-Level 3D Understanding
Wufei Ma
Guanning Zeng
Guofeng Zhang
Qihao Liu
Letian Zhang
Adam Kortylewski
Yaoyao Liu
Alan Yuille
VLM3DV
303
17
0
13 Jun 2024
Neural Assets: 3D-Aware Multi-Object Scene Synthesis with Image
  Diffusion Models
Neural Assets: 3D-Aware Multi-Object Scene Synthesis with Image Diffusion Models
Ziyi Wu
Yulia Rubanova
Rishabh Kabra
Drew A. Hudson
Igor Gilitschenski
Yusuf Aytar
Sjoerd van Steenkiste
Kelsey R. Allen
Thomas Kipf
VGenDiffM
393
27
0
13 Jun 2024
DiffCalib: Reformulating Monocular Camera Calibration as Diffusion-Based
  Dense Incident Map Generation
DiffCalib: Reformulating Monocular Camera Calibration as Diffusion-Based Dense Incident Map Generation
Xiankang He
Guangkai Xu
Bo Zhang
Hao Chen
Ying Cui
Dongyan Guo
DiffM
355
9
0
24 May 2024
Deep Learning-Based Object Pose Estimation: A Comprehensive Survey
Deep Learning-Based Object Pose Estimation: A Comprehensive Survey
Jian Liu
Wei Sun
Hui Yang
Zhiwen Zeng
Chongpei Liu
Jin Zheng
Xingyu Liu
Hossein Rahmani
Andrii Zadaianchuk
Lin Wang
610
60
0
13 May 2024
Language-Image Models with 3D Understanding
Language-Image Models with 3D UnderstandingInternational Conference on Learning Representations (ICLR), 2024
Jang Hyun Cho
Boris Ivanovic
Yulong Cao
Edward Schmerling
Yue Wang
...
Boyi Li
Yurong You
Philipp Krahenbuhl
Yan Wang
Marco Pavone
LRM
242
34
0
06 May 2024
Zero123-6D: Zero-shot Novel View Synthesis for RGB Category-level 6D
  Pose Estimation
Zero123-6D: Zero-shot Novel View Synthesis for RGB Category-level 6D Pose Estimation
F. D. Felice
A. Remus
Stefano Gasperini
Benjamin Busam
Lionel Ott
Federico Tombari
Roland Siegwart
C. Avizzano
DiffM
224
13
0
21 Mar 2024
A survey of synthetic data augmentation methods in computer vision
A survey of synthetic data augmentation methods in computer visionMachine Intelligence Research (MIR), 2024
A. Mumuni
F. Mumuni
N. K. Gerrar
385
91
0
15 Mar 2024
UniMODE: Unified Monocular 3D Object Detection
UniMODE: Unified Monocular 3D Object Detection
Zhuoling Li
Xiaohan Li
Sernam Lim
Hengshuang Zhao
439
3
0
28 Feb 2024
Advances in 3D Generation: A Survey
Advances in 3D Generation: A Survey
Xiaoyu Li
Tao Gui
Di Kang
Weihao Cheng
Yiming Gao
Jingbo Zhang
Zhihao Liang
Jing Liao
Yan-Pei Cao
Ying Shan
404
82
0
31 Jan 2024
RGBD Objects in the Wild: Scaling Real-World 3D Object Learning from
  RGB-D Videos
RGBD Objects in the Wild: Scaling Real-World 3D Object Learning from RGB-D VideosComputer Vision and Pattern Recognition (CVPR), 2024
Hongchi Xia
Yang Fu
Sifei Liu
Xiaolong Wang
443
80
0
23 Jan 2024
Towards Real-World Aerial Vision Guidance with Categorical 6D Pose
  Tracker
Towards Real-World Aerial Vision Guidance with Categorical 6D Pose TrackerIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Jingtao Sun
Yaonan Wang
Danwei Wang
410
3
0
09 Jan 2024
Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision,
  Language, Audio, and Action
Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action
Jiasen Lu
Christopher Clark
Sangho Lee
Zichen Zhang
Savya Khosla
Ryan Marten
Derek Hoiem
Aniruddha Kembhavi
VLMMLLM
379
310
0
28 Dec 2023
DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D Vision
DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D Vision
Lu Ling
Yichen Sheng
Zhi Tu
Wentian Zhao
Cheng Xin
...
Xiangrui Kong
Gang Hua
Tianti Zhang
Bedrich Benes
Aniket Bera
VGen
669
361
0
26 Dec 2023
PACE: A Large-Scale Dataset with Pose Annotations in Cluttered
  Environments
PACE: A Large-Scale Dataset with Pose Annotations in Cluttered Environments
Yang You
Kai Xiong
Zhening Yang
Zhengxiang Huang
Junwei Zhou
Ruoxi Shi
Zhou Fang
Adam W. Harley
Leonidas Guibas
Cewu Lu
3DV
462
6
0
23 Dec 2023
123
Next
Page 1 of 3