Object Scene Representation Transformer

14 June 2022

Mehdi S. M. Sajjadi

Daniel Duckworth

Aravindh Mahendran

Sjoerd van Steenkiste

Papers citing "Object Scene Representation Transformer"

50 / 71 papers shown

Title
Object-Centric Pretraining via Target Encoder Bootstrapping Nikola Đukić Tim Lebailly Tinne Tuytelaars OCL 63 0 0 19 Mar 2025
Dynamic Scene Understanding through Object-Centric Voxelization and Neural Rendering Yanpeng Zhao Yiwei Hao Siyu Gao Yunbo Wang Xiaokang Yang OCL 111 1 0 17 Feb 2025
Slot-BERT: Self-supervised Object Discovery in Surgical Video Guiqiu Liao M. Jogan Marcel Hussing Kenta Nakahashi Kazuhiro Yasufuku Amin Madani Eric Eaton Daniel A. Hashimoto 41 0 0 21 Jan 2025
Moving Off-the-Grid: Scene-Grounded Video Representations Sjoerd van Steenkiste Daniel Zoran Yi Yang Yulia Rubanova Rishabh Kabra ... Thomas Keck João Carreira Alexey Dosovitskiy Mehdi S. M. Sajjadi Thomas Kipf 26 3 0 08 Nov 2024
Large Spatial Model: End-to-end Unposed Images to Semantic 3D Zhiwen Fan Jian Zhang Wenyan Cong Peihao Wang Renjie Li ... Z. Wang Danfei Xu B. Ivanovic Marco Pavone Yue Wang 3DV 36 11 0 24 Oct 2024
Learning Global Object-Centric Representations via Disentangled Slot Attention Tonglin Chen Yinxuan Huang Zhimeng Shen Jinghao Huang Bin Li Xiangyang Xue OCL 22 0 0 24 Oct 2024
Zero-Shot Object-Centric Representation Learning Aniket Didolkar Andrii Zadaianchuk Anirudh Goyal Mike Mozer Yoshua Bengio Georg Martius Maximilian Seitzer VLM OCL 26 4 0 17 Aug 2024
SlotLifter: Slot-guided Feature Lifting for Learning Object-centric Radiance Fields Yu Liu Baoxiong Jia Yixin Chen Siyuan Huang OCL 31 4 0 13 Aug 2024
Exploring the Effectiveness of Object-Centric Representations in Visual Question Answering: Comparative Insights with Foundation Models Amir Mohammad Karimi Mamaghan Samuele Papa Karl Henrik Johansson Stefan Bauer Andrea Dittadi OCL 35 5 0 22 Jul 2024
Tokenize the World into Object-level Knowledge to Address Long-tail Events in Autonomous Driving Ran Tian Boyi Li Xinshuo Weng Yuxiao Chen Edward Schmerling Yue Wang B. Ivanovic Marco Pavone 26 13 0 01 Jul 2024
Neural Assets: 3D-Aware Multi-Object Scene Synthesis with Image Diffusion Models Ziyi Wu Yulia Rubanova Rishabh Kabra Drew A. Hudson Igor Gilitschenski Yusuf Aytar Sjoerd van Steenkiste Kelsey R. Allen Thomas Kipf VGen DiffM 31 10 0 13 Jun 2024
Recurrent Complex-Weighted Autoencoders for Unsupervised Object Discovery Anand Gopalakrishnan Aleksandar Stanić Jürgen Schmidhuber M. C. Mozer 37 5 0 27 May 2024
A Survey on Vision-Language-Action Models for Embodied AI Yueen Ma Zixing Song Yuzheng Zhuang Jianye Hao Irwin King LM&Ro 60 38 0 23 May 2024
Learning Planning Abstractions from Language Weiyu Liu Geng Chen Joy Hsu Jiayuan Mao Jiajun Wu PINN 25 0 0 06 May 2024
Learning to Compose: Improving Object Centric Learning by Injecting Compositionality Whie Jung Jaehoon Yoo Sungjin Ahn Seunghoon Hong OCL CoGe 24 4 0 01 May 2024
FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-pose, and Facial Expression Features Andre Rochow Max Schwarz Sven Behnke ViT 33 6 0 15 Apr 2024
On permutation-invariant neural networks Masanari Kimura Ryotaro Shimizu Yuki Hirakawa Ryosuke Goto Yuki Saito OOD AAML 30 7 0 26 Mar 2024
Scene-LLM: Extending Language Model for 3D Visual Understanding and Reasoning Rao Fu Jingyu Liu Xilun Chen Yixin Nie Wenhan Xiong LM&Ro LRM 41 47 0 18 Mar 2024
Attentive Illumination Decomposition Model for Multi-Illuminant White Balancing Dongyoung Kim Jinwoo Kim Junsang Yu Seon Joo Kim 34 5 0 28 Feb 2024
Parallelized Spatiotemporal Binding Gautam Singh Yue Wang Jiawei Yang B. Ivanovic Sungjin Ahn Marco Pavone Tong Che 36 1 0 26 Feb 2024
Disentangled 3D Scene Generation with Layout Learning Dave Epstein Ben Poole B. Mildenhall Alexei A. Efros Aleksander Holynski CoGe OCL 3DV 32 20 0 26 Feb 2024
Unsupervised Discovery of Object-Centric Neural Fields Rundong Luo Hong-Xing Yu Jiajun Wu 3DPC OCL 85 3 0 12 Feb 2024
Binding Dynamics in Rotating Features Sindy Lowe Francesco Locatello Max Welling OCL 15 1 0 08 Feb 2024
ViewFusion: Learning Composable Diffusion Models for Novel View Synthesis Bernard Spiegl Andrea Perin Stéphane Deny Alexander Ilin DiffM 11 2 0 05 Feb 2024
Exploring the Reasoning Abilities of Multimodal Large Language Models (MLLMs): A Comprehensive Survey on Emerging Trends in Multimodal Reasoning Yiqi Wang Wentao Chen Xiaotian Han Xudong Lin Haiteng Zhao Yongfei Liu Bohan Zhai Jianbo Yuan Quanzeng You Hongxia Yang LRM 33 66 0 10 Jan 2024
Slot-guided Volumetric Object Radiance Fields Di Qi Tong Yang Xiangyu Zhang OCL 16 2 0 04 Jan 2024
NViST: In the Wild New View Synthesis from a Single Image with Transformers Wonbong Jang Lourdes Agapito ViT 19 9 0 13 Dec 2023
Inferring Hybrid Neural Fluid Fields from Videos Hong-Xing Yu Yang Zheng Yuan Gao Yitong Deng Bo Zhu Jiajun Wu 3DH 22 15 0 11 Dec 2023
EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion Zehuan Huang Hao Wen Junting Dong Yaohui Wang Yangguang Li ... Yan-Pei Cao Ding Liang Yu Qiao Bo Dai Lu Sheng 14 10 0 11 Dec 2023
Free3D: Consistent Novel View Synthesis without 3D Representation Chuanxia Zheng Andrea Vedaldi 3DV 35 48 0 07 Dec 2023
Action-slot: Visual Action-centric Representations for Multi-label Atomic Activity Recognition in Traffic Scenes Chi-Hsi Kung Shu-Wei Lu Yi-Hsuan Tsai Yi-Ting Chen 23 6 0 29 Nov 2023
GTA: A Geometry-Aware Attention Mechanism for Multi-View Transformers Takeru Miyato Bernhard Jaeger Max Welling Andreas Geiger ViT 25 14 0 16 Oct 2023
Leveraging Image Augmentation for Object Manipulation: Towards Interpretable Controllability in Object-Centric Learning Jinwoo Kim Janghyuk Choi Jaehyun Kang Changyeon Lee Ho-Jin Choi Seon Joo Kim OCL 21 0 0 13 Oct 2023
Pseudo-Generalized Dynamic View Synthesis from a Video Xiaoming Zhao Alex Colburn Fangchang Ma Miguel Angel Bautista J. Susskind A. Schwing 27 11 0 12 Oct 2023
DyST: Towards Dynamic Neural Scene Representations on Real-World Videos Maximilian Seitzer Sjoerd van Steenkiste Thomas Kipf Klaus Greff Mehdi S. M. Sajjadi VGen ViT 16 8 0 09 Oct 2023
Consistent-1-to-3: Consistent Image to 3D View Synthesis via Geometry-aware Diffusion Models Jianglong Ye Peng Wang Kejie Li Yichun Shi Heng Wang DiffM 22 72 0 04 Oct 2023
Rethinking Amodal Video Segmentation from Learning Supervised Signals with Object-centric Representation Ke Fan Jingshi Lei Xuelin Qian Miaopeng Yu Tianjun Xiao Tong He Zheng-Wei Zhang Yanwei Fu VOS 8 4 0 23 Sep 2023
Consciousness in Artificial Intelligence: Insights from the Science of Consciousness Patrick Butlin R. Long Eric Elmoznino Yoshua Bengio Jonathan C. P. Birch ... L. Mudrik Megan A. K. Peters Eric Schwitzgebel Jonathan Simon Rufin VanRullen LLMAG 16 95 0 17 Aug 2023
Learning Unified Decompositional and Compositional NeRF for Editable Novel View Synthesis Yuxin Wang Wayne Wu Dan Xu 35 9 0 05 Aug 2023
Linking vision and motion for self-supervised object-centric perception Kaylene C. Stocking Zak Murez Vijay Badrinarayanan Jamie Shotton Alex Kendall Claire Tomlin Christopher P. Burgess OCL 22 0 0 14 Jul 2023
Equivariant Single View Pose Prediction Via Induced and Restricted Representations Owen Howell David M. Klee Ondrej Biza Linfeng Zhao Robin G. Walters 16 2 0 07 Jul 2023
Tell Me Where to Go: A Composable Framework for Context-Aware Embodied Robot Navigation Harel Biggie Ajay Narasimha Mopidevi Dusty Woods Christoffer Heckman LM&Ro 11 11 0 15 Jun 2023
DORSal: Diffusion for Object-centric Representations of Scenes et al Allan Jabri Sjoerd van Steenkiste Emiel Hoogeboom Mehdi S. M. Sajjadi Thomas Kipf 14 17 0 13 Jun 2023
DDLP: Unsupervised Object-Centric Video Prediction with Deep Dynamic Latent Particles Tal Daniel Aviv Tamar DiffM 14 3 0 09 Jun 2023
Object-Centric Learning for Real-World Videos by Predicting Temporal Feature Similarities Andrii Zadaianchuk Maximilian Seitzer Georg Martius OCL 12 15 0 07 Jun 2023
Rotating Features for Object Discovery Sindy Lowe Phillip Lippe Francesco Locatello Max Welling OCL 17 21 0 01 Jun 2023
Sensitivity of Slot-Based Object-Centric Models to their Number of Slots Roland S. Zimmermann Sjoerd van Steenkiste Mehdi S. M. Sajjadi Thomas Kipf Klaus Greff OCL 23 5 0 30 May 2023
GeoVLN: Learning Geometry-Enhanced Visual Representation with Slot Attention for Vision-and-Language Navigation Jingyang Huo Qiang Sun Boyan Jiang Haitao Lin Yanwei Fu 19 18 0 26 May 2023
Provably Learning Object-Centric Representations Jack Brady Roland S. Zimmermann Yash Sharma Bernhard Schölkopf Julius von Kügelgen Wieland Brendel OCL 24 31 0 23 May 2023
SlotDiffusion: Object-Centric Generative Modeling with Diffusion Models Ziyi Wu Jingyu Hu Wuyue Lu Igor Gilitschenski Animesh Garg DiffM OCL 23 44 0 18 May 2023