Object-Centric Learning with Slot Attention

26 June 2020

Francesco Locatello

Alexey Dosovitskiy

Papers citing "Object-Centric Learning with Slot Attention"

50 / 193 papers shown

Title
AE-NeRF: Audio Enhanced Neural Radiance Field for Few Shot Talking Head Synthesis Dongze Li Kang Zhao Wei Wang Bo Peng Yingya Zhang Jing Dong Tien-Ping Tan DiffM VGen 27 12 0 18 Dec 2023
Uni3DL: Unified Model for 3D and Language Understanding Xiang Li Jian Ding Zhaoyang Chen Mohamed Elhoseiny 30 3 0 05 Dec 2023
Learning Generalizable Manipulation Policies with Object-Centric 3D Representations Yifeng Zhu Zhenyu Jiang Peter Stone Yuke Zhu 3DPC 22 43 0 22 Oct 2023
Loci-Segmented: Improving Scene Segmentation Learning Manuel Traub Frederic Becker Adrian Sauter S. Otte Martin Volker Butz 26 2 0 16 Oct 2023
Vision Transformers Need Registers Zilong Chen Maxime Oquab Julien Mairal Huaping Liu ViT 37 311 0 28 Sep 2023
Temporal-aware Hierarchical Mask Classification for Video Semantic Segmentation Zhaochong An Guolei Sun Zongwei Wu Hao Tang Luc Van Gool VOS 23 4 0 14 Sep 2023
AttrSeg: Open-Vocabulary Semantic Segmentation via Attribute Decomposition-Aggregation Chaofan Ma Yu-Hao Yang Chen Ju Fei Zhang Ya-Qin Zhang Yanfeng Wang VLM 40 17 0 31 Aug 2023
Enhancing Interpretable Object Abstraction via Clustering-based Slot Initialization Ni Gao Bernard Hohmann Gerhard Neumann OCL 27 2 0 22 Aug 2023
Does Visual Pretraining Help End-to-End Reasoning? Chen Sun Calvin Luo Xingyi Zhou Anurag Arnab Cordelia Schmid OCL LRM ViT 30 3 0 17 Jul 2023
Compositional Generalization from First Principles Thaddäus Wiedemer Prasanna Mayilvahanan Matthias Bethge Wieland Brendel OCL 25 37 0 10 Jul 2023
Learning Differentiable Logic Programs for Abstract Visual Reasoning Hikaru Shindo Viktor Pfanschilling D. Dhami Kristian Kersting NAI 26 6 0 03 Jul 2023
Online Unsupervised Video Object Segmentation via Contrastive Motion Clustering Lin Xi Weihai Chen Xingming Wu Zhong Liu Zhengguo Li VOS 23 9 0 21 Jun 2023
How can objects help action recognition? Xingyi Zhou Anurag Arnab Chen Sun Cordelia Schmid 30 14 0 20 Jun 2023
OCTScenes: A Versatile Real-World Dataset of Tabletop Scenes for Object-Centric Learning Yin-Tao Huang Tonglin Chen Zhimeng Shen Jinghao Huang Bin Li Xiangyang Xue OCL 25 1 0 16 Jun 2023
Scalable Neural-Probabilistic Answer Set Programming Arseny Skryagin Daniel Ochs D. Dhami Kristian Kersting 30 5 0 14 Jun 2023
Im-Promptu: In-Context Composition from Image Prompts Bhishma Dedhia Michael Chang Jake C. Snell Thomas L. Griffiths N. Jha LRM MLLM 29 1 0 26 May 2023
GeoVLN: Learning Geometry-Enhanced Visual Representation with Slot Attention for Vision-and-Language Navigation Jingyang Huo Qiang Sun Boyan Jiang Haitao Lin Yanwei Fu 32 19 0 26 May 2023
An Examination of the Robustness of Reference-Free Image Captioning Evaluation Metrics Saba Ahmadi Aishwarya Agrawal 17 6 0 24 May 2023
SlotDiffusion: Object-Centric Generative Modeling with Diffusion Models Ziyi Wu Jingyu Hu Wuyue Lu Igor Gilitschenski Animesh Garg DiffM OCL 30 44 0 18 May 2023
Ray-Patch: An Efficient Querying for Light Field Transformers T. B. Martins Javier Civera ViT 34 0 0 16 May 2023
Learnable Earth Parser: Discovering 3D Prototypes in Aerial Scans Romain Loiseau Elliot Vincent Mathieu Aubry Loic Landrieu 3DPC 13 3 0 19 Apr 2023
RePAST: Relative Pose Attention Scene Representation Transformer Aleksandr Safin Daniel Durckworth Mehdi S. M. Sajjadi 29 3 0 03 Apr 2023
Prefix-Tree Decoding for Predicting Mass Spectra from Molecules Samuel Goldman John Bradshaw Jiayi Xin Connor W. Coley 27 12 0 11 Mar 2023
Efficient Skill Acquisition for Complex Manipulation Tasks in Obstructed Environments Jun Yamada J. Collins Ingmar Posner 31 8 0 06 Mar 2023
Reusable Slotwise Mechanisms Trang Nguyen Amin Mansouri Kanika Madan Khuong N. Nguyen Kartik Ahuja Dianbo Liu Yoshua Bengio OCL 28 4 0 21 Feb 2023
Structured Generative Models for Scene Understanding Christopher K. I. Williams OCL 3DV 19 3 0 07 Feb 2023
Causal Triplet: An Open Challenge for Intervention-centric Causal Representation Learning Yuejiang Liu Alexandre Alahi Chris Russell Max Horn Dominik Zietlow Bernhard Schölkopf Francesco Locatello CML 54 22 0 12 Jan 2023
GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation Chenhongyi Yang Jiarui Xu Shalini De Mello Elliot J. Crowley X. Wang ViT 30 21 0 13 Dec 2022
Improving Cross-Modal Retrieval with Set of Diverse Embeddings Dongwon Kim Nam-Won Kim Suha Kwak 16 37 0 30 Nov 2022
ILSGAN: Independent Layer Synthesis for Unsupervised Foreground-Background Segmentation Qiran Zou Yu Yang Wing Yin Cheung Chang-rui Liu Xiang Ji GAN 21 4 0 25 Nov 2022
ONeRF: Unsupervised 3D Object Segmentation from Multiple Views Sheng-Ming Liang Yichen Liu Shangzhe Wu Yu-Wing Tai Chi-Keung Tang 28 7 0 22 Nov 2022
A Short Survey of Systematic Generalization Yuanpeng Li AI4CE 27 1 0 22 Nov 2022
Disentangled Representation Learning Xin Eric Wang Hong Chen Siao Tang Zihao Wu Wenwu Zhu DRL 26 77 0 21 Nov 2022
Boosting Object Representation Learning via Motion and Object Continuity Quentin Delfosse Wolfgang Stammer Thomas Rothenbacher Dwarak Vittal Kristian Kersting OCL 37 20 0 16 Nov 2022
Dance of SNN and ANN: Solving binding problem by combining spike timing and reconstructive attention Hao Zheng Hui Lin Rong Zhao Luping Shi 23 5 0 11 Nov 2022
Disentangling Content and Motion for Text-Based Neural Video Manipulation Levent Karacan Tolga Kerimouglu .Ismail .Inan Tolga Birdal Erkut Erdem Aykut Erdem 18 1 0 05 Nov 2022
Neural Systematic Binder Gautam Singh Yeongbin Kim Sungjin Ahn OCL 29 36 0 02 Nov 2022
Learning Explicit Object-Centric Representations with Vision Transformers Oscar Vikström Alexander Ilin OCL ViT 30 4 0 25 Oct 2022
Search for Concepts: Discovering Visual Concepts Using Direct Optimization P. Reddy Paul Guerrero Niloy J. Mitra OCL 19 4 0 25 Oct 2022
Unsupervised Multi-object Segmentation by Predicting Probable Motion Patterns Laurynas Karazija Subhabrata Choudhury Iro Laina Christian Rupprecht Andrea Vedaldi OCL 100 20 0 21 Oct 2022
Play It Back: Iterative Attention for Audio Recognition Alexandros Stergiou Dima Damen 26 4 0 20 Oct 2022
MoCoDA: Model-based Counterfactual Data Augmentation Silviu Pitis Elliot Creager Ajay Mandlekar Animesh Garg OffRL 31 33 0 20 Oct 2022
Robust and Controllable Object-Centric Learning through Energy-based Models Ruixiang Zhang Tong Che B. Ivanovic Renhao Wang Marco Pavone Yoshua Bengio Liam Paull OCL 26 8 0 11 Oct 2022
Learning Hierarchical Image Segmentation For Recognition and By Recognition Tsung-Wei Ke Sangwoo Mo Stella X. Yu VLM 27 9 0 01 Oct 2022
Motion-inductive Self-supervised Object Discovery in Videos Shuangrui Ding Weidi Xie Yabo Chen Rui Qian Xiaopeng Zhang H. Xiong Q. Tian VOS 16 18 0 01 Oct 2022
Differentiable Parsing and Visual Grounding of Natural Language Instructions for Object Placement Zirui Zhao W. Lee David Hsu OOD 32 9 0 01 Oct 2022
Entropy-driven Unsupervised Keypoint Representation Learning in Videos A. Younes Simone Schaub-Meyer Georgia Chalvatzaki SSL 24 0 0 30 Sep 2022
Bridging the Gap to Real-World Object-Centric Learning Maximilian Seitzer Max Horn Andrii Zadaianchuk Dominik Zietlow Tianjun Xiao ... Tong He Zheng-Wei Zhang Bernhard Schölkopf Thomas Brox Francesco Locatello OCL 37 139 0 29 Sep 2022
Reconstruction-guided attention improves the robustness and shape processing of neural networks Seoyoung Ahn Hossein Adeli G. Zelinsky DiffM AAML 25 1 0 27 Sep 2022
A Simple and Powerful Global Optimization for Unsupervised Video Object Segmentation Georgy Ponimatkin Nermin Samet Yanghua Xiao Yuming Du Renaud Marlet Vincent Lepetit VOS 72 20 0 19 Sep 2022