ResearchTrend.AI
Object-Centric Learning with Slot Attention

26 June 2020
Francesco Locatello
Dirk Weissenborn
Thomas Unterthiner
Aravindh Mahendran
G. Heigold
Jakob Uszkoreit
Alexey Dosovitskiy
Thomas Kipf
    OCL

Papers citing "Object-Centric Learning with Slot Attention"

50 / 193 papers shown
AE-NeRF: Audio Enhanced Neural Radiance Field for Few Shot Talking Head Synthesis
Dongze Li
Kang Zhao
Wei Wang
Bo Peng
Yingya Zhang
Jing Dong
Tien-Ping Tan
DiffM
VGen
27
12
0
18 Dec 2023
Uni3DL: Unified Model for 3D and Language Understanding
Xiang Li
Jian Ding
Zhaoyang Chen
Mohamed Elhoseiny
30
3
0
05 Dec 2023
Learning Generalizable Manipulation Policies with Object-Centric 3D Representations
Yifeng Zhu
Zhenyu Jiang
Peter Stone
Yuke Zhu
3DPC
22
43
0
22 Oct 2023
Loci-Segmented: Improving Scene Segmentation Learning
Manuel Traub
Frederic Becker
Adrian Sauter
S. Otte
Martin Volker Butz
26
2
0
16 Oct 2023
Vision Transformers Need Registers
Timothée Darcet
Maxime Oquab
Julien Mairal
Piotr Bojanowski
ViT
37
311
0
28 Sep 2023
Temporal-aware Hierarchical Mask Classification for Video Semantic Segmentation
Zhaochong An
Guolei Sun
Zongwei Wu
Hao Tang
Luc Van Gool
VOS
23
4
0
14 Sep 2023
AttrSeg: Open-Vocabulary Semantic Segmentation via Attribute Decomposition-Aggregation
Chaofan Ma
Yu-Hao Yang
Chen Ju
Fei Zhang
Ya-Qin Zhang
Yanfeng Wang
VLM
40
17
0
31 Aug 2023
Enhancing Interpretable Object Abstraction via Clustering-based Slot Initialization
Ni Gao
Bernard Hohmann
Gerhard Neumann
OCL
27
2
0
22 Aug 2023
Does Visual Pretraining Help End-to-End Reasoning?
Chen Sun
Calvin Luo
Xingyi Zhou
Anurag Arnab
Cordelia Schmid
OCL
LRM
ViT
30
3
0
17 Jul 2023
Compositional Generalization from First Principles
Thaddäus Wiedemer
Prasanna Mayilvahanan
Matthias Bethge
Wieland Brendel
OCL
25
37
0
10 Jul 2023
Learning Differentiable Logic Programs for Abstract Visual Reasoning
Hikaru Shindo
Viktor Pfanschilling
D. Dhami
Kristian Kersting
NAI
26
6
0
03 Jul 2023
Online Unsupervised Video Object Segmentation via Contrastive Motion Clustering
Lin Xi
Weihai Chen
Xingming Wu
Zhong Liu
Zhengguo Li
VOS
23
9
0
21 Jun 2023
How can objects help action recognition?
Xingyi Zhou
Anurag Arnab
Chen Sun
Cordelia Schmid
30
14
0
20 Jun 2023
OCTScenes: A Versatile Real-World Dataset of Tabletop Scenes for Object-Centric Learning
Yin-Tao Huang
Tonglin Chen
Zhimeng Shen
Jinghao Huang
Bin Li
Xiangyang Xue
OCL
25
1
0
16 Jun 2023
Scalable Neural-Probabilistic Answer Set Programming
Arseny Skryagin
Daniel Ochs
D. Dhami
Kristian Kersting
30
5
0
14 Jun 2023
Im-Promptu: In-Context Composition from Image Prompts
Bhishma Dedhia
Michael Chang
Jake C. Snell
Thomas L. Griffiths
N. Jha
LRM
MLLM
29
1
0
26 May 2023
GeoVLN: Learning Geometry-Enhanced Visual Representation with Slot Attention for Vision-and-Language Navigation
Jingyang Huo
Qiang Sun
Boyan Jiang
Haitao Lin
Yanwei Fu
32
19
0
26 May 2023
An Examination of the Robustness of Reference-Free Image Captioning Evaluation Metrics
Saba Ahmadi
Aishwarya Agrawal
17
6
0
24 May 2023
SlotDiffusion: Object-Centric Generative Modeling with Diffusion Models
Ziyi Wu
Jingyu Hu
Wuyue Lu
Igor Gilitschenski
Animesh Garg
DiffM
OCL
30
44
0
18 May 2023
Ray-Patch: An Efficient Querying for Light Field Transformers
T. B. Martins
Javier Civera
ViT
34
0
0
16 May 2023
Learnable Earth Parser: Discovering 3D Prototypes in Aerial Scans
Romain Loiseau
Elliot Vincent
Mathieu Aubry
Loic Landrieu
3DPC
13
3
0
19 Apr 2023
RePAST: Relative Pose Attention Scene Representation Transformer
Aleksandr Safin
Daniel Duckworth
Mehdi S. M. Sajjadi
29
3
0
03 Apr 2023
Prefix-Tree Decoding for Predicting Mass Spectra from Molecules
Samuel Goldman
John Bradshaw
Jiayi Xin
Connor W. Coley
27
12
0
11 Mar 2023
Efficient Skill Acquisition for Complex Manipulation Tasks in Obstructed Environments
Jun Yamada
J. Collins
Ingmar Posner
31
8
0
06 Mar 2023
Reusable Slotwise Mechanisms
Trang Nguyen
Amin Mansouri
Kanika Madan
Khuong N. Nguyen
Kartik Ahuja
Dianbo Liu
Yoshua Bengio
OCL
28
4
0
21 Feb 2023
Structured Generative Models for Scene Understanding
Christopher K. I. Williams
OCL
3DV
19
3
0
07 Feb 2023
Causal Triplet: An Open Challenge for Intervention-centric Causal Representation Learning
Yuejiang Liu
Alexandre Alahi
Chris Russell
Max Horn
Dominik Zietlow
Bernhard Schölkopf
Francesco Locatello
CML
54
22
0
12 Jan 2023
GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation
Chenhongyi Yang
Jiarui Xu
Shalini De Mello
Elliot J. Crowley
X. Wang
ViT
30
21
0
13 Dec 2022
Improving Cross-Modal Retrieval with Set of Diverse Embeddings
Dongwon Kim
Nam-Won Kim
Suha Kwak
16
37
0
30 Nov 2022
ILSGAN: Independent Layer Synthesis for Unsupervised Foreground-Background Segmentation
Qiran Zou
Yu Yang
Wing Yin Cheung
Chang-rui Liu
Xiang Ji
GAN
21
4
0
25 Nov 2022
ONeRF: Unsupervised 3D Object Segmentation from Multiple Views
Sheng-Ming Liang
Yichen Liu
Shangzhe Wu
Yu-Wing Tai
Chi-Keung Tang
28
7
0
22 Nov 2022
A Short Survey of Systematic Generalization
Yuanpeng Li
AI4CE
27
1
0
22 Nov 2022
Disentangled Representation Learning
Xin Eric Wang
Hong Chen
Siao Tang
Zihao Wu
Wenwu Zhu
DRL
26
77
0
21 Nov 2022
Boosting Object Representation Learning via Motion and Object Continuity
Quentin Delfosse
Wolfgang Stammer
Thomas Rothenbacher
Dwarak Vittal
Kristian Kersting
OCL
37
20
0
16 Nov 2022
Dance of SNN and ANN: Solving binding problem by combining spike timing and reconstructive attention
Hao Zheng
Hui Lin
Rong Zhao
Luping Shi
23
5
0
11 Nov 2022
Disentangling Content and Motion for Text-Based Neural Video Manipulation
Levent Karacan
Tolga Kerimoğlu
İsmail İnan
Tolga Birdal
Erkut Erdem
Aykut Erdem
18
1
0
05 Nov 2022
Neural Systematic Binder
Gautam Singh
Yeongbin Kim
Sungjin Ahn
OCL
29
36
0
02 Nov 2022
Learning Explicit Object-Centric Representations with Vision Transformers
Oscar Vikström
Alexander Ilin
OCL
ViT
30
4
0
25 Oct 2022
Search for Concepts: Discovering Visual Concepts Using Direct Optimization
P. Reddy
Paul Guerrero
Niloy J. Mitra
OCL
19
4
0
25 Oct 2022
Unsupervised Multi-object Segmentation by Predicting Probable Motion Patterns
Laurynas Karazija
Subhabrata Choudhury
Iro Laina
Christian Rupprecht
Andrea Vedaldi
OCL
100
20
0
21 Oct 2022
Play It Back: Iterative Attention for Audio Recognition
Alexandros Stergiou
Dima Damen
26
4
0
20 Oct 2022
MoCoDA: Model-based Counterfactual Data Augmentation
Silviu Pitis
Elliot Creager
Ajay Mandlekar
Animesh Garg
OffRL
31
33
0
20 Oct 2022
Robust and Controllable Object-Centric Learning through Energy-based Models
Ruixiang Zhang
Tong Che
B. Ivanovic
Renhao Wang
Marco Pavone
Yoshua Bengio
Liam Paull
OCL
26
8
0
11 Oct 2022
Learning Hierarchical Image Segmentation For Recognition and By Recognition
Tsung-Wei Ke
Sangwoo Mo
Stella X. Yu
VLM
27
9
0
01 Oct 2022
Motion-inductive Self-supervised Object Discovery in Videos
Shuangrui Ding
Weidi Xie
Yabo Chen
Rui Qian
Xiaopeng Zhang
H. Xiong
Q. Tian
VOS
16
18
0
01 Oct 2022
Differentiable Parsing and Visual Grounding of Natural Language Instructions for Object Placement
Zirui Zhao
W. Lee
David Hsu
OOD
32
9
0
01 Oct 2022
Entropy-driven Unsupervised Keypoint Representation Learning in Videos
A. Younes
Simone Schaub-Meyer
Georgia Chalvatzaki
SSL
24
0
0
30 Sep 2022
Bridging the Gap to Real-World Object-Centric Learning
Maximilian Seitzer
Max Horn
Andrii Zadaianchuk
Dominik Zietlow
Tianjun Xiao
...
Tong He
Zheng-Wei Zhang
Bernhard Schölkopf
Thomas Brox
Francesco Locatello
OCL
37
139
0
29 Sep 2022
Reconstruction-guided attention improves the robustness and shape processing of neural networks
Seoyoung Ahn
Hossein Adeli
G. Zelinsky
DiffM
AAML
25
1
0
27 Sep 2022
A Simple and Powerful Global Optimization for Unsupervised Video Object Segmentation
Georgy Ponimatkin
Nermin Samet
Yanghua Xiao
Yuming Du
Renaud Marlet
Vincent Lepetit
VOS
72
20
0
19 Sep 2022