Attention over learned object embeddings enables complex visual
reasoning

Attention over learned object embeddings enables complex visual reasoning

15 December 2020

Malcolm Reynolds

Papers citing "Attention over learned object embeddings enables complex visual reasoning"

18 / 18 papers shown

Title
Learning to Reason Iteratively and Parallelly for Complex Visual Reasoning Scenarios Shantanu Jaiswal Debaditya Roy Basura Fernando Cheston Tan ReLM LRM 71 2 0 20 Nov 2024
Understanding the Limits of Vision Language Models Through the Lens of the Binding Problem Declan Campbell Sunayana Rane Tyler Giallanza Nicolò De Sabbata Kia Ghods ... Alexander Ku Steven M. Frankland Thomas L. Griffiths Jonathan D. Cohen Taylor W. Webb 34 13 0 31 Oct 2024
Exploring the Effectiveness of Object-Centric Representations in Visual Question Answering: Comparative Insights with Foundation Models Amir Mohammad Karimi Mamaghan Samuele Papa Karl Henrik Johansson Stefan Bauer Andrea Dittadi OCL 42 5 0 22 Jul 2024
Recurrent Complex-Weighted Autoencoders for Unsupervised Object Discovery Anand Gopalakrishnan Aleksandar Stanić Jürgen Schmidhuber M. C. Mozer 45 5 0 27 May 2024
Learning Object Permanence from Videos via Latent Imaginations Manuel Traub Frederic Becker S. Otte Martin Volker Butz 25 1 0 16 Oct 2023
Does Visual Pretraining Help End-to-End Reasoning? Chen Sun Calvin Luo Xingyi Zhou Anurag Arnab Cordelia Schmid OCL LRM ViT 28 3 0 17 Jul 2023
A Reminder of its Brittleness: Language Reward Shaping May Hinder Learning for Instruction Following Agents Sukai Huang N. Lipovetzky Trevor Cohn 30 2 0 26 May 2023
SlotDiffusion: Object-Centric Generative Modeling with Diffusion Models Ziyi Wu Jingyu Hu Wuyue Lu Igor Gilitschenski Animesh Garg DiffM OCL 28 44 0 18 May 2023
Reusable Slotwise Mechanisms Trang Nguyen Amin Mansouri Kanika Madan Khuong N. Nguyen Kartik Ahuja Dianbo Liu Yoshua Bengio OCL 20 4 0 21 Feb 2023
Reasoning about Actions over Visual and Linguistic Modalities: A Survey Shailaja Keyur Sampat Maitreya Patel Subhasish Das Yezhou Yang Chitta Baral ReLM LM&Ro LRM 19 12 0 15 Jul 2022
Interactive Visual Reasoning under Uncertainty Manjie Xu Guangyuan Jiang Wei Liang Song-Chun Zhu Yixin Zhu LRM 37 5 0 18 Jun 2022
Unsupervised Multi-object Segmentation Using Attention and Soft-argmax Bruno Sauvalle A. de La Fortelle 3DPC 44 12 0 26 May 2022
Unsupervised Learning of Temporal Abstractions with Slot-based Transformers Anand Gopalakrishnan Kazuki Irie Jürgen Schmidhuber Sjoerd van Steenkiste OffRL 19 16 0 25 Mar 2022
Conditional Object-Centric Learning from Video Thomas Kipf Gamaleldin F. Elsayed Aravindh Mahendran Austin Stone S. Sabour G. Heigold Rico Jonschkowski Alexey Dosovitskiy Klaus Greff OCL 39 213 0 24 Nov 2021
Understanding the computational demands underlying visual reasoning Mohit Vaishnav Rémi Cadène A. Alamia Drew Linsley Rufin VanRullen Thomas Serre GNN CoGe 32 16 0 08 Aug 2021
Coordination Among Neural Modules Through a Shared Global Workspace Anirudh Goyal Aniket Didolkar Alex Lamb Kartikeya Badola Nan Rosemary Ke Nasim Rahaman Jonathan Binas Charles Blundell Michael C. Mozer Yoshua Bengio 154 98 0 01 Mar 2021
Unsupervised Discovery of 3D Physical Objects from Video Yilun Du Kevin A. Smith Tomer Ulman J. Tenenbaum Jiajun Wu OCL 107 37 0 24 Jul 2020
Learning Object Permanence from Video Aviv Shamsian Ofri Kleinfeld Amir Globerson Gal Chechik SSL 29 31 0 23 Mar 2020