2203.03570
Kubric: A scalable dataset generator
7 March 2022
Klaus Greff
Francois Belletti
Lucas Beyer
Carl Doersch
Yilun Du
Daniel Duckworth
David J. Fleet
Dan Gnanapragasam
Florian Golemo
Charles Herrmann
Thomas Kipf
Abhijit Kundu
Dmitry Lagun
I. Laradji
Hsueh-Ti Liu
H. Meyer
Yishu Miao
Derek Nowrouzezahrai
Cengiz Öztireli
Etienne Pot
Noha Radwan
Daniel Rebain
S. Sabour
Mehdi S. M. Sajjadi
Matan Sela
Vincent Sitzmann
Austin Stone
Deqing Sun
Suhani Vora
Ziyu Wang
Tianhao Wu
K. M. Yi
Fangcheng Zhong
Andrea Tagliasacchi
Papers citing
"Kubric: A scalable dataset generator"
50 / 193 papers shown
Title
DMC-VB: A Benchmark for Representation Learning for Control with Visual Distractors
Joseph Ortiz
Antoine Dedieu
Wolfgang Lehrach
Swaroop Guntupalli
Carter Wendelken
Ahmad Humayun
Guangyao Zhou
Sivaramakrishnan Swaminathan
Miguel Lázaro-Gredilla
Kevin P. Murphy
OffRL
44
1
0
26 Sep 2024
Self-Supervised Any-Point Tracking by Contrastive Random Walks
Ayush Shrivastava
Andrew Owens
23
3
0
24 Sep 2024
MHRC: Closed-loop Decentralized Multi-Heterogeneous Robot Collaboration with Large Language Models
Wenhao Yu
Jie Peng
Yueliang Ying
Sai Li
Jianmin Ji
Yanyong Zhang
30
4
0
24 Sep 2024
Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation
Liu He
Yizhi Song
Hejun Huang
Pinxin Liu
Yunlong Tang
Daniel G. Aliaga
Xin Zhou
DiffM
VGen
90
3
0
19 Aug 2024
Zero-Shot Object-Centric Representation Learning
Aniket Didolkar
Andrii Zadaianchuk
Anirudh Goyal
Mike Mozer
Yoshua Bengio
Georg Martius
Maximilian Seitzer
VLM
OCL
29
4
0
17 Aug 2024
Local All-Pair Correspondence for Point Tracking
Seokju Cho
Jiahui Huang
Jisu Nam
Honggyu An
Seungryong Kim
Joon-Young Lee
19
26
0
22 Jul 2024
Shape of Motion: 4D Reconstruction from a Single Video
Qianqian Wang
Vickie Ye
Hang Gao
Jake Austin
Zhengqi Li
Angjoo Kanazawa
VGen
45
63
0
18 Jul 2024
TAPVid-3D: A Benchmark for Tracking Any Point in 3D
Skanda Koppula
Ignacio Rocco
Yi Yang
Joe Heyward
João Carreira
Andrew Zisserman
Gabriel J. Brostow
Carl Doersch
39
14
0
08 Jul 2024
Attention Normalization Impacts Cardinality Generalization in Slot Attention
Markus Krimmel
Jan Achterhold
Joerg Stueckler
OCL
37
0
0
04 Jul 2024
Guiding Video Prediction with Explicit Procedural Knowledge
Patrick Takenaka
Johannes Maucher
Marco F. Huber
37
1
0
26 Jun 2024
ViPro: Enabling and Controlling Video Prediction for Complex Dynamical Scenarios using Procedural Knowledge
Patrick Takenaka
Johannes Maucher
Marco F. Huber
VGen
26
0
0
26 Jun 2024
Slot State Space Models
Jindong Jiang
Fei Deng
Gautam Singh
Minseung Lee
Sungjin Ahn
39
4
0
18 Jun 2024
Neural Assets: 3D-Aware Multi-Object Scene Synthesis with Image Diffusion Models
Ziyi Wu
Yulia Rubanova
Rishabh Kabra
Drew A. Hudson
Igor Gilitschenski
Yusuf Aytar
Sjoerd van Steenkiste
Kelsey R. Allen
Thomas Kipf
VGen
DiffM
31
10
0
13 Jun 2024
Comparison Visual Instruction Tuning
Wei Lin
M. Jehanzeb Mirza
Sivan Doveh
Rogerio Feris
Raja Giryes
Sepp Hochreiter
Leonid Karlinsky
46
4
0
13 Jun 2024
Adaptive Slot Attention: Object Discovery with Dynamic Slot Number
Ke Fan
Zechen Bai
Tianjun Xiao
Tong He
Max Horn
Yanwei Fu
Francesco Locatello
Zheng Zhang
OCL
30
5
0
13 Jun 2024
IllumiNeRF: 3D Relighting without Inverse Rendering
Xiaoming Zhao
Pratul P. Srinivasan
Dor Verbin
Keunhong Park
Ricardo Martín Brualla
Philipp Henzler
26
5
0
10 Jun 2024
GenHeld: Generating and Editing Handheld Objects
Chaerin Min
Srinath Sridhar
29
0
0
07 Jun 2024
Streaming quanta sensors for online, high-performance imaging and vision
Tianyi Zhang
Matthew Dutson
Vivek Boominathan
Mohit Gupta
Ashok Veeraraghavan
28
0
0
02 Jun 2024
Transformers and Slot Encoding for Sample Efficient Physical World Modelling
Francesco Petri
Luigi Asprino
Aldo Gangemi
OCL
ViT
27
0
0
30 May 2024
NeRF On-the-go: Exploiting Uncertainty for Distractor-free NeRFs in the Wild
Weining Ren
Zihan Zhu
Boyang Sun
Jiaqi Chen
Marc Pollefeys
Songyou Peng
21
31
0
29 May 2024
Recurrent Complex-Weighted Autoencoders for Unsupervised Object Discovery
Anand Gopalakrishnan
Aleksandar Stanić
Jürgen Schmidhuber
M. C. Mozer
40
5
0
27 May 2024
Diagnosing the Compositional Knowledge of Vision Language Models from a Game-Theoretic View
Jin Wang
Shichao Dong
Yapeng Zhu
Kelu Yao
Weidong Zhao
Chao Li
Ping Luo
CoGe
LRM
32
2
0
27 May 2024
Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis
Basile Van Hoorick
Rundi Wu
Ege Ozguroglu
Kyle Sargent
Ruoshi Liu
P. Tokmakov
Achal Dave
Changxi Zheng
Carl Vondrick
DiffM
VGen
47
29
0
23 May 2024
DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos
Wen-Hsuan Chu
Lei Ke
Katerina Fragkiadaki
3DGS
VGen
21
29
0
03 May 2024
Unsupervised Dynamics Prediction with Object-Centric Kinematics
Yeon-Ji Song
Suhyung Choi
Jaein Kim
Jin-Hwa Kim
Byoung-Tak Zhang
28
0
0
29 Apr 2024
SpatialTracker: Tracking Any 2D Pixels in 3D Space
Yuxi Xiao
Qianqian Wang
Shangzhan Zhang
Nan Xue
Sida Peng
Yujun Shen
Xiaowei Zhou
19
51
0
05 Apr 2024
RaSim: A Range-aware High-fidelity RGB-D Data Simulation Pipeline for Real-world Applications
Xingyu Liu
Chenyangguang Zhang
Gu Wang
Ruida Zhang
Xiangyang Ji
24
1
0
05 Apr 2024
NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation
Jiahao Chen
Yipeng Qin
Lingjie Liu
Jiangbo Lu
Guanbin Li
30
11
0
26 Mar 2024
Reasoning-Enhanced Object-Centric Learning for Videos
Jian Li
Pu Ren
Yang Liu
Hao-Lun Sun
OCL
LRM
33
2
0
22 Mar 2024
TAPTR: Tracking Any Point with Transformers as Detection
Hongyang Li
Hao Zhang
Shilong Liu
Zhaoyang Zeng
Tianhe Ren
Feng Li
Lei Zhang
24
19
0
19 Mar 2024
Fast Sparse View Guided NeRF Update for Object Reconfigurations
Ziqi Lu
Jianbo Ye
Xiaohan Fei
Xiaolong Li
Jiawei Mo
Ashwin Swaminathan
Stefano Soatto
29
1
0
16 Mar 2024
Synth$^2$: Boosting Visual-Language Models with Synthetic Captions and Image Embeddings
Sahand Sharifzadeh
Christos Kaplanis
Shreya Pathak
D. Kumaran
Anastasija Ilić
Jovana Mitrović
Charles Blundell
Andrea Banino
VLM
26
9
0
12 Mar 2024
Genetic Learning for Designing Sim-to-Real Data Augmentations
Bram Vanherle
Nick Michiels
F. Reeth
22
0
0
11 Mar 2024
A spatiotemporal style transfer algorithm for dynamic visual stimulus generation
Antonino Greco
Markus Siegel
17
2
0
07 Mar 2024
A Survey of Geometric Graph Neural Networks: Data Structures, Models and Applications
Jiaqi Han
Jiacheng Cen
Liming Wu
Zongzhao Li
Xiangzhe Kong
...
Zhewei Wei
Deli Zhao
Yu Rong
Wenbing Huang
AI4CE
32
20
0
01 Mar 2024
Parallelized Spatiotemporal Binding
Gautam Singh
Yue Wang
Jiawei Yang
B. Ivanovic
Sungjin Ahn
Marco Pavone
Tong Che
36
1
0
26 Feb 2024
Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive Review
Thang-Anh-Quan Nguyen
Amine Bourki
Mátyás Macudzinski
Anthony Brunel
M. Bennamoun
25
9
0
17 Feb 2024
Robust Inverse Graphics via Probabilistic Inference
Tuan Anh Le
Pavel Sountsov
Matthew D. Hoffman
Ben Lee
Brian Patton
Rif A. Saurous
16
0
0
02 Feb 2024
Neural Slot Interpreters: Grounding Object Semantics in Emergent Slot Representations
Bhishma Dedhia
N. Jha
OCL
43
1
0
02 Feb 2024
BootsTAP: Bootstrapped Training for Tracking-Any-Point
Carl Doersch
Pauline Luc
Yi Yang
Dilara Gokay
Skanda Koppula
...
Joseph Heyward
Ignacio Rocco
Ross Goroshin
João Carreira
Andrew Zisserman
35
39
0
01 Feb 2024
Computer Vision for Primate Behavior Analysis in the Wild
Richard Vogg
Timo Lüddecke
Jonathan Henrich
Sharmita Dey
Matthias Nuske
...
Alexander Gail
Stefan Treue
H. Scherberger
F. Worgotter
Alexander S. Ecker
28
3
0
29 Jan 2024
Scaling Face Interaction Graph Networks to Real World Scenes
Tatiana López-Guevara
Yulia Rubanova
William F. Whitney
Tobias Pfaff
Kimberly L. Stachenfeld
Kelsey R. Allen
AI4CE
GNN
3DH
10
2
0
22 Jan 2024
Understanding Video Transformers via Universal Concept Discovery
M. Kowal
Achal Dave
Rares Ambrus
Adrien Gaidon
Konstantinos G. Derpanis
P. Tokmakov
ViT
27
8
0
19 Jan 2024
EgoGen: An Egocentric Synthetic Data Generator
Gen Li
Kai Zhao
Siwei Zhang
X. Lyu
Mihai Dusmanu
Yan Zhang
Marc Pollefeys
Siyu Tang
EgoV
VGen
20
14
0
16 Jan 2024
Unsupervised Object-Centric Learning from Multiple Unspecified Viewpoints
Jinyang Yuan
Tonglin Chen
Zhimeng Shen
Bin Li
Xiangyang Xue
OCL
16
2
0
03 Jan 2024
DriveTrack: A Benchmark for Long-Range Point Tracking in Real-World Videos
Arjun Balasingam
Joseph Chandler
Chenning Li
Zhoutong Zhang
Hari Balakrishnan
19
8
0
15 Dec 2023
View-Dependent Octree-based Mesh Extraction in Unbounded Scenes for Procedural Synthetic Data
Zeyu Ma
Alexander R. E. Raistrick
Lahav Lipson
Jia Deng
17
0
0
13 Dec 2023
Template Free Reconstruction of Human-object Interaction with Procedural Interaction Generation
Xianghui Xie
Bharat Lal Bhatnagar
J. E. Lenssen
Gerard Pons-Moll
3DH
27
13
0
12 Dec 2023
Learning 3D Particle-based Simulators from RGB-D Videos
William F. Whitney
Tatiana López-Guevara
Tobias Pfaff
Yulia Rubanova
Thomas Kipf
Kimberly L. Stachenfeld
Kelsey R. Allen
VGen
17
11
0
08 Dec 2023
Benchmarking and Analysis of Unsupervised Object Segmentation from Real-world Single Images
Yafei Yang
Bo Yang
OCL
22
1
0
08 Dec 2023