ResearchTrend.AI
Seeing the Unseen: Visual Common Sense for Semantic Placement


15 January 2024
Ram Ramrakhya
Aniruddha Kembhavi
Dhruv Batra
Z. Kira
Kuo-Hao Zeng
Luca Weihs
    VLM
arXiv: 2401.07770

Papers citing "Seeing the Unseen: Visual Common Sense for Semantic Placement"

10 / 10 papers shown
PlaceIt3D: Language-Guided Object Placement in Real 3D Scenes
Ahmed Abdelreheem
Filippo Aleotti
Jamie Watson
Z. Qureshi
Abdelrahman Eldesokey
Peter Wonka
Gabriel J. Brostow
Sara Vicente
Guillermo Garcia-Hernando
DiffM
44
0
0
08 May 2025
Text2Place: Affordance-aware Text Guided Human Placement
Rishubh Parihar
Harsh Gupta
VS Sachidanand
R. V. Babu
DiffM
21
5
0
22 Jul 2024
Smart Vision-Language Reasoners
Denisa Roberts
Lucas Roberts
VLM
ReLM
LRM
28
4
0
05 Jul 2024
GenDexGrasp: Generalizable Dexterous Grasping
Puhao Li
Tengyu Liu
Yuyang Li
Yiran Geng
Yixin Zhu
Yaodong Yang
Siyuan Huang
49
63
0
03 Oct 2022
End-to-End Affordance Learning for Robotic Manipulation
Yiran Geng
Boshi An
Haoran Geng
Yuanpei Chen
Yaodong Yang
Hao Dong
52
59
0
26 Sep 2022
One-Shot Transfer of Affordance Regions? AffCorrs!
Denis Hadjivelichkov
Sicelukwanda Zwane
M. Deisenroth
Lourdes Agapito
Dimitrios Kanoulas
35
24
0
15 Sep 2022
Housekeep: Tidying Virtual Households using Commonsense Reasoning
Yash Kant
Arun Ramachandran
Sriram Yenamandra
Igor Gilitschenski
Dhruv Batra
Andrew Szot
Harsh Agrawal
LM&Ro
LRM
132
70
0
22 May 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
380
4,010
0
28 Jan 2022
Where2Act: From Pixels to Actions for Articulated 3D Objects
Kaichun Mo
Leonidas J. Guibas
Mustafa Mukadam
Abhinav Gupta
Shubham Tulsiani
149
175
0
07 Jan 2021
U-Net: Convolutional Networks for Biomedical Image Segmentation
Olaf Ronneberger
Philipp Fischer
Thomas Brox
SSeg
3DV
226
74,467
0
18 May 2015