
Unifying Light Field Perception with Field of Parallax

Abstract

Field of Parallax (FoP) is a spatial field that distills the common features from different light field (LF) representations to provide flexible and consistent support for multi-task learning. FoP is built upon three core features (projection difference, adjacency divergence, and contextual consistency) that are essential for cross-task adaptability. To implement FoP, we design a two-step angular adapter: the first step captures angular-specific differences, while the second step consolidates contextual consistency to ensure robust representation. Leveraging the FoP-based representation, we introduce the LFX framework, the first to handle arbitrary LF representations seamlessly, unifying LF multi-task vision. We evaluated LFX across three different tasks, achieving new state-of-the-art results compared with previous task-specific architectures: 84.74% in mIoU for semantic segmentation on UrbanLF, 0.84% in AP for object detection on PKU, and MAEs of 0.030 and 0.026 for salient object detection on Duftv2 and PKU, respectively. The source code will be made publicly available at this https URL.

@article{teng2025_2503.00747,
  title={Unifying Light Field Perception with Field of Parallax},
  author={Fei Teng and Buyin Deng and Boyuan Zheng and Kai Luo and Kunyu Peng and Jiaming Zhang and Kailun Yang},
  journal={arXiv preprint arXiv:2503.00747},
  year={2025}
}