ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2311.13539
11
0

Learned Nonlinear Predictor for Critically Sampled 3D Point Cloud Attribute Compression

22 November 2023
Tam Thuc Do
Philip A. Chou
Gene Cheung
    3DPC
ArXivPDFHTML
Abstract

We study 3D point cloud attribute compression via a volumetric approach: assuming point cloud geometry is known at both encoder and decoder, parameters θ\thetaθ of a continuous attribute function f:R3↦Rf: \mathbb{R}^3 \mapsto \mathbb{R}f:R3↦R are quantized to θ^\hat{\theta}θ^ and encoded, so that discrete samples fθ^(xi)f_{\hat{\theta}}(\mathbf{x}_i)fθ^​(xi​) can be recovered at known 3D points xi∈R3\mathbf{x}_i \in \mathbb{R}^3xi​∈R3 at the decoder. Specifically, we consider a nested sequences of function subspaces Fl0(p)⊆⋯⊆FL(p)\mathcal{F}^{(p)}_{l_0} \subseteq \cdots \subseteq \mathcal{F}^{(p)}_LFl0​(p)​⊆⋯⊆FL(p)​, where Fl(p)\mathcal{F}_l^{(p)}Fl(p)​ is a family of functions spanned by B-spline basis functions of order ppp, fl∗f_l^*fl∗​ is the projection of fff on Fl(p)\mathcal{F}_l^{(p)}Fl(p)​ and encoded as low-pass coefficients Fl∗F_l^*Fl∗​, and gl∗g_l^*gl∗​ is the residual function in orthogonal subspace Gl(p)\mathcal{G}_l^{(p)}Gl(p)​ (where Gl(p)⊕Fl(p)=Fl+1(p)\mathcal{G}_l^{(p)} \oplus \mathcal{F}_l^{(p)} = \mathcal{F}_{l+1}^{(p)}Gl(p)​⊕Fl(p)​=Fl+1(p)​) and encoded as high-pass coefficients Gl∗G_l^*Gl∗​. In this paper, to improve coding performance over [1], we study predicting fl+1∗f_{l+1}^*fl+1∗​ at level l+1l+1l+1 given fl∗f_l^*fl∗​ at level lll and encoding of Gl∗G_l^*Gl∗​ for the p=1p=1p=1 case (RAHT(111)). For the prediction, we formalize RAHT(1) linear prediction in MPEG-PCC in a theoretical framework, and propose a new nonlinear predictor using a polynomial of bilateral filter. We derive equations to efficiently compute the critically sampled high-pass coefficients Gl∗G_l^*Gl∗​ amenable to encoding. We optimize parameters in our resulting feed-forward network on a large training set of point clouds by minimizing a rate-distortion Lagrangian. Experimental results show that our improved framework outperformed the MPEG G-PCC predictor by 111111 to 12%12\%12% in bit rate reduction.

View on arXiv
Comments on this paper