209

Learned Nonlinear Predictor for Critically Sampled 3D Point Cloud Attribute Compression

International Conference on Information Photonics (ICIP), 2023
Abstract

We study 3D point cloud attribute compression via a volumetric approach: assuming point cloud geometry is known at both encoder and decoder, parameters θ\theta of a continuous attribute function f:R3Rf: \mathbb{R}^3 \mapsto \mathbb{R} are quantized to θ^\hat{\theta} and encoded, so that discrete samples fθ^(xi)f_{\hat{\theta}}(\mathbf{x}_i) can be recovered at known 3D points xiR3\mathbf{x}_i \in \mathbb{R}^3 at the decoder. Specifically, we consider a nested sequences of function subspaces Fl0(p)FL(p)\mathcal{F}^{(p)}_{l_0} \subseteq \cdots \subseteq \mathcal{F}^{(p)}_L, where Fl(p)\mathcal{F}_l^{(p)} is a family of functions spanned by B-spline basis functions of order pp, flf_l^* is the projection of ff on Fl(p)\mathcal{F}_l^{(p)} and encoded as low-pass coefficients FlF_l^*, and glg_l^* is the residual function in orthogonal subspace Gl(p)\mathcal{G}_l^{(p)} (where Gl(p)Fl(p)=Fl+1(p)\mathcal{G}_l^{(p)} \oplus \mathcal{F}_l^{(p)} = \mathcal{F}_{l+1}^{(p)}) and encoded as high-pass coefficients GlG_l^*. In this paper, to improve coding performance over [1], we study predicting fl+1f_{l+1}^* at level l+1l+1 given flf_l^* at level ll and encoding of GlG_l^* for the p=1p=1 case (RAHT(11)). For the prediction, we formalize RAHT(1) linear prediction in MPEG-PCC in a theoretical framework, and propose a new nonlinear predictor using a polynomial of bilateral filter. We derive equations to efficiently compute the critically sampled high-pass coefficients GlG_l^* amenable to encoding. We optimize parameters in our resulting feed-forward network on a large training set of point clouds by minimizing a rate-distortion Lagrangian. Experimental results show that our improved framework outperformed the MPEG G-PCC predictor by 1111 to 12%12\% in bit rate reduction.

View on arXiv
Comments on this paper