首页|面向点云几何压缩的隐式编码网络

面向点云几何压缩的隐式编码网络

扫码查看
目的 现有点云几何压缩算法通常将点云转换为八叉树或带潜在特征的稀疏点,从而提高数据结构的存储效率。这些方法将点云量化至三维网格点,导致点云所在表面的精度受限于量化分辨率。针对这一问题,本文将点云转化为连续的隐式表征,提出一种基于隐式表征的点云几何压缩算法框架,以克服量化分辨率对压缩质量的不利影响。方法 该框架由基于符号对数距离场的隐式表征与带乘性分支结构的神经网络组成。具体来说,本文在编码阶段利用神经网络拟合隐式表征,并对该网络进行模型压缩,然后在解码阶段结合改进的Marching Cube(MC)算法重建点云所在表面,采样恢复点云数据。结果 本文在ABC(a big CAD model dataset)、Famous与MPEG PCC(MPEG point cloud compression dataset)数据集上进行了点云表面压缩实验。与基准算法 INR(implicit neural repre-sentations for image compression)相比,本文算法的LI倒角损失平均下降了 12。4%,Normal Consistency与F-score指标平均提升了 1。5%与13。6%,压缩效率随模型参数量增大而提升,平均增幅为12。9%。与几何压缩标准算法G-PCC(geometry-based point cloud compression)相比,本文算法在存储大小为10 KB下依然保持55 dB以上的D1-PSNR重建性能,有效压缩上限高于G-PCC。此外,消融实验分别验证了本文提出的隐式表征和神经网络结构的有效性。结论 实验结果表明,本文提出的点云压缩算法克服了现有算法的分辨率限制,不仅提升了表面重建精度,而且提升了点云表面的压缩效率与有效压缩上限。
An implicit coding network for point cloud geometry compression
Objective Point clouds captured by depth sensors or generated by reconstruction algorithms are essential for various 3D vision tasks,including 3D scene understanding,scan registration,and 3D reconstruction.However,a simple scene or object contains massive amounts of unstructured points,leading to challenges in the storage and transmission of these point cloud data.Therefore,developing point cloud geometry compression algorithms is important to effectively handle and process point cloud data.Existing point cloud compression algorithms typically involve converting point clouds into a storage-efficient data structure,such as an octree representation or sparse points with latent features.These interme-diate representations are then encoded as a compact bitstream by using either handcrafted or learning-based entropy cod-ers.Although the correlation of spatial points effectively improves compression performance,existing algorithms may not fully exploit these points as representations of the object surface and topology.Recent studies have addressed this problem by exploring implicit representations and neural networks for surface reconstruction.However,these methods are primarily tailored for 3D objects represented as occupancy fields and signed distance fields,thus limiting their applicability to point clouds and non-watertight meshes in terms of surface representation and reconstruction.Furthermore,the neural networks used in these approaches often rely on simple multi-layer perceptron structures,which may lack capacity and compression efficiency for point cloud geometry compression tasks.Method To deal with these limitations,we proposed a novel point cloud geometry compression framework,including a signed logarithmic distance field,an implicit network structure with the multiplicative branches,and an adaptive marching cube algorithm for surface extraction.First,the point cloud surface(serving as the zero level-set)maps the arbitrary points in space to the distance values of their nearest points on the point cloud surface.We design an implicit representation called signed logarithmic distance field(SLDF),which utilizes the thickness assumption and logarithmic parameterization to fit arbitrary point cloud surfaces.Afterward,we apply a multipli-cative implicit neural encoding network(MINE)to encode the surface as a compact neural representation.MINE combines sinusoidal activation functions and multiplicative operators to enhance the capability and distribution characteristics of the network.The overfitting process transforms the mapping function from point cloud coordinates to implicit distance fields into a neural network,which is subsequently utilized for model compression.Through the decompressed network,the con-tinuous surface is reconstructed using the adaptive marching cubes algorithm(AMC),which incorporates a dual-layer sur-face fusion process to further enhance the accuracy of surface extraction for SLDF.Result We compared our algorithm with six state-of-the-art algorithms,including the surface compression approaches based on implicit representation and point cloud compression methods,on three public datasets,namely,ABC,Famous,and MPEG PCC.The quantitative evalua-tion metrics included the rate-distortion curves of chamfer-L1 distance(L1-CD),normal consistency(NC),F-score for continuous point cloud surface,and the rate-distortion curve of D1-PSNR for quantized point cloud surface.Compared with the suboptimal method(i.e.,INR),our proposed method reduces L1-CD loss by 12.4%and improves the NC and F-score performance by 1.5%and 13.6%on the ABC and Famous datasets,respectively.Moreover,the compression efficiency increases by an average of 12.9%along with the growth of model parameters.On multiple MPEG PCC datasets with samples taken from the 512-resolution MVUB dataset,1024-resolution 8iVFB dataset,and 2048-resolution Owlii dataset,our method achieves a D1-PSNR performance of over 55 dB within the 10 KB range,which highlights its higher effective compression limit compared with G-PCC.Ablation experiments show that in the absence of SLDF,the L1-CD loss increases by 18.53%,while the D1-PSNR performance increases by 15 dB.Similarly,without the MINE network,the L1-CD loss increases by 3.72%,and the D1-PSNR performance increases by 2.67 dB.Conclusion This work explores the implicit representation for point cloud surfaces and proposes an enhanced point cloud compression framework.We initially design SLDF to extend the implicit representations of arbitrary topologies in point clouds,and then we use the multiplica-tive branches network to enhance the capability and distribution characteristics of the network.We then apply a surface extraction algorithm to enhance the quality of the reconstructed point cloud.In this way,we obtain a unified framework for the geometric compression of point cloud surfaces at arbitrary resolutions.Experimental results demonstrate that our pro-posed method achieves a promising performance in point cloud geometry compression.

point cloud geometry compressionimplicit representationsurface reconstructionmodel compressionsur-face extraction algorithm

陈佳慧、方广驰、李浩然、张晔、黄小红、郭裕兰

展开 >

中山大学电子与通信工程学院,深圳 518107

点云几何压缩 隐式表征 三维重建 模型压缩 表面提取算法

2024

中国图象图形学报
中国科学院遥感应用研究所,中国图象图形学学会 ,北京应用物理与计算数学研究所

中国图象图形学报

CSTPCD北大核心
影响因子:1.111
ISSN:1006-8961
年,卷(期):2024.29(12)