Structure-Aware and Context-Modeling Point Cloud Compression
DOI:
https://doi.org/10.54097/bwanx125Keywords:
LiDAR Point Cloud, Multi-scale Sparse Tensor, Structure-Aware, Progressive BitwiseAbstract
To address the limited geometric representation capability and the coarse-grained context modeling in learning-based point cloud compression for LiDAR point cloud coding, we propose a structure-aware and context-modeling point cloud compression method (SACM-PCC). On the representation learning side, we design a Structure-Aware Target Embedding module to achieve structural alignment and effective propagation of cross-scale voxel features, thereby enhancing the expression of geometric relationships from local to global. On the probabilistic modeling side, we build a progressive bitwise target occupancy predictor that adopts a conditional autoregressive strategy to decompose each 8-bit occupancy code into four sub-codes and progressively refine the probability estimation from the most significant bits to the least significant bits, improving spatial context utilization and bit-level discrimination accuracy. Experiments on the KITTI and Ford datasets show that, at comparable reconstruction quality, SACM-PCC reduces the bitrate on KITTI by approximately 57%, 21%, and 8.7% relative to Draco, G-PCCv23, and RENO, respectively, and by approximately 54%, 21.7%, and 9% on Ford. These results demonstrate that the proposed method achieves a better rate–distortion trade-off across the full bitrate range while maintaining stable geometric reconstruction performance in complex scenes.
Downloads
References
[1] Roriz R, Silva H, Dias F, et al. A survey on data compression techniques for automotive lidar point clouds[J]. Sensors, 2024, 24(10): 3185.
[2] You K, Chen T, Ding D, et al. Reno: Real-time neural compression for 3d lidar point clouds[C]//Proceedings of the Computer Vision and Pattern Recognition Conference. 2025: 22172-22181.
[3] Galligan F, Hemmer M, Stava O, et al. Google/draco: a library for compressing and decompressing 3d geometric meshes and point clouds[J]. Draco: a library for compressing and decompressing 3D geometric meshes and point clouds, 2018.
[4] Schwarz S, Preda M, Baroncini V, et al. Emerging MPEG standards for point cloud compression[J]. IEEE Journal on Emerging and Selected Topics in Circuits and Systems, 2019, 9(1): 133-148.
[5] Huang L, Wang S, Wong K, et al. Octsqueeze: Octree-structured entropy model for lidar compression[C]//Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2020: 1313-1323.
[6] Fu C, Li G, Song R, et al. Octattention: Octree-based large-scale contexts model for point cloud compression[C]//Proceedings of the AAAI conference on artificial intelligence. 2022, 36(1): 625-633.
[7] Jin Y, Zhu Z, Xu T, et al. Ecm-opcc: Efficient context model for octree-based point cloud compression[C]//ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2024: 7985-7989.
[8] Song R, Fu C, Liu S, et al. Efficient hierarchical entropy model for learned point cloud compression[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2023: 14368-14377.
[9] Wang J, Ding D, Li Z, et al. Sparse tensor-based multiscale representation for point cloud geometry compression[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 45(7): 9055-9071.
[10] Xue R, Wang J, Ma Z. Efficient LiDAR point cloud geometry compression through neighborhood point attention[J]. arXiv preprint arXiv:2208.12573, 2022..
[11] Wang J, Xue R, Li J, et al. A versatile point cloud compressor using universal multiscale conditional coding–Part I: Geometry[J]. IEEE transactions on pattern analysis and machine intelligence, 2024.
[12] Fan T, Gao L, Xu Y, et al. Multiscale latent-guided entropy model for lidar point cloud compression[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2023, 33(12): 7857-7869.
[13] Lodhi M A, Pang J, Tian D. Sparse convolution based octree feature propagation for lidar point cloud compression[C]//ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2023: 1-5.
[14] Stathoulopoulos N, Saucedo M A V, Koval A, et al. RecNet: An invertible point cloud encoding through range image embeddings for multi-robot map sharing and reconstruction[C]//2024 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2024: 4883-4889.
[15] Wang S, Jiao J, Cai P, et al. R-pcc: A baseline for range image-based point cloud compression[C]//2022 International Conference on Robotics and Automation (ICRA). IEEE, 2022: 10055-10061.
[16] Zhou X, Qi C R, Zhou Y, et al. Riddle: Lidar data compression with range image deep delta encoding[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2022: 17212-17221.
[17] Geiger A, Lenz P, Urtasun R. Are we ready for autonomous driving? the kitti vision benchmark suite[C]//2012 IEEE conference on computer vision and pattern recognition. IEEE, 2012: 3354-3361.
[18] Pandey G, McBride J R, Eustice R M. Ford campus vision and lidar data set[J]. The International Journal of Robotics Research, 2011, 30(13): 1543-1552.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Journal of Computing and Electronic Information Management

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.








