Structure-Aware and Context-Modeling Point Cloud Compression
DOI:
https://doi.org/10.54097/bwanx125Keywords:
LiDAR Point Cloud, Multi-scale Sparse Tensor, Structure-Aware, Progressive BitwiseAbstract
To address the limited geometric representation capability and the coarse-grained context modeling in learning-based point cloud compression for LiDAR point cloud coding, we propose a structure-aware and context-modeling point cloud compression method (SACM-PCC). On the representation learning side, we design a Structure-Aware Target Embedding module to achieve structural alignment and effective propagation of cross-scale voxel features, thereby enhancing the expression of geometric relationships from local to global. On the probabilistic modeling side, we build a progressive bitwise target occupancy predictor that adopts a conditional autoregressive strategy to decompose each 8-bit occupancy code into four sub-codes and progressively refine the probability estimation from the most significant bits to the least significant bits, improving spatial context utilization and bit-level discrimination accuracy. Experiments on the KITTI and Ford datasets show that, at comparable reconstruction quality, SACM-PCC reduces the bitrate on KITTI by approximately 57%, 21%, and 8.7% relative to Draco, G-PCCv23, and RENO, respectively, and by approximately 54%, 21.7%, and 9% on Ford. These results demonstrate that the proposed method achieves a better rate–distortion trade-off across the full bitrate range while maintaining stable geometric reconstruction performance in complex scenes.
Downloads
References
[1] Roriz R, Silva H, Dias F, et al. A survey on data compression techniques for automotive lidar point clouds[J]. Sensors, 2024, 24(10): 3185. DOI: https://doi.org/10.3390/s24103185
[2] You K, Chen T, Ding D, et al. Reno: Real-time neural compression for 3d lidar point clouds[C]//Proceedings of the Computer Vision and Pattern Recognition Conference. 2025: 22172-22181. DOI: https://doi.org/10.1109/CVPR52734.2025.02065
[3] Galligan F, Hemmer M, Stava O, et al. Google/draco: a library for compressing and decompressing 3d geometric meshes and point clouds[J]. Draco: a library for compressing and decompressing 3D geometric meshes and point clouds, 2018.
[4] Schwarz S, Preda M, Baroncini V, et al. Emerging MPEG standards for point cloud compression[J]. IEEE Journal on Emerging and Selected Topics in Circuits and Systems, 2019, 9(1): 133-148. DOI: https://doi.org/10.1109/JETCAS.2018.2885981
[5] Huang L, Wang S, Wong K, et al. Octsqueeze: Octree-structured entropy model for lidar compression[C]//Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2020: 1313-1323. DOI: https://doi.org/10.1109/CVPR42600.2020.00139
[6] Fu C, Li G, Song R, et al. Octattention: Octree-based large-scale contexts model for point cloud compression[C]//Proceedings of the AAAI conference on artificial intelligence. 2022, 36(1): 625-633. DOI: https://doi.org/10.1609/aaai.v36i1.19942
[7] Jin Y, Zhu Z, Xu T, et al. Ecm-opcc: Efficient context model for octree-based point cloud compression[C]//ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2024: 7985-7989. DOI: https://doi.org/10.1109/ICASSP48485.2024.10446374
[8] Song R, Fu C, Liu S, et al. Efficient hierarchical entropy model for learned point cloud compression[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2023: 14368-14377. DOI: https://doi.org/10.1109/CVPR52729.2023.01381
[9] Wang J, Ding D, Li Z, et al. Sparse tensor-based multiscale representation for point cloud geometry compression[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 45(7): 9055-9071. DOI: https://doi.org/10.1109/TPAMI.2022.3225816
[10] Xue R, Wang J, Ma Z. Efficient LiDAR point cloud geometry compression through neighborhood point attention[J]. arXiv preprint arXiv:2208.12573, 2022..
[11] Wang J, Xue R, Li J, et al. A versatile point cloud compressor using universal multiscale conditional coding–Part I: Geometry[J]. IEEE transactions on pattern analysis and machine intelligence, 2024. DOI: https://doi.org/10.1109/TPAMI.2024.3462938
[12] Fan T, Gao L, Xu Y, et al. Multiscale latent-guided entropy model for lidar point cloud compression[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2023, 33(12): 7857-7869. DOI: https://doi.org/10.1109/TCSVT.2023.3276788
[13] Lodhi M A, Pang J, Tian D. Sparse convolution based octree feature propagation for lidar point cloud compression[C]//ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2023: 1-5. DOI: https://doi.org/10.1109/ICASSP49357.2023.10096990
[14] Stathoulopoulos N, Saucedo M A V, Koval A, et al. RecNet: An invertible point cloud encoding through range image embeddings for multi-robot map sharing and reconstruction[C]//2024 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2024: 4883-4889. DOI: https://doi.org/10.1109/ICRA57147.2024.10611602
[15] Wang S, Jiao J, Cai P, et al. R-pcc: A baseline for range image-based point cloud compression[C]//2022 International Conference on Robotics and Automation (ICRA). IEEE, 2022: 10055-10061. DOI: https://doi.org/10.1109/ICRA46639.2022.9811880
[16] Zhou X, Qi C R, Zhou Y, et al. Riddle: Lidar data compression with range image deep delta encoding[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2022: 17212-17221. DOI: https://doi.org/10.1109/CVPR52688.2022.01670
[17] Geiger A, Lenz P, Urtasun R. Are we ready for autonomous driving? the kitti vision benchmark suite[C]//2012 IEEE conference on computer vision and pattern recognition. IEEE, 2012: 3354-3361. DOI: https://doi.org/10.1109/CVPR.2012.6248074
[18] Pandey G, McBride J R, Eustice R M. Ford campus vision and lidar data set[J]. The International Journal of Robotics Research, 2011, 30(13): 1543-1552. DOI: https://doi.org/10.1177/0278364911400640
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Journal of Computing and Electronic Information Management

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.








