Structure-Aware and Context-Modeling Point Cloud Compression

Authors

  • Ying He
  • Chuanjiang Yan
  • Zhisen Chen

DOI:

https://doi.org/10.54097/bwanx125

Keywords:

LiDAR Point Cloud, Multi-scale Sparse Tensor, Structure-Aware, Progressive Bitwise

Abstract

Learning-based compression of LiDAR point clouds suffers from limited geometric representation capability and coarse-grained context modeling. To address both problems, we propose a structure-aware and context-modeling point cloud compression method (SACM-PCC). On the representation learning side, we design a Structure-Aware Target Embedding module that structurally aligns and propagates voxel features across scales, strengthening the representation of geometric relationships from local to global. On the probabilistic modeling side, we build a progressive bitwise target occupancy predictor that uses a conditional autoregressive strategy: each 8-bit occupancy code is decomposed into four sub-codes, and the probability estimate is refined progressively from the most significant bits to the least significant bits, which improves spatial context utilization and bit-level discrimination accuracy. Experiments on the KITTI and Ford datasets show that, at comparable reconstruction quality, SACM-PCC reduces the bitrate on KITTI by approximately 57%, 21%, and 8.7% relative to Draco, G-PCCv23, and RENO, respectively, and on Ford by approximately 54%, 21.7%, and 9%. These results indicate that the proposed method achieves a better rate–distortion trade-off across the full bitrate range while maintaining stable geometric reconstruction performance in complex scenes.
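
The progressive bitwise decomposition described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not state the width of each sub-code, so the sketch assumes four equal 2-bit sub-codes; the function names (split_occupancy, merge_occupancy, uniform_predictor, ideal_bits) are hypothetical, and the uniform predictor is only a placeholder for a learned conditional model.

```python
import math


def split_occupancy(code):
    """Split an 8-bit occupancy code into four sub-codes, most significant first.

    Assumption: each sub-code is 2 bits wide (the abstract does not say).
    """
    if not 0 <= code <= 0xFF:
        raise ValueError("occupancy code must fit in 8 bits")
    return [(code >> shift) & 0b11 for shift in (6, 4, 2, 0)]


def merge_occupancy(sub_codes):
    """Inverse of split_occupancy: rebuild the 8-bit code from four sub-codes."""
    code = 0
    for sub in sub_codes:
        code = (code << 2) | sub
    return code


def uniform_predictor(prefix):
    """Placeholder for a learned model: P(sub-code | earlier sub-codes).

    Ignores the prefix and returns a uniform distribution over the 4 values.
    """
    return [0.25, 0.25, 0.25, 0.25]


def ideal_bits(code, predictor=uniform_predictor):
    """Ideal code length in bits under the chain-rule factorization.

    Sub-codes are visited from most to least significant; each one is
    predicted conditioned on the sub-codes already visited.
    """
    prefix = []
    bits = 0.0
    for sub in split_occupancy(code):
        probs = predictor(prefix)
        bits += -math.log2(probs[sub])
        prefix.append(sub)
    return bits


if __name__ == "__main__":
    code = 0b10110100
    print(split_occupancy(code))
    print(merge_occupancy(split_occupancy(code)) == code)
    print(ideal_bits(code))
```

With the uniform placeholder, every occupancy code costs exactly 8 bits. Any bitrate saving comes from a predictor that concentrates probability on the correct sub-code given the spatial context and the sub-codes already decoded.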

References

[1] Roriz R, Silva H, Dias F, et al. A survey on data compression techniques for automotive lidar point clouds[J]. Sensors, 2024, 24(10): 3185.

[2] You K, Chen T, Ding D, et al. Reno: Real-time neural compression for 3d lidar point clouds[C]//Proceedings of the Computer Vision and Pattern Recognition Conference. 2025: 22172-22181.

[3] Galligan F, Hemmer M, Stava O, et al. Google/draco: a library for compressing and decompressing 3d geometric meshes and point clouds[J]. Draco: a library for compressing and decompressing 3D geometric meshes and point clouds, 2018.

[4] Schwarz S, Preda M, Baroncini V, et al. Emerging MPEG standards for point cloud compression[J]. IEEE Journal on Emerging and Selected Topics in Circuits and Systems, 2019, 9(1): 133-148.

[5] Huang L, Wang S, Wong K, et al. Octsqueeze: Octree-structured entropy model for lidar compression[C]//Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2020: 1313-1323.

[6] Fu C, Li G, Song R, et al. Octattention: Octree-based large-scale contexts model for point cloud compression[C]//Proceedings of the AAAI conference on artificial intelligence. 2022, 36(1): 625-633.

[7] Jin Y, Zhu Z, Xu T, et al. Ecm-opcc: Efficient context model for octree-based point cloud compression[C]//ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2024: 7985-7989.

[8] Song R, Fu C, Liu S, et al. Efficient hierarchical entropy model for learned point cloud compression[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2023: 14368-14377.

[9] Wang J, Ding D, Li Z, et al. Sparse tensor-based multiscale representation for point cloud geometry compression[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 45(7): 9055-9071.

[10] Xue R, Wang J, Ma Z. Efficient LiDAR point cloud geometry compression through neighborhood point attention[J]. arXiv preprint arXiv:2208.12573, 2022.

[11] Wang J, Xue R, Li J, et al. A versatile point cloud compressor using universal multiscale conditional coding–Part I: Geometry[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024.

[12] Fan T, Gao L, Xu Y, et al. Multiscale latent-guided entropy model for lidar point cloud compression[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2023, 33(12): 7857-7869.

[13] Lodhi M A, Pang J, Tian D. Sparse convolution based octree feature propagation for lidar point cloud compression[C]//ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2023: 1-5.

[14] Stathoulopoulos N, Saucedo M A V, Koval A, et al. RecNet: An invertible point cloud encoding through range image embeddings for multi-robot map sharing and reconstruction[C]//2024 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2024: 4883-4889.

[15] Wang S, Jiao J, Cai P, et al. R-pcc: A baseline for range image-based point cloud compression[C]//2022 International Conference on Robotics and Automation (ICRA). IEEE, 2022: 10055-10061.

[16] Zhou X, Qi C R, Zhou Y, et al. Riddle: Lidar data compression with range image deep delta encoding[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2022: 17212-17221.

[17] Geiger A, Lenz P, Urtasun R. Are we ready for autonomous driving? the kitti vision benchmark suite[C]//2012 IEEE conference on computer vision and pattern recognition. IEEE, 2012: 3354-3361.

[18] Pandey G, McBride J R, Eustice R M. Ford campus vision and lidar data set[J]. The International Journal of Robotics Research, 2011, 30(13): 1543-1552.

Published

29-01-2026

Section

Articles

How to Cite

He, Y., Yan, C., & Chen, Z. (2026). Structure-Aware and Context-Modeling Point Cloud Compression. Journal of Computing and Electronic Information Management, 20(1), 12-17. https://doi.org/10.54097/bwanx125