Latent Progressive Diffusion Model for Super-Resolution Reconstruction of Oil and Gas Well Perforation Images
DOI:
https://doi.org/10.54097/pe7qkr69Keywords:
Oil and gas well perforation images, Image super-resolution, Latent diffusion model, Progressive training, Hierarchical feature enhancement, Variational autoencoderAbstract
To address the challenges in super-resolution (SR) reconstruction of oil and gas well perforation images—such as excessive GPU memory consumption during high-resolution training, susceptibility to losing complex textural details, severe artifact generation, low computational efficiency of the original Denoising Diffusion Probabilistic Model (DDPM) in pixel space, and the failure of naive progressive diffusion to resolve computational costs from a dimensional perspective—this paper proposes a latent progressive denoising diffusion probabilistic model for super-resolution (LP-DDPMSR). The model integrates the dimensional compression advantages of the Latent Diffusion Model (LDM) with the staged training strategy of progressive diffusion. By mapping perforation images into a low-dimensional latent space via a pre-trained Variational Autoencoder (VAE) for the diffusion process, the approach fundamentally alleviates the GPU memory burden caused by pixel-level computations. A Latent Perforation Hierarchical Feature Enhancement Sub-Network (L-PFEN) is designed to specifically optimize the latent representation of high-frequency and low-frequency features in perforation images. The Latent Perforation-Adapted U-Net (L-PA-U-Net) is enhanced by embedding a Time-Aware adaptive Cross-Attention (TACA) mechanism, which strengthens feature modeling of critical perforation regions within the latent space. Furthermore, a three-stage progressive training framework in the latent space is established to achieve a smooth resolution escalation from 64×64 to 256×256. Experimental results demonstrate that, under 2×, 4×, and 8× magnification factors for oil and gas well perforation images, LP-DDPMSR outperforms Bicubic, ESRGAN, DDPMSR, and LDM-based methods in terms of Peak Signal-to-Noise Ratio (PSNR) and Structural Similarity Index (SSIM). At 8× magnification, the model achieves a PSNR of 20.83 dB, an SSIM of 0.8217, and a Fréchet Inception Distance (FID) of 45.29. Compared to the pixel-space progressive model DDPMSR, LP-DDPMSR reduces training GPU memory consumption by 68.5% and improves training efficiency by 62.3%. While ensuring high-fidelity restoration of perforation image details, the model significantly enhances computational efficiency, providing more effective technical support for oil and gas well perforation quality assessment.
Downloads
References
[1] Liu N. Research on visualization perforation analysis and fracturing evaluation methods for oil and gas wells [D]. Xi'an: Xi'an Shiyou University, 2023. DOI: 10.27400/d.cnki.gxasc.2023.000371. (in Chinese).
[2] Xu G Y, Wu S Y. Infrared and visible image fusion based on progressive multi-scale feature extraction and fusion [J]. Journal of Qilu University of Technology, 2026, 40(1): 45-56. DOI: 10.16442/j.cnki.qlgydxxb.2026.01.006. (in Chinese).
[3] Du R S, Mu W X, Meng L D. Super-resolution reconstruction of rock thin section images based on diffusion model [J]. Computer Systems & Applications, 2026, 35(2): 132-140. DOI: 10.15888/j.cnki.csa.010059. (in Chinese).
[4] Saharia C, Ho J, Chan W, et al. Image super-resolution via iterative refinement [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 45(4): 4713-4726.
[5] Hu X Y. Research on image super-resolution reconstruction method based on edge enhancement [D]. Nanchang: Jiangxi University of Finance and Economics, 2021. DOI: 10.27175/d.cnki.gjxcu.2021.000259. (in Chinese)
[6] Xu X Y, Zhang M F. Remote sensing image super-resolution algorithm integrating LR encoding network and diffusion model [J]. Computer Engineering and Applications, 2024, 60(22): 271-281. (in Chinese)
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Journal of Computing and Electronic Information Management

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.








