Underwater Fish Image Generation Based on Diffusion Model

Authors

  • Xinyu Wang

DOI:

https://doi.org/10.54097/gfxrqj93

Keywords:

Underwater fish, Image generation, Diffusion model, U-Net

Abstract

Fish play a vital role in the ecosystems of rivers and lakes, and the application of deep learning-based intelligent identification methods offers an efficient and accurate approach to water environment regulation. However, the random spatial distribution of underwater fish complicates the collection of extensive real-image datasets. This scarcity of images diminishes the generalization capability of deep learning models, thereby limiting their practical applicability. To address this challenge, we propose a diffusion model-based approach that involves pre-training on an open-source underwater fish dataset and subsequently generating realistic, diverse underwater fish images from pure noise. The generated images achieved 6.8070, 9.1829 and 26.3132 on the evaluation metrics NIQE, PIQE and BRISQUE, respectively.

Downloads

Download data is not yet available.

References

[1] Van Den Oord A, Dieleman S, Zen H, et al. Wavenet: A generative model for raw audio[J]. arXiv preprint arXiv:1609.03499, 2016, 12.

[2] Goodfellow I J, Pouget-Abadie J, Mirza M, et al.Generative Adversarial Nets[J]. Nips'14, 2014: 2672–2680.

[3] Dinh L, Krueger D, Bengio Y.Nice: Non-linear independent components estimation [J].arXiv preprint arXiv:1410.8516, 2014.

[4] Kingma D P, Welling M.Auto-encoding variational bayes [J].arXiv preprint arXiv:1312.6114, 2013.

[5] Ho J, Jain A, Abbeel P. Denoising Diffusion Probabilistic Models [J]. ArXiv, 2020, abs/2006.11239.

[6] Song J, Meng C, Ermon S. Denoising Diffusion Implicit Models [J]. ArXiv, 2020, abs/2010.02502.

[7] Dhariwal P, Nichol A. Diffusion Models Beat GANs on Image Synthesis [J]. ArXiv, 2021, abs/2105.05233.

[8] Ho J. Classifier-Free Diffusion Guidance [J]. ArXiv, 2022, abs/2207.12598.

[9] Liu X, Park D H, Azadi S, et al. More Control for Free! Image Synthesis with Semantic Diffusion Guidance [J]. IEEE/CVF Winter Conference on Applications of Computer Vision, 2021: 289-99.

[10] GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models

[11] Ronneberger O, Fischer P, Brox T. U-Net: Convolutional Networks for Biomedical Image Segmentation [J]. ArXiv, 2015, abs/1505.04597.

Downloads

Published

30-04-2025

Issue

Section

Articles

How to Cite

Wang, X. (2025). Underwater Fish Image Generation Based on Diffusion Model. Journal of Computing and Electronic Information Management, 16(3), 44-47. https://doi.org/10.54097/gfxrqj93