Hierarchical YOLOv5 Detection and ResNet Recognition Pipeline for Degraded Heritage Character Imagery

Junfeng Dan; Yanhao Fan; Guo Chen; Zicheng Meng; Hongbing Zhu

doi:10.54097/03w4j441

Keywords:

YOLOv5 Object Detection, ResNet Deep Recognition, Morphological Opening Denoising, Attention-Based GAN Restoration, Multi-Scale Convolutional Features, Cross-Domain Transfer Learning

Abstract

Character recognition on heavily degraded historical imagery presents three intertwined challenges: heterogeneous noise patterns arising from the underlying physical substrate, scarce labeled training data in the target domain, and small character instances embedded in cluttered backgrounds. This paper develops a four-stage pipeline that integrates adaptive image denoising, hierarchical YOLOv5 detection, multi-scale convolutional feature analysis, and ResNet recognition with cross-domain transfer learning. The denoising stage combines mean filtering with morphological opening to suppress impulse noise and blob-like artifacts, and uses an attention-based generative adversarial network to remove residual texture artifacts. The detection stage trains YOLOv5 with an initial learning rate of 0.01 and achieves a test-set precision of 64.10% and a recall of 49.50%. The loss function stabilizes at 0.04 once training converges, and the model outperforms YOLOv2, YOLOv3, Fast R-CNN, and R-CNN baselines on intersection-over-union, recall, and average precision. Multi-scale feature visualization across convolutional output layers confirms that the detection model recovers character locations in 200 benchmark images and produces region coordinate vectors with high coverage. The recognition stage achieves over 70% accuracy under in-domain training, and adding an external auxiliary dataset through transfer learning substantially improves that accuracy. Sensitivity analysis under additive noise confirms that the pipeline remains robust across degradation levels.
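
The classical part of the denoising stage (mean filtering combined with morphological opening) can be illustrated with a minimal sketch. It is not the authors' implementation: the 3x3 window, the border clamping, the order of the two operations (mean filter first, then opening), and the function names `mean_filter`, `opening`, and `denoise` are all assumptions made for illustration, and the attention-based GAN step is not shown. Images are plain nested lists of grayscale values so that the example needs no third-party libraries.

```python
def _window_filter(img, reduce_fn, radius=1):
    """Apply reduce_fn to every (2r+1)x(2r+1) window; borders are clamped."""
    h, w = len(img), len(img[0])
    out = []
    for y in range(h):
        row = []
        for x in range(w):
            window = [
                img[min(max(y + dy, 0), h - 1)][min(max(x + dx, 0), w - 1)]
                for dy in range(-radius, radius + 1)
                for dx in range(-radius, radius + 1)
            ]
            row.append(reduce_fn(window))
        out.append(row)
    return out


def mean_filter(img, radius=1):
    """Replace each pixel with the average of its neighborhood."""
    return _window_filter(img, lambda win: sum(win) / len(win), radius)


def opening(img, radius=1):
    """Grayscale opening: erosion (local min) followed by dilation (local max).

    Bright structures smaller than the window are removed, while larger
    bright regions keep their shape.
    """
    eroded = _window_filter(img, min, radius)
    return _window_filter(eroded, max, radius)


def denoise(img, radius=1):
    """Assumed ordering: smooth with the mean filter, then apply opening."""
    return opening(mean_filter(img, radius), radius)


if __name__ == "__main__":
    # Hypothetical 7x7 test image: black background with one bright speck.
    noisy = [[0] * 7 for _ in range(7)]
    noisy[3][3] = 255
    cleaned = denoise(noisy)
    print(max(max(row) for row in cleaned))
```

In this sketch, opening on its own removes an isolated bright pixel entirely, whereas a bright region at least as large as the window passes through unchanged. This is the behavior that makes opening suited to impulse noise and small blob-like artifacts. A production pipeline would more likely rely on an image-processing library than on nested loops.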
