Hierarchical YOLOv5 Detection and ResNet Recognition Pipeline for Degraded Heritage Character Imagery
DOI:
https://doi.org/10.54097/03w4j441Keywords:
YOLOv5 Object Detection, ResNet Deep Recognition, Morphological Opening Denoising, Attention-Based GAN Restoration, Multi-Scale Convolutional Features, Cross-Domain Transfer LearningAbstract
Character recognition on heavily degraded historical imagery presents three intertwined challenges: heterogeneous noise patterns generated by the underlying physical substrate, scarce labeled training data within the target domain, and small character instances embedded in cluttered backgrounds. This paper develops a four-stage pipeline that integrates adaptive image denoising, hierarchical YOLOv5 detection, multi-scale convolutional feature analysis, and ResNet recognition with cross-domain transfer learning. The denoising stage combines mean filtering with morphological opening to suppress impulse noise and blob-like artifacts, complemented by an attention-based generative adversarial network for residual texture artifacts. The detection stage trains YOLOv5 with initial learning rate 0.01, achieving test-set precision of 64.10% and recall of 49.50%, with the loss function stabilizing at 0.04 after sufficient iterations and outperforming YOLOv2, YOLOv3, Fast R-CNN, and R-CNN baselines on intersection-over-union, recall, and average precision metrics. Multi-scale feature visualization across convolutional output layers confirms that the detection model recovers character locations across 200 benchmark images and produces region coordinate vectors with high coverage. The recognition stage achieves over 70% accuracy under in-domain training, and incorporation of an external auxiliary dataset through transfer learning substantially improves recognition accuracy. Sensitivity analysis under additive noise confirms pipeline robustness across degradation levels.
Downloads
References
[1] S. Raza, M. Farooq, U. Farooq, H. Karamti, T. Khurshaid and I. Ashraf, "A convolutional neural network based optical character recognition for purely handwritten characters and digits", Computers, Materials, & Continua, 2025, Vol. 84 (2), p3149
[2] M. R. Al-Maamari, R. Ramteke, A. M. Al-Hejri and S. S. Alshamrani, "Integrating CNN and transformer architectures for superior Arabic printed and handwriting characters classification", Scientific Reports, 2025, Vol. 15 (1), p29936
[3] M. Ayadi, N. Masmoudi, L. Almuqren, H. Saeed Alshahrani and R. Oudah Aljohani, "Designing a novel CNN-LSTM-based model for Arabic handwritten character recognition for the visually impaired person", Journal of Disability Research, 2025, Vol. 4 (1), p20240080
[4] K. Manoj and M. Iyapparaja, "Tamil handwritten character recognition: A comprehensive review of recent innovations and progress", Algorithms, 2025, Vol. 16 (8)
[5] T. Al Mindeel, E. Spentzou and M. Eftekhari, "Energy, thermal comfort, and indoor air quality: Multi-objective optimization review", Renewable and Sustainable Energy Reviews, 2024, Vol. 202, p114682
[6] B. Wu, Z. Cai, W. Wu and X. Yin, "AoI-aware resource management for smart health via deep reinforcement learning", IEEE Access, 2023, Vol. 11, p81180-81195
[7] M. A. M. Alhassan and E. Yılmaz, "Evaluating YOLOv4 and YOLOv5 for enhanced object detection in UAV-based surveillance", Processes, 2025, Vol. 13 (1), p254
[8] B. Wu and W. Wu, "Model-free cooperative optimal output regulation for linear discrete-time multi-agent systems using reinforcement learning", Mathematical Problems in Engineering, 2023, p6350647
[9] A. Sharba and H. Kanaan, "Improving tiny object detection in aerial images with YOLOv5", Journal of Engineering and Sustainable Development, 2025, Vol. 29 (1), p57-67
[10] B. T. Lieu, C. K. Nguyen, H. L. Nguyen and T. H. Le, "Enhanced small-object detection in UAV images using modified YOLOv5 model", IET Image Processing, 2025, Vol. 19 (1), pe70121
[11] H. Wu and Y. Cao, "Complementary phase interleaving-based fringe order recognition for temporal phase unwrapping", Pattern Recognition, 2025, Vol. 157, p110937
[12] O. Zafar, Y. Cohen, L. Wolf and I. Schwartz, "Detection-driven object count optimization for text-to-image diffusion models", Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2026, p1885-1894
[13] M. Trigka and E. Dritsas, "A comprehensive survey of machine learning techniques and models for object detection", Sensors, 2025, Vol. 25 (1), p214
[14] E. Edozie, A. N. Shuaibu, U. K. John and B. O. Sadiq, "Comprehensive review of recent developments in visual object detection based on deep learning", Artificial Intelligence Review, 2025, Vol. 58 (9), p277
[15] Y. Xiuwu, J. Shiqi and L. Yong, "Non-uniform WSN clustering routing protocol based on non-cooperative game", Wireless Personal Communications, 2025, Vol. 140 (1), p561-590
[16] PV. Pagire, M. Chavali and A. Kale, "A comprehensive review of object detection with traditional and deep learning methods", Signal Processing, 2025, Vol. 237, p110075
[17] B. A. Nguyen, M. B. Kha, D. M. Dao, H. K. Nguyen, M. D. Nguyen, T. V. Nguyen and T. L. Dang, "UFR-GAN: A lightweight multi-degradation image restoration model", Pattern Recognition Letters, 2025
[18] B. Wu, J. Huang, Q. Duan, L. Dong and Z. Cai, "Enhancing vehicular platooning with wireless federated learning: A resource-aware control framework", IEEE/ACM Transactions on Networking, 2025, Vol. 33 (1), p1-16
[19] D. P. Bertsekas, "Dynamic programming and optimal control", Athena Scientific, Belmont, MA, 4th ed., 2017, Vol. 1
[20] B. Wu, J. Huang and Q. Duan, "FedTD3: An accelerated learning approach for UAV trajectory planning", Proc. Int. Conf. on Wireless Artificial Intelligent Computing Systems and Applications (WASA), 2025, p13-24
[21] J. H. Lee, M. Kim, S. Lee and C. Kang, "GAN-based image restoration for enhancing object detection in projector-camera systems", IEEE Access, 2025
[22] A. Bechar, R. Medjoudj, Y. Elmir, Y. Himeur and A. Amira, "Federated and transfer learning for cancer detection based on image analysis", Neural Computing and Applications, 2025, Vol. 37 (4), p2239-2284
[23] B. Wu, J. Huang and Q. Duan, "Real-time intelligent healthcare enabled by federated digital twins with AoI optimization", IEEE Network, 2025, p1
[24] V. Giglioni, J. Poole, R. Mills, I. Venanzi, F. Ubertini and K. Worden, "Transfer learning in bridge monitoring: Laboratory study on domain adaptation for population-based SHM of multispan continuous girder bridges", Mechanical Systems and Signal Processing, 2025, Vol. 224, p112151
[25] D. Pan, B.-N. Wu, Y.-L. Sun and Y.-P. Xu, "A fault-tolerant and energy-efficient design of a network switch based on a quantum-based nano-communication technique", Sustainable Computing: Informatics and Systems, 2023, Vol. 37, p100827
[26] B. Wu, Z. Ding and J. Huang, "A review of continual learning in edge AI", IEEE Transactions on Netw
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Journal of Computing and Electronic Information Management

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.








