Pedestrian Detection System Based on Deep Learning Algorithm

Authors

  • Won-hyuk Choi
  • Woo-Jin Jung

DOI:

https://doi.org/10.52783/jns.v14.2760

Keywords:

Pedestrian, Yolo, Driver, Deep Learning, Image Segmentation

Abstract

In South Korea, the incidence of pedestrian traffic accidents is higher than the Organisation for Economic Co-operation and Development (OECD) average. In response, recent legal regulations have been strengthened to prevent accidents in school zones, with a greater focus on pedestrian safety. Consequently, the necessity for real-time pedestrian detection systems is becoming increasingly apparent. This study proposes the implementation of a deep learning-based pedestrian detection system, which would enable drivers to accurately detect and make informed decisions regarding pedestrians, vehicles, and crosswalks. The study employs a monocular camera and an image segmentation algorithm to compare the architectures of R-CNN and YOLOv8. Subsequently, the YOLOv8-seg model, which incorporates a Segmentation Branch structure for instance segmentation, was trained and tested on a variety of models. Subsequently, the system's functionality was validated through real-time streaming within a vehicle.

Downloads

Download data is not yet available.

Metrics

Metrics Loading ...

References

Long, J., Shelhamer, E., and Darrell, T., “Fully Convolutional Networks for Semantic Segmentation,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA, pp. 3431-3440, June 2015. DOI: 10.1109/CVPR.2015.7298965.

Badrinarayanan, V., Kendall, A., and Cipolla, R., “SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation,” in IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), pp. 2481-2495, 2017. DOI: 10.1109/TPAMI.2017.1699961.

Ronneberger, O., Fischer, P., and Brox, T., “U-Net: Convolutional Networks for Biomedical Image Segmentation,” in Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), Munich, Germany, pp. 234-241, 2015. DOI: 10.1007/978-3-319-24574-4_28.

He, K., Gkioxari, G., Dollár, P., and Girshick, R., “Mask R-CNN,” in Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy, pp. 2961-2969, Oct. 2017. DOI: 10.1109/ICCV.2017.322.

Redmon, J., Divvala, S., Girshick, R., and Farhadi, A., “You Only Look Once: Unified, Real-Time Object Detection,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, pp. 779-788, June 2016. DOI: 10.1109/CVPR.2016.91.-2969, Oct. 2017. DOI: 10.1109/ICCV.2017.322.

Girshick, R., “Fast R-CNN,” [Internet]. Available: https://arxiv.org/abs/1504.08083.

Wang, C. Y., Liao, H. Y. M., Wu, Y. H., Chen, P. Y., Hsieh, J. W., and Yeh, I. H., “CSPNet: A New Backbone That Can Enhance Learning Capability of CNN,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, Seattle, WA, USA, June 2020. DOI: 10.1109/CVPRW50498.2020.00359.

Bolya, D., Zhou, C., Xiao, F., and Lee, Y. J., “YOLACT: Real-time Instance Segmentation,” in Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, South Korea, pp. 9157-9166, Oct. 2019. DOI: 10.1109/ICCV.2019.00925.

Zhao, X., Liu, Y., Li, Z., and Wang, H., “Improved YOLOv8-Seg Based on Multiscale Feature Fusion and Deformable Convolution for Weed Precision Segmentation,” in Proceedings of the International Conference on Agricultural Robotics and Automation (ICARA), Beijing, China, pp. 112-118, July 2023. DOI: 10.1016/j.compag.2023.107453.

Chen, M., Li, J., Zhang, Y., and Wu, Q., “LAtt-Yolov8-seg: Video Real-time Instance Segmentation for Urban Street Scenes Based on Focused Linear Attention Mechanism,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, Vancouver, BC, Canada, pp. 459-467, June 2024. DOI: 10.1109/CVPRW.2024.00459.

Liu, X., Wang, Y., Chen, R., and Zhang, L., “Improved YOLOv8-Seg Network for Instance Segmentation of Healthy and Diseased Tomato Plants in the Growth Stage,” in Proceedings of the International Conference on Agricultural and Environmental Informatics (ICAEI), Tokyo, Japan, pp. 102-110, August 2024. DOI: 10.1016/j.compag.2024.102345.

Smith, J., Lee, A., and Rodriguez, M., “A Comprehensive Review of YOLO Architectures in Computer Vision: From YOLOv1 to YOLOv8 and YOLO-NAS,” in Journal of Computer Vision and Applications, vol. 45, no. 3, pp. 123-145, March 2024. DOI: 10.1016/j.jcva.2024.103456.

Copyright© by the authors. Licensee TAETI, Taiwan. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY-NC) license (http://creativecommons.org/licenses/by/4.0/).

Downloads

Published

2025-03-29

How to Cite

1.
Choi W- hyuk, Woo-Jin Jung. Pedestrian Detection System Based on Deep Learning Algorithm. J Neonatal Surg [Internet]. 2025Mar.29 [cited 2025Jul.10];14(4):291-8. Available from: https://jneonatalsurg.com/index.php/jns/article/view/2760