Object Detection in Last Decade - A Survey*

Usama Arshad

doi:10.15294/sji.v8i1.28956

Object Detection in Last Decade - A Survey*

Usama Arshad⁽¹⁾,

DOI: https://doi.org/10.15294/sji.v8i1.28956

(1) Comsats University Islamabad

Abstract

Purpose: In the last decade, object detection is one of the interesting topics that played an important role in revolutionizing the present era. Especially when it comes to computer vision, object detection is a challenging and most fundamental problem. Researchers in the last decade enhanced object detection and made many advanced discoveries using technological advancements.Â Methods: This research work describes the advancements in object detection over the last 10 years (2010-2020). Different papers published in last 10 years related to object detection and its types are discussed with respect to their role in the advancement of object detection.Â Result: This research work also describes different types of object detection, which include text detection, face detection etc. It clearly describes the changes in object detection techniques over the period of last 10 years. Object detection is divided into two groups. General detection and Task-based detection. General detection is discussed chronologically and with its different variants while task-based detection includes many state-of-the-art algorithms and techniques according to tasks. This paper also described the basic comparison of how some algorithms and techniques have been updated and played a major role in advancements of different fields related to object detection.Â Novelty: This research concludes that the most important advancements that happened in last decade and future is promising much more advancement in object detection on the basis of work done in this decade.

Keywords

Computer Vision, Object Detection, Text Detection, Face Detection, YOLO, RCNN, Fast RCNN

Full Text:

PDF

References

R. Szeliski, Computer vision: algorithms and applications. Springer, 2010.

R. Sathya and A. Abraham, â€œComparison of Supervised and Unsupervised Learning Algorithms for Pattern Classification,â€ Int. J. Adv. Res. Artif. Intell., vol. 2, no. 2, pp. 34â€“38, 2013.

L. Chen, J. Hoey, C. D. Nugent, D. J. Cook, and Z. Yu, â€œSensor-based activity recognition,â€ IEEE Trans. Syst. Man Cybern. Part C Appl. Rev., vol. 42, no. 6, pp. 790â€“808, 2012.

Q. You, H. Jin, Z. Wang, C. Fang, and J. Luo, â€œImage captioning with semantic attention,â€ Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., pp. 4651â€“4659, 2016.

F. Ahmad, A. Najam, and Z. Ahmed, â€œImage-based Face Detection and Recognition: â€˜State of the Art,â€™â€ Proc. IEEE Conf. Comput. Vis. pattern Recognit., pp. 3â€“6, 2013.

M. Mehta, C. Goyal, M. C. Srivastava, and R. C. Jain, â€œReal time object detection and tracking: Histogram matching and Kalman filter approach,â€ 2010 2nd Int. Conf. Comput. Autom. Eng. ICCAE 2010, vol. 5, pp. 796â€“801, 2010.

R. Girshick, J. Donahue, T. Darrell, and J. Malik, â€œRich feature hierarchies for accurate object detection and semantic segmentation,â€ Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., pp. 580â€“587, 2014.

R. Girshick, â€œFast R-CNN,â€ Proc. IEEE Int. Conf. Comput. Vis., pp. 1440â€“1448, 2015.

S. Ren, K. He, R. Girshick, and J. Sun, â€œFaster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks,â€ IEEE Trans. Pattern Anal. Mach. Intell., vol. 39, no. 6, pp. 1137â€“1149, 2017.

J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, â€œYou only look once: Unified, real-time object detection,â€ Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., pp. 779â€“788, 2016.

Z. Li, C. Peng, G. Yu, X. Zhang, Y. Deng, and J. Sun, â€œLight-head R-CNN: In defense of two-stage object detector,â€ arXiv, pp. 1â€“9, 2017.

M. Najibi, P. Samangouei, R. Chellappa, and L. S. Davis, â€œSSH: Single Stage Headless Face Detector,â€ Proc. IEEE Int. Conf. Comput. Vis., pp. 4885â€“4894, 2017.

J. G. Andrews et al., â€œWhat will 5G be?,â€ IEEE J. Sel. Areas Commun., vol. 32, no. 6, pp. 1065â€“1082, 2014.

N. Dvornik, J. Mairal, and C. Schmid, â€œModeling visual context is key to augmenting object detection datasets,â€ Proc. Eur. Conf. Comput. Vis., pp. 364â€“380, 2018.

T. Yi-Lin et al., â€œMicrosoft COCO,â€ Eur. Conf. Comput. Vis., pp. 740â€“755, 2014.

Y. Xiang, R. Mottaghi, and S. Savarese, â€œBeyond PASCAL: A benchmark for 3D object detection in the wild,â€ IEEE Winter Conf. Appl. Comput. Vis., pp. 75â€“82, 2014.

A. Krizhevsky, I. Sutskever, and G. E. Hinton, â€œImageNet Classification with Deep Convolutional Neural Networks Alex,â€ Adv. Neural Inf. Process. Syst., pp. 1097â€“1105, 2012.

A. Kuznetsova et al., â€œThe Open Images Dataset V4: Unified Image Classification, Object Detection, and Visual Relationship Detection at Scale,â€ Int. J. Comput. Vis., vol. 128, no. 7, pp. 1956â€“1981, 2020.

K. Choi and J. Yun, â€œRobust and Fast Moving Object Detection in A Non-Stationary Camera Via Foreground Probability Based Sampling,â€ IEEE Int. Conf. Image Process., pp. 4897â€“4901, 2015.

H. J. Yoo, â€œDeep Convolution Neural Networks in Computer Vision: a Review,â€ IEIE Trans. Smart Process. Comput., vol. 4, no. 1, pp. 35â€“43, 2015.

O. Russakovsky et al., â€œImageNet Large Scale Visual Recognition Challenge,â€ Int. J. Comput. Vis., vol. 115, no. 3, pp. 211â€“252, 2015.

J. Dai, Y. Li, K. He, and J. Sun, â€œR-fcn: Object detection via region-based fully convolutional networks,â€ Adv. Neural Inf. Process. Syst., pp. 379â€“387, 2016.

K. He, X. Zhang, S. Ren, and J. Sun, â€œSpatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition,â€ IEEE Trans. Pattern Anal. Mach. Intell., vol. 37, no. 9, pp. 1904â€“1916, 2015.

E. Jahani Heravi, H. Habibi Aghdam, and D. Puig, â€œAn optimized convolutional neural network with bottleneck and spatial pyramid pooling layers for classification of foods,â€ Pattern Recognit. Lett., vol. 105, pp. 50â€“58, 2018.

H. Li, P. Xiong, J. An, and L. Wang, â€œDynamic attention network for semantic segmentation,â€ Neurocomputing, vol. 384, pp. 182â€“191, 2018.

K. He, G. Gkioxari, P. DollÃ¡r, and R. Girshick, â€œMask R-CNN,â€ IEEE Trans. Pattern Anal. Mach. Intell., vol. 42, no. 2, pp. 386â€“397, 2020.

W. S. Lai, J. Bin Huang, N. Ahuja, and M. H. Yang, â€œDeep laplacian pyramid networks for fast and accurate super-resolution,â€ Proc. - 30th IEEE Conf. Comput. Vis. Pattern Recognition, CVPR 2017, vol. 2017-January, pp. 5835â€“5843, 2017.

T. Y. Lin, P. DollÃ¡r, R. Girshick, K. He, B. Hariharan, and S. Belongie, â€œFeature pyramid networks for object detection,â€ Proc. - 30th IEEE Conf. Comput. Vis. Pattern Recognition, CVPR 2017, vol. 2017-January, pp. 936â€“944, 2017.

Q. Zhao et al., â€œM2Det: A single-shot object detector based on multi-level feature pyramid network,â€ Thirty-Third AAAI Conf. Artif. Intell., 2018.

T. Kong, F. Sun, W. Huang, and H. Liu, â€œDeep Feature Pyramid Reconfiguration for Object Detection,â€ Eur. Conf. Comput. Vis., vol. 1, pp. 172â€“188, 2018.

V. Ruzicka and F. Franchetti, â€œFast and accurate object detection in high resolution 4K and 8K video using GPUs,â€ IEEE High Perform. Extrem. Comput. Conf., 2018.

R. Huang, J. Pedoeem, and C. Chen, â€œYOLO-LITE: A Real-Time object detection algorithm optimized for non-GPU computers,â€ IEEE Int. Conf. Big Data, pp. 2503â€“2510, 2018.

J. Redmon and A. Farhadi, â€œYOLO9000: Better, faster, stronger,â€ Proc. - 30th IEEE Conf. Comput. Vis. Pattern Recognition, CVPR 2017, vol. 2017-January, pp. 6517â€“6525, 2017.

J. Redmon and A. Farhadi, â€œYOLO v.3,â€ arXiv:1804.02767, pp. 1â€“6, 2018.

W. Liu, D. Anguelov, D. Erhan, and C. Szegedy, â€œSSD: Single Shot MultiBox Detector,â€ Eur. Conf. Comput. Vis., vol. 1, pp. 21â€“37, 2016.

W. Khan, N. Zaki, and L. Ali, â€œIntelligent Pneumonia Identification from Chest X-Rays: A Systematic Literature Review,â€ medRxiv, 2020.

Z. X. Li and F. Q. Zhou, â€œFSSD: Feature fusion single shot multibox detector,â€ arXiv:1712.00960, 2017.

L. Zheng, C. Fu, and Y. Zhao, â€œExtend the shallow part of single shot MultiBox detector via convolutional neural network,â€ arXiv:1801.05918, 2018.

R. Li and J. Yang, â€œImproved YOLOv2 Object Detection Model,â€ Int. Conf. Multimed. Comput. Syst. -Proceedings, vol. 2018-May, pp. 1â€“6, 2018.

E. Dong, Y. Zhu, Y. Ji, and S. Du, â€œAn improved convolution neural network for object detection using Yolov2,â€ IEEE Int. Conf. Mechatronics Autom., pp. 1184â€“1188, 2018.

Y. C. Lim and M. Kang, â€œObject Detection Using a Single Extended Feature Map,â€ IEEE Intell. Veh. Symp. Proc., vol. 2018-June, no. Iv, pp. 820â€“825, 2018.

H. Nakahara, Y. Haruyoshi, T. Fujii, and S. Sato, â€œA Lightweight YOLOv2: A Binarized CNN with A Parallel Support Vector Regression for an FPGA,â€ in Proc. 2018 ACM/SIGDA Int. Symp. Field-Program. Gate Arrays, 2018, pp. 31â€“40.

B. SchÃ¶lkopf and A. J. Smola, Learning With Kernels: Support Vector Machines, Regularization, Optimization, and Beyond. The MIT Press, 2018.

W. Hammedi, M. Ramirez-Martinez, P. Brunet, and S.-M. Senouci, â€œDeep Learning-Based Real-Time Object Detection in Inland Navigation,â€ IEEE Glob. Commun. Conf., pp. 1â€“6, 2019.

Refbacks

There are currently no refbacks.

Scientific Journal of Informatics (SJI)
p-ISSN 2407-7658 | e-ISSN 2460-0040
Published By Department of Computer Science Universitas Negeri Semarang
Website: https://journal.unnes.ac.id/nju/index.php/sji
Email: [email protected]

This work is licensed under a Creative Commons Attribution 4.0 International License.

Username
Password
Remember me