Detection of Human Behaviour by Object Recognition using Deep Learning: A Review
Keywords:
Deep Learning, Object Detection, Neural Network, and Object RecognitionAbstract
The major drawback of the society is the falsehood of human nature; so prediction of nature of the individual by analysing the video or image of that person is highly necessary. From the post World War II period due to the advancement of technology since the past few decades many countries have developed low cost cameras with high resolution. They generally use the RGB and depth features to enhance the image quality captured by the camera. Hence object recognition is the new branch of computer science which emerges. It has got a close relationship with image understanding and analysis of video; which encourages several researchers to work in this domain since past few years. As the branch of deep learning develops handful of efficient tools have been develop which prove to be highly efficient to learn high level deeper features of the image and semantics. In this paper a review on object recognition using deep learning has been discussed. This paper includes basics of deep learning and object recognition. Then we have discussed some tools required to perform object recognition as well as moving object recognition and finally some future goals of this subject have been discussed.
Downloads
References
P. F. Felzenszwalb, R. B. Girshick, D. Mcallester, and D. Ramanan, “Object detection with discriminatively trained part-based models,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 32, no. 9, p. 1627, 2010.
Object Detection with Deep Learning: A Review Zhong-Qiu Zhao, Member, IEEE, Peng Zheng, Shou-tao Xu, and Xindong Wu, Fellow, IEEE
LeCun, Y., Bengio, Y. and Hinton, G., 2015. Deep learning. nature, 521(7553), pp.436-444
D. G. Lowe, “Distinctive image features from scale-invariant keypoints,” Int. J. of Comput. Vision,vol. 60, no. 2, pp. 91–110, 2004.
N. Dalal and B. Triggs, “Histograms of oriented gradients for human detection,” in CVPR, 2005. 6. R. Lienhart and J. Maydt, “An extended set of haar – like features for rapid object detection,” in ICIP, 2002.
C. Cortes and V. Vapnik, “Support vector machine, ” Machine Learning, vol. 20, no. 3, pp. 273–297, 1995.
Y. Freund and R. E. Schapire, “A desicion theoretic generalization of on-line learning and an application to boosting,” J. of Comput. & Sys. Sci., vol. 13, no. 5, pp. 663–671, 1997.
M. Pandey and S. Lazebnik, "Scene recognition and weakly supervised object localization with deformable part-based models," 2011 International Conference on Computer Vision, Barcelona, 2011,
pp. 1307-1314
A. Krizhevsky, I. Sutskever, and G. E. Hinton, “Imagenet classification with deep convolutional neural networks,” in NIPS, 2012.
R. Girshick, J. Donahue, T. Darrell, and J. Malik, “Rich feature hierarchies for accurate object detection and semantic segmentation,” in CVPR, 2014.
J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, “You only look once: Unified, real-time object detection,” in CVPR, 2016.
Mandal, M., Kumar, L.K. and Saran, M.S., 2020. MotionRec: A Unified Deep Framework for Moving Object Recognition. In The IEEE Winter Conference on Applications of Computer Vision
(pp. 2734-2743).
R.Muralidharan, Dr.C.Chandrasekar, Object Recognition using SVM-KNN based on Geometric Moment Invariant,International Journal of Computer Trends and Technology-July to Aug Issue 2011 ISSN:2231-2803 http://www.internationaljournalssrg.org Page 215.
3D Object Recognition Using Convolutional Neural Networks with Transfer Learning Between Input Channels Luís A. Alexandre
B. Yang, J. Yan, Z. Lei, and S. Z. Li, “Aggregate channel features for multi-view face detection,” in IJCB, 2014.
X. Chen, K. Kundu, Y. Zhu, A. G. Berneshawi, H. Ma, S. Fidler, and R. Urtasun, “3d object proposals for accurate object class detection,” in NIPS, 2015.
K. Kang, H. Li, T. Xiao, W. Ouyang, J. Yan, X. Liu, and X. Wang, “Object detection in videos with tubelet proposal networks,” in CVPR, 2017.
S. Paisitkriangkrai, C. Shen, and A. van den Hengel, “Pedestrian detection with spatially pooled features and structured ensemble learning,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 38, pp. 1243–1257, 2016.
W. Byeon, T. M. Breuel, F. Raue, and M. Liwicki, “Scene labeling with lstm recurrent neural networks,” in CVPR, 2015