Improved HMM by Deep Learning for Ear Classification
Keywords:
Improved SHMM, convolutional layers, Max-pooling, Ear Classification, Inverse Weighted Average K-means clustering, Deviance Information CriterionAbstract
Ear recognition is one of the most relevant applications of image analysis. It’s a true challenge to build an automated system which exceeds human ability to recognize ears. Humans do not identify the ears ordinarily, so we are not skilled when we must deal with a large number of unknown ears. The modern computers, with an almost limitless memory and computational speed, should overcome humans' limitations. This work uses Ear classification problem to improve Deviance Information Criterion- Structural Hidden Markov Model (DIC-SHMM) by Convolutional Neural Network (CNN). HMM is a strong model for large size of features. While the CNN, as a most important technology for deep learning, used for image classification, to recognize persons by their ears as a one of unique biometric physiological characteristics. Three systems will be used to classify ear images, deep learning for the original image directly, deep learning for eigenvector as Principle Components Analysis (PCA) of the original image to compare them with proposed combining convolution layers of CNN with improved HMM for the original Image. The proposed system shows the best correction rate as 97.5%.
Downloads
References
L.R. Rabiner, “A tutorial on hidden markov models and selected applications in speech recognition", IEEE Proc., vol. 77, no. 2, 1989, pp. 257–286.
M. El-Hanjouri, W.Alkhaldi, N.Hamdy, O.Abdel Alim, “Heart Diseases Diagnosis Using HMM,” IEEE MELECON, pp. 489-491. 2002.
A.V. Nefian, M.H. Hayes, “Maximum likelihood training of the embedded HMM for face detection and recognition”, Proc. Of the IEEE International Conference on Image Processing, ICIP 2000, Vol. 1, 10-13 September 2000, Vancouver, BC, Canada, pp. 33-36
Mohammed Alhanjouri, Hana Hejazi, “DIC Structural HMM based IWAK-Means to Enclosed Face Data”, International Journal of Computer Applications, Volume 18, No.4, (0975–8887), March 2011
Nefian A., "Embedded Bayesian networks for face recognition", Proc. of the IEEE International Conference on Multimedia and Expo, Vol. 2, , Lusanne, Switzerland, 26-29 August 2002, pp. 133-136, 2002.
Le H., Li H., "Recognizing frontal face images using hidden Markov models with one training image per person", ICPR, vol. 1, p: 318–321, 2004.
A. Pflug, C. Busch, A. Ross, “2D ear classification based on unsupervised clustering”, IEEE International Joint Conference on Biometrics (IJCB), 2014, FL, USA, 29 Sept.-2 Oct. 2014.
B. Bhagabati, K. Sarma, “Application of Face Recognition Techniques in Video for Biometric Security: A Review of Basic Methods and Emerging Trends”, Chapter: 19, Handbook of Research on Modern Cryptographic Solutions for Computer and Cyber Security, 1st Edition, IGI Global, June 2016.
Mohammed Alhanjouri, “Curvelet and Waveatom Transforms Based Feature Extraction for Face Detection", Alaqsa University Journal, Vol. 15, No.1.
A. Das and D. Ghoshal, " Human Skin Region Segmentation Based on Chrominance Component using Modified Watershed Algorithm", Twelfth International Multi-Conference on Information Processing, Procedia Computer Science 89 ( 2016 ) 856 – 863
B. Banchhor, T. Sharma, “Hybrid Approach for Face Detection Using Skin Color Based Segmentation and Edge Detection”, International Journal of Advanced Research in Computer and Communication Engineering, Vol. 3, Issue 6, June 2014.
M. D. Zeiler, R. Fergus, "Visualizing and understanding convolutional Networks", In Computer Vision–ECCV Springer 2014, 818–833.
L. Chen, "Pedestrian Detection Based on Deep Learning", Master Thesis in Electrical Engineering, University of California Riverside, September 2017.
D. A. Mancevo, "Compressing Deep Convolutional Neural Networks", Master thesis in Machine Learning, KTH Royal Institute of Technology, June 2017.
S. Basu, S. Mukhopadhyay, M. Karki, R. DiBiano, S. Ganguly, R. Nemani, S. Gayaka, "Deep neural networks for texture classification -A theoretical analysis", Journal of Neural Networks, Elsevier,
January 2018
P. Baldi, P. Sadowski, Z. Lu, "Learning in the machine: The symmetries of the deep learning channel", Journal of Neural Networks, Elsevier, November 2017
G. Benjamin, “Fractional max-pooling." arXiv preprint arXiv:1412.6071 (2014).
E. Gonzalez, L. Alvarez, L. Mazorra, “AMI EAR DATABASE”, Universidad De Las Palmas De G.C.,