A Highly Robust Heterogenous Deep Ensemble Assisted Multi-Feature Learning Model for Diabetic Mellitus Prediction

doi:10.23940/ijpe.21.11.p3.926937

Abstract

Abstract: In the present work we propose a novel heterogeneous deep ensemble based multi-feature learning environment for diabetic mellitus prediction. The overall proposed model was designed in such manner that it addresses the key at hand problems like data or the class imbalance, low accuracy and lack of consensus. To achieve it, a multi-level enhancement approach where to address the problem of class-imbalance was performed, data sampling with 95% of confidence interval is performed. Different sampling approaches were applied such as random-sampling, down-sampling and synthetic minority oversampling technique (SMOTE). Once sample data is retrieved, we performed feature selection using different algorithms like Wilcoxon Significant Test, also called significant predictor test (SPR), Univariate Logistic Regression based feature selection (ULOGR), Cross-Correlation Analysis (CRA), Principle Component Analysis (PCA), Gini Score based significant feature selection (GSFR) and Information Gain based features (IGFR). The key purpose of applying different feature selection methods was to retain most suitable features for high accuracy with low computation. In the subsequent phase, we designed a first-of-its kind heterogenous deep ensemble model using Decision Tree (DT), Artificial Neural Network (ANN) with Radial Basis Function (RBF) and Levenberg Marquardt (LM) learning methods, Probabilistic Neural Network (PNN) and Support Vector Machine (SVM) algorithms as the base classifier. For the ensemble decision, Maximum Voting Ensemble (MVE) and Best Trained Ensemble (BTE) were applied for two-class classification, which predicts each sample of the Pima Indian dataset as diabetic or non-diabetic. The simulation-based performance comparison in terms of accuracy (91.56%), F-measure (0.91) and AUC (0.91) confirmed superiority of the proposed system over major existing approaches.

Key words: Diabetic Mellitus prediction, heterogenous deep ensemble learning, multi-feature learning, machine learning, computer aided diagnosis

Sandeep Honnurappa and Bevoor Krishnappa Raghavendra. A Highly Robust Heterogenous Deep Ensemble Assisted Multi-Feature Learning Model for Diabetic Mellitus Prediction [J]. Int J Performability Eng, 2021, 17(11): 926-937.

Add to citation manager EndNote|Reference Manager|ProCite|BibTeX|RefWorks

References

1. Katzmarzyk P.T., Craig C.L., andGauvin L,Adiposity, Physical Fitness and Incident Diabetes: The Physical Activity Longitudinal Study. Diabetologia, vol. 50, no. 3, pp. 538-544, 2007.
2. Wahl P.W., Savage P.J., Psaty B.M., Orchard T.J., Robbins J.A., andTracy, R.P, Diabetes in Older Adults: Comparison of1997 American Diabetes Association Classification of Diabetes Mellitus with 1985 WHO Classification. The Lancet, vol. 352, no. 9133, pp. 1012-1015, 1998.
3. Rahman M.A., Shoaib S.M., Al Amin, M., Toma, R.N., Moni, M.A., and Awal, M.A. A Bayesian Optimization Framework for the Prediction of Diabetes Mellitus. In 2019 5th International Conference on Advances in Electrical Engineering (ICAEE), IEEE, pp. 357-362, September 2019.
4. DEPERLİOĞLU, Ö. and Utku, K.Ö.S.E. Diabetes Determination using Retraining Neural Network. In 2018 International Conference on Artificial Intelligence and Data Processing (IDAP). IEEE, pp. 1-5, September 2018.
5. Wang Q., Cao W., Guo J., Ren J., Cheng Y., andDavis D.N.DMP_MI: an Effective Diabetes Mellitus Classification Algorithm on Imbalanced Data with Missing Values. IEEE Access, vol. 7, pp. 102232-102238, 2019.
6. Kaur, H. and Batra, S.HPCC: An Ensembled Framework for the Prediction of the Onset of Diabetes. In 2017 4th International Conference on Signal Processing, Computing and Control (ISPCC). IEEE, pp. 216-222, September 2017.
7. Kalyankar G.D., Poojara S.R., andDharwadkar N.V,Predictive Analysis of Diabetic Patient Data using Machine Learning and Hadoop. In 2017 international conference on I-SMAC (IoT in social, mobile, analytics and cloud)(I-SMAC). IEEE, pp. 619-624, February 2017.
8. Wei, S., Zhao, X. and Miao, C, A Comprehensive Exploration to the Machine Learning Techniques for Diabetes Identification. In 2018 IEEE 4th World Forum on Internet of Things (WF-IoT), IEEE, pp. 291-295, February 2018.
9. Mohebbi A., Aradóttir T.B., Johansen A.R., Bengtsson H., Fraccaro M., andMørup M.A Deep Learning Approach to Adherence Detection for Type 2 Diabetics. In 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC). IEEE, pp. 2896-2899, July 2017
10. Deperlioğlu, Ö. and Köse, U.Diagnogsis of Diabete Mellitus using Deep Neural Network. In 2018 Medical Technologies National Congress (TIPTEKNO). IEEE, pp. 1-4, November 2018.
11. Ahmed, S.T. and Patil, K.K, An Investigative Study on Motifs Extracted Features on Real Time Big-data Signals. In 2016 International Conference on Emerging Technological Trends (ICETT). IEEE, pp. 1-4, October 2016
12. Wang J., Cao K., Fang C., andChen J.FDFuzz: Applying Feature Detection to Fuzz Deep Learning Systems. International Journal of Performability Engineering, vol. 15, no. 10, pp. 2675, 2019.
13. Ahmed S.T., Priyanka H.K., Attar S. and Patted A.Cataract Density Ratio Analysis under Color Image Processing Approach. In 2017 International Conference on Intelligent Computing and Control Systems (ICICCS). IEEE, pp. 178-180, June 2017.

[1]	Ashu Mehta, Navdeep Kaur, and Amandeep Kaur. A Review of Software Fault Prediction Techniques in Class Imbalance Scenarios [J]. Int J Performability Eng, 2025, 21(3): 123-130.
[2]	Vikas, Charu Wahi, Bharat Bhushan Sagar, and Manisha Manjul. Trust Management in WSN using ML for Detection of DDoS Attacks [J]. Int J Performability Eng, 2025, 21(3): 157-167.
[3]	Arpna Saxena and Sangeeta Mittal. CluSHAPify: Synergizing Clustering and SHAP Value Interpretations for Improved Reconnaissance Attack Detection in IIoT Networks [J]. Int J Performability Eng, 2025, 21(1): 36-47.
[4]	Seema Kalonia and Amrita Upadhyay. Comparative Analysis of Machine Learning Model and PSO Optimized CNN-RNN for Software Fault Prediction [J]. Int J Performability Eng, 2025, 21(1): 48-55.
[5]	Vikas Kumar, Charu Wahi, Bharat Bhushan Sagar, and Manisha Manjul. Ensemble Learning Based Intrusion Detection for Wireless Sensor Network Environment [J]. Int J Performability Eng, 2024, 20(9): 541-551.
[6]	Kalyani H. Deshmukh, Gajendra R. Bamnote, and Pratik K Agrawal. A Novel Approach for Drought Monitoring and Evaluation using Time Series Analysis and Deep Learning [J]. Int J Performability Eng, 2024, 20(8): 498-509.
[7]	Saurabh Saxena, and Chetna Gupta. Optimizing Bug Resolution: A Data-Driven Developer Recommendation System [J]. Int J Performability Eng, 2024, 20(8): 510-519.
[8]	Lakshya Vaswani, Sai Sri Harsha, Subham Jaiswal, and Aju D. Unravelling Complexity: Investigating the Effectiveness of SHAP Algorithm for Improving Explainability in Network Intrusion System Across Machine and Deep Learning Models [J]. Int J Performability Eng, 2024, 20(7): 421-431.
[9]	Meenakshi Chawla and Meenakshi Pareek. A Hybrid Deep Learning Perspective for Software Effort Estimation [J]. Int J Performability Eng, 2024, 20(7): 442-450.
[10]	Ajeet Kumar Sharma and Rakesh Kumar. IoT Malware Detection and Dynamic Analysis of MQTT Simulated Network [J]. Int J Performability Eng, 2024, 20(7): 451-459.
[11]	Abhishek Gupta and Jaspreet Singh. Data-Driven Security Framework for VANET using Firefly and ANN [J]. Int J Performability Eng, 2024, 20(6): 344-354.
[12]	Vikas Verma, Arun Malik, and Isha Batra. Analyzing and Classifying Malware Types on Windows Platform using an Ensemble Machine Learning Approach [J]. Int J Performability Eng, 2024, 20(5): 312-318.
[13]	Harshita Batra and Leema Nelson. ESD: E-mail Spam Detection using Cybersecurity-Driven Header Analysis and Machine Learning based Content Analysis [J]. Int J Performability Eng, 2024, 20(4): 205-213.
[14]	Manu Jyoti Gupta and Parveen Sehgal. Optimizing Credit Card Fraud Detection: Classifier Performance and Feature Selection Empowered by Grasshopper Algorithm [J]. Int J Performability Eng, 2024, 20(3): 177-185.
[15]	Aparna Shrivastava and P Raghu Vamsi. Improving Anomaly Classification using Combined Data Transformation and Machine Learning Methods [J]. Int J Performability Eng, 2024, 20(2): 68-80.