Bio-Inspired PSO for Improving Neural Based Diabetes Prediction System
Keywords:F-Score, PSO, neural-network, hybrid-feature-selection, Machine learning;, diabetes-dataset
A high level of glucose in the blood over a long period creates diabetes disease. Undiagnosed diabetes may trigger other complications such as cardiovascular disease, nerve damage, renal failure, and so on. There are many factors age, blood pressure, food habits, lifestyle changes are some of the reasons for diabetes. With increasing cases of diabetes in the smart Internet world, there is a need for an automated prediction system to facilitate the patients, to get know, whether they are affected by the disease or not. There are many diabetes prediction software that is already in use, still, the accurateness of a diabetes prediction is not complete. This paper presents a robust framework (PSO-NNDP), employs a novel hybrid feature selector to improvise the neural-based diabetes prediction system. The novel hybrid feature selector presented in this paper comprises the merits of the correlation coefficient, F-score, and particle swarm optimization methods to influence the feature selection process. The reliability of the proposed framework has been experimented on the benchmarking dataset. By establishing the clear steps, for the replacement of missing values, removal of outliers, the proposed framework obtains 99.5% accuracy. Moreover, the experimented machine learning models also show a great improvement upon the usage of the proposed feature selector.
Cho N, Shaw J E, Karuranga S, Huang Y D, da Rocha Fernandes J D, Ohlrogge, A W and Malanda B, “IDF Diabetes Atlas: Global estimates of diabetes prevalence for 2017 and projections for 2045”, Diabetes research and clinical practice, 138, 271–281, 2018.
Li G, Peng S, Wang C, Niu J and Yuan Y, “An energy-efficient data collection scheme using denoising autoencoder in wireless sensor networks”, Tsinghua Science and Technology, 24(1), 86–96, 2018.
Care D, “To compare oral glucose tolerance test (OGTT) glucose, C-peptide, and insulin responses and insulin sensitivity in youth and adults with impaired glucose tolerance (IGT) or recently diagnosed type 2 diabetes”, Diabetes Care, 41, 1707, 2018.
Saadatnejad S, Oveisi M, Hashemi M, “LSTM-based ECG classification for continuous monitoring on personal wearable devices”, IEEE J. Biomed. Health Inform, Vol. 24, No. 2, pp. 515–523, April 2019.
Pham T, Tran T, Phung D and Venkatesh S, “Predicting healthcare trajectories from medical records: A deep learning approach”, Journal of biomedical informatics, 69, 218–229, 2017.
Kaji D A, Zech J R, Kim J S, Cho S K, Dangayach N S, Costa A B and Oermann E K, “An attention based deep learning model of clinical events in the intensive care unit”, PloS one, 14(2), 0211057, 2019.
Pima Indians Diabetes Database. Available online: https://gist.github.com/ktisha/c21e73a1bd1700294ef790c56c8aec1f (accessed on 27 August 2019).
Predict the Onset of Diabetes Based on Diagnostic Measures. Available online: https://www.kaggle.com/uciml/pima-indians-diabetes-database (accessed on 21 June 2019).
Luo M, Wang K, Cai Z, Liu A, Li Y and Cheang C F, “Using imbalanced triangle synthetic data for machine learning anomaly detection”, Comput., Mater. Continua, 58(1), 15–26, 2019.
Al Helal M, Chowdhury A I, Islam A, Ahmed E, Mahmud M S and Hossain S, An optimization approach to improve classification performance in cancer and diabetes prediction. In 2019 International Conference on Electrical, Computer and Communication Engineering (ECCE) (pp. 1–5). IEEE, 2019.
Kowsher M, Turaba M Y, Sajed T, “Prognosis and treatment prediction of type-2 diabetes using deep neural and machine learning classifiers” in International Conference on Computer and Information Technology (ICCIT), pp. 1–6, Dhaka, Bangladesh, December 2019.
Soniya S, Singh L, “Application and need based architecture design of deep neural networks”, Int.J. Patterson Recogn. Artif. Intell, Vol. 34, No. 13, pp. 2052014, April 2020.
Ramazi R, Perndorfer C, Soriano E, Laurenceau JP, Beheshti R, “Multi-modal Predictive Models of Diabetes Progression”, in Proceedings of the 10th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics, pp. 253–258, New Yark, United States, September 2019.
Alharbi A, Alghahtani M, “Using genetic algorithm and ELM neural networks for feature extraction and classification of type 2-diabetes melIitus”, Appl. Artif. Intelli, Vol. 33, No. 1, pp. 311–328, March 2019.
Zhu L, Wu B and Cao C, “Introduction to medical data mining”, Journal of biomedical engineering, Vol. 20, No 3, pp. 559–62, September 2003.
Durgesh K, Srivastava Lekha Bhambhu, “Data Classification Using Support Vector Machine”, Journal of Theoretical and Applied Information Technology, Vol. 12, No. 1, pp. 1—, February 2009.
Balaji H, Iyengar, N, and R D Caytiles, “Optimal Predictive analytics of Pima Diabetics using Deep Learning”, Int. J. Database Theory Appl., 10(1), 47–62, 2017.
Lekha S, and M. Suchetha, “Real-Time Non-Invasive Detection and Classification of Diabetes Using Modified Convolution Neural Network”, IEEE J. Biomed. Health Inform., 22(1), 1630–1636, 2018.
Nemade D R, and R K, Gupta, “Diabetes Prediction using BPSO and Decision Tree Classifier”, In 2nd International Conference on Data, Engineering and Applications (IDEA) (pp. 1–5). IEEE, 2020.
Gitmez M, Aydilek İ B, and A. Gümüşçü, “Statistical Box Graph Method and Determination of Outlier Values by Meta Heuristic Algorithms”, In 2019 1st International Informatics and Software Engineering Conference (UBMYK) (pp. 1–6). IEEE, 2019.
Xie W, Chkrebtii O, and S. Kurtek, “Visualization and outlier detection for multivariate elastic curve data”, IEEE transactions on visualization and computer graphics, 26(11), 3353–3364, 2019.
Song Q, Jiang H, and J. Liu, “Feature selection based on FDA and F-score for multi-class classification”, Expert Systems with Applications, 81, 22–27, 2017.
Lukmanto R B, Nugroho A, and H. Akbar, “Early detection of diabetes mellitus using feature selection and fuzzy support vector machine”, Procedia Computer Science, 157, 46–54, 2019.
Subbulakshmi C V, and S N. Deepa, “Medical dataset classification: a machine learning paradigm integrating particle swarm optimization with extreme learning machine classifier”, The Scientific World Journal, 2015.
Brezoènik L, “Feature selection for classification using particle swarm optimization”, In IEEE EUROCON 2017-17th International Conference on Smart Technologies (pp. 966–971). IEEE, 2017.
Koumi F, Aldasht M, and H. Tamimi, “Efficient feature selection using particle swarm optimization: A hybrid filters-wrapper approach”, In 2019 10th International Conference on Information and Communication Systems (ICICS) (pp. 122–127). IEEE, 2019.
Ashiquzzaman A, Tushar A K, Islam M R, Shon D, Im, K., Park, J. H, and J.Kim, “Reduction of overfitting in diabetes prediction using deep learning neural network”. In IT convergence and security pp. 35–43, Springer, Singapore, 2017.
Massaro A, Maritati V, Giannone D, Convertini D, and A. Galiano, “LSTM DSS automatism and dataset optimization for diabetes prediction”, Applied Sciences, 9(17), 3532, 2019.
Zhou H, Myrzashova R, and R. Zheng, “Diabetes prediction model based on an enhanced deep neural network”, EURASIP Journal on Wireless Communications and Networking, 2020.