Comparative Analysis on Machine Learning Algorithms for Multiple Disease Prediction

Authors

  • Sivadi Balakrishna Vignan's Foundation for Science, Technology & Research (Deemed to be University)
  • Yerrakula Gopi Vignan's Foundation for Science, Technology & Research (Deemed to be University)

Keywords:

K-Nearest Neighbor (KNN), ), Random Forest, Decision Tree, Naïve Bayes, Disease Prediction, Machine Learning

Abstract

These days, majority of the humans are suffered from multiple diseases because of eating habits and environmental situations. Hence, predication of these multiple diseases become a challenging and critical task in these days. Machine Learning (ML) algorithms becomes more popular to predict multiple diseases. For the multiple disease prediction, in this paper, we investigated and examined various ML algorithms such as Decision Tree, Random Forest, Naïve Bayes, K-Nearest Neighbor (KNN) used for accurate prediction of disease. For analysis of the ML-based classification algorithms, this paper intently used Accuracy as a performance metric and tested on the DiseaseSymptomKB dataset. The accuracy of general disease prediction by using Decision Tree is 95%, Random Forest is 95%, Naïve Bayes is 95% and KNN is 92%.

References

Alizadeh, Mohamad Javad, et al. "A new approach for simulating and forecasting the rainfall-runoff process within the next two months." Journal of hydrology 548 (2017): 588-597.

Seo, Youngmin, et al. "Daily water level forecasting using wavelet decomposition and artificial intelligence techniques." Journal of Hydrology 520 (2015): 224-243.

Wang, Baowei, et al. "Air quality forecasting based on gated recurrent long short term memory model in Internet of Things." IEEE Access 7 (2019): 69524-69534.

Andreas Lepperod, Hai Thanh Nguyen, Sigmund Akselsen, Leendert Wienhofen, and Pinar Ozturk and Weiqing Zhang, Air Quality Monitor and Forecast in Norway using NB-IoT and Machine Learning - Summit Smart City 360°.: Springer, 2019.

Y. Li and H. Cao, "Prediction for tourism flow based on LSTM neural networks," Procedia Computer Science 129, pp. 277-283, 2018.

Hrnjica, Bahrudin, and Ognjen Bonacci. "Lake level prediction using feed forward and recurrent neural networks." Water Resources Management 33.7 (2019): 2471-2484.

Zhang, Juntao, et al. "Using Recurrent Neural Network for Intelligent Prediction of Water Level in Reservoirs." 2020 IEEE 44th Annual Computers, Software, and Applications Conference (COMPSAC). IEEE, 2020.

Faruq, Amrul, et al. "River water level forecasting for flood warning system using deep learning long short-term memory network." IOP Conference Series: Materials Science and Engineering. Vol. 821. No. 1. IOP Publishing, 2020.

Kyunghyun Cho et al., "Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation," EMNLP, pp. 1724-1734, 2014.

L. Peng, S. Liu, R. Liu and L. Wang, "Effective long short-term memory with differential evolution algorithm for electricity price prediction, " Energy, vol. 162, pp. 1301-1314, Nov. 2018.

Hochreiter, Sepp, and Jürgen Schmidhuber. "Long short-term memory." Neural computation 9.8 (1997): 1735-1780.

Pascanu, Razvan, Tomas Mikolov, and Yoshua Bengio. "On the difficulty of training recurrent neural networks." International conference on machine learning. PMLR, 2013.

Gers, Felix A., Jürgen Schmidhuber, and Fred Cummins. "Learning to forget: Continual prediction with LSTM." Neural computation 12.10 (2000): 2451-2471.

Downloads

Published

2022-09-01

How to Cite

Balakrishna, S., & Gopi, Y. (2022). Comparative Analysis on Machine Learning Algorithms for Multiple Disease Prediction. International Journal of Machine Learning and Networked Collaborative Engineering, 5(`1), 8–16. Retrieved from http://mlnce.net/index.php/Home/article/view/172