Optimization of the C4.5 Algorithm by Using a Genetic Algorithm for the Diagnosis of Life Expectancy for Hepatitis Patients
Abstract
As technology develops rapidly, the amount of data generated experiencing rapid development, including medical data. Data can help diagnose the life expectancy of people with the disease such as hepatitis using data mining methods in the medical field. In this research, technique data mining uses a classification technique with the C4.5 algorithm and the UCI Machine Learning Repository dataset. This dataset has 19 attributes, 1 class, and 155 samples. C4.5 algorithm is optimized using the Genetic Algorithm feature selection process. This study compares the accuracy of the C4.5 algorithm before and after optimization using a Genetic Algorithm. C4.5 algorithm produces the highest accuracy of 96.23%. Meanwhile, the C4.5 algorithm, after being optimized using Genetic Algorithm, has the highest accuracy of 98.11%. The number of features selected is 15 features. Application of Genetic Algorithms in C4.5 algorithm is proven to improve the accuracy in diagnosing life expectancy of people with hepatitis as much as 1.88%.
Copyright (c) 2021 Journal of Advances in Information Systems and Technology
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.