Performance of Ensemble Learning in Diabetic Retinopathy Disease Classification

Authors

DOI:

https://doi.org/10.15294/sji.v11i2.4725

Keywords:

Classification, Ensemble learning, Random forest, Extra-trees, XGBoost, Double random forest, Diabetic retinopathy

Abstract

Purpose: This study explores diabetic retinopathy (DR), a complication of diabetes leading to blindness, emphasizing early diagnostic interventions. Leveraging Macular OCT scan data, it aims to optimize prevention strategies through tree-based ensemble learning.

Methods: Data from RSKM Eye Center Padang (October-December 2022) were categorized into four scenarios based on physician certificates: Negative & non-diagnostic DR versus Positive DR, Negative versus Positive DR, Non-Diagnosis versus Positive DR, and Negative DR versus non-Diagnosis versus Positive DR. The suitability of each scenario for ensemble learning was assessed. Class imbalance was addressed with SMOTE, while potential underfitting in random forest models was investigated. Models (RF, ET, XGBoost, DRF) were compared based on accuracy, precision, recall, and speed.

Results: Tree-based ensemble learning effectively classifies DR, with RF performing exceptionally well (80% recall, 78.15% precision). ET demonstrates superior speed. Scenario III, encompassing positive and undiagnosed DR, emerges as optimal, with the highest recall and precision values. These findings underscore the practical utility of tree-based ensemble learning in DR classification, notably in Scenario III.

Novelty: This research distinguishes itself with its unique approach to validating tree-based ensemble learning for DR classification. This validation was accomplished using Macular OCT data and physician certificates, with ETDRS scores demonstrating promising classification capabilities.

Author Biographies

  • Anwar Fitrianto, IPB University

    Anwar Fitrianto holds a Bachelor of Science (S.Sc.) degree in Statistics from IPB University, a Masters in Statistics (M.Sc.) from Universiti Putra Malaysia, and a Doctoral degree (Dr.) in Statistics from Universiti Putra Malaysia. He teaches as a Lecturer in the Statistics Department at IPB University. His research interests include Robust Statistics, Statistical Quality Control, Statistical Modeling, and Experimental Design.

  • Agus Mohamad Soleh, IPB University

    Agus Mohamad Soleh holds a Bachelor of Science (S.Si.) in Statistics from IPB University, a Masters in Informatics (M.T.) at ITB, and a Doctorate (Dr.) in Statistics from IPB University. He teaches as a Lecturer in the Statistics Department at IPB University, Indonesia. His research interests focus on Machine Learning, Statistical Computing, and Statistical Modeling, which are important fields in the ever-evolving statistical landscape.

Downloads

Article ID

4725

Published

29-05-2024

Issue

Section

Articles

How to Cite

Performance of Ensemble Learning in Diabetic Retinopathy Disease Classification. (2024). Scientific Journal of Informatics, 11(2), 375-386. https://doi.org/10.15294/sji.v11i2.4725