Improving Random Forest Performance for Sentiment Analysis on Unbalanced Data Using SMOTE and BoW Integration: PLN Mobile Application Case Study

Authors

  • Muhammad Rifqi Fadhlan Rahmatullah Universitas Dian Nuswantoro Author
  • Pulung Nurtantio Andono Universitas Dian Nuswantoro Author
  • Affandy Universitas Dian Nuswantoro Author
  • M. Arief Soeleman Universitas Dian Nuswantoro Author

DOI:

https://doi.org/10.15294/sji.v12i1.19295

Keywords:

Sentiment Analysis, PLN Mobile, TF-IDF, BoW, SMOTE, Random Forest

Abstract

Purpose: This research aims to improve the accuracy of sentiment analysis on PLN Mobile app reviews by overcoming the challenge of data imbalance. This goal is important to provide a better understanding of user opinions and support PT PLN (Persero) in improving mobile application services.

Methods: This research uses the Random Forest algorithm combined with Synthetic Minority Over-sampling Technique (SMOTE) to handle imbalanced data. Data is collected through web scraping reviews from the Google Play Store, followed by preprocessing processes such as data cleaning, stopword removal, tokenization, and stemming. Feature extraction is performed using the Bag of Words (BoW) method, and the data is tested with four sharing schemes.

Result: The results showed that the 90%-10% sharing scheme gave the best performance with an accuracy of 81% and an average precision and recall of 0.79. This finding confirms that the larger the proportion of training data, the better the model performs sentiment classification.

Novelty: This research's novelty lies in combining SMOTE with BoW and Random Forest to overcome data imbalance. This approach is a significant reference for future sentiment analysis research. It provides practical insights that PT PLN (Persero) can use to improve the quality of its application services.

Author Biographies

  • Muhammad Rifqi Fadhlan Rahmatullah, Universitas Dian Nuswantoro

    Muhammad Rifqi Fadhlan Rahmatullah, S.Kom

    Faculty of Computer Science Dian Nuswantoro University Semarang, Central Java, Indonesia

  • Pulung Nurtantio Andono, Universitas Dian Nuswantoro

    Prof. Dr. Pulung Nurtantio Andono, S.T., M.Kom

    Faculty of Computer Science Dian Nuswantoro University Semarang, Central Java, Indonesia

  • Affandy, Universitas Dian Nuswantoro

    Affandy, M.Kom, Ph.D.

    Faculty of Computer Science Dian Nuswantoro University Semarang, Central Java, Indonesia

  • M. Arief Soeleman, Universitas Dian Nuswantoro

    Dr M. Arief Soeleman, M.Kom

    Faculty of Computer Science Dian Nuswantoro University Semarang, Central Java, Indonesia

Downloads

Published

08-04-2025

Article ID

19295

Issue

Section

Articles

How to Cite

Improving Random Forest Performance for Sentiment Analysis on Unbalanced Data Using SMOTE and BoW Integration: PLN Mobile Application Case Study. (2025). Scientific Journal of Informatics, 12(1), 1-10. https://doi.org/10.15294/sji.v12i1.19295