Sentiment Analysis on SocialMedia Using TF-IDF Vectorization and H2O Gradient Boosting for Student Anxiety Detection
DOI:
https://doi.org/10.15294/sji.v12i1.20582Keywords:
Anxiety, H2O Gradient Boosting, Analysis Sentiment, TF-IDF VectorizationAbstract
Purpose: Mental health issues are now a concern for many people. Anxiety or often called Anxiety that is excessive and prolonged has also become the forefront of various psychological disorders that trigger impacts such as stress to suicide. People using social media platforms tend to be a medium for expressing opinions sharing information and even expressing daily emotions. Many studies have shown a correlation between expressing emotional statements on social media and mental disorders. This research aims to conduct sentiment analysis of Anxiety on social media using H2O Gradient Boosting by implementing TF-IDF Vectorization which is set to max feature.
Methods: This research utilizes 6980 post data from social media. The method applied is by conducting Exploratory Data Analysis then Data preprocessing, Tf-Idf Vectoriztion with max feature experiments 100, 250, 500, 1000 and 2000, H2O Gradient Boosting Model, Cross Validation, and Model performance evaluation.
Result: The results of this study show good model performance through max feature TF-IDF = 250 with an accuracy value of 99%, Specificity 99.57%, and Eror Rate of 0.0106.
Novelty: So that the use of the H2O Gradient Boosting model succeeded in providing good performance in classifying anxiety sentiment.