number of words. Consequently, each test case under-
went four iterations to derive metrics encompassing
accuracy, precision, recall, and F1-Score. Initially,
the input number of words was fixed at 5,000. Subse-
quently, this parameter was augmented to 10,000.
The outcomes of these method executions are pre-
sented in Tables 1 and 2.
Observations reveal that in the test case involving
5,000 words, the SVM approach produced subopti-
mal performance results. Nevertheless, upon imple-
menting the strategy of increasing the word count to
10,000, only slight variations in the outcomes were
noted.
Conversely, the Logistic Regression approach ex-
hibited robust consistency, with results remaining
largely unaffected despite the variance in word count.
Therefore, the significance of the Logistic Re-
gression algorithm is underscored in methodologies
where the dataset encompasses a larger number of
words and demonstrates greater variation.
Therefore, Logistic Regression stands out as the
number of words increases.
5 CONCLUSIONS
In conclusion, the importance of employing Super-
vised Machine Learning for Sentiment Analysis is un-
derscored, providing a robust framework that delivers
satisfactory results, particularly when handling exten-
sive datasets.
Nevertheless, it is noteworthy that the field lacks
a standardized computational method for Sentiment
Analysis, resulting in outcome variations contingent
on the specific techniques, algorithms, and models
employed.
The utilization of algorithms such as Logistic Re-
gression and SVM has proven instrumental in pro-
cessing large volumes of textual data, markedly aug-
menting computational capabilities for such tasks.
The outcomes of this study exhibit a level of perfor-
mance comparable to published works in the field,
signifying the efficacy of the proposed approach.
In essence, this method stands poised to aid pro-
fessionals in the realm of computational intelligence
engaged in Sentiment Analysis studies, offering a
well-suited avenue for discerning polarities among
analyzed words irrespective of the data source.
Finally, as future work, we will implement new
machine learning method and compare them with the
present ones. Moreover, we intend to establish com-
parisons with other approaches proposed in the litera-
ture, concerning the sentiment analysis issue.
ACKNOWLEDGEMENTS
The authors would like to thank Coordenac¸
˜
ao de
Aperfeic¸oamento de Pessoal de N
´
ıvel Superior -
Brasil (CAPES), under grant 88887.686064/2022-00,
and S
˜
ao Paulo Research Foundation (FAPESP), under
grant 2020/08615-8, for the partial financial support.
REFERENCES
Abdi, H., Valentin, D., and Edelman, B. (1999). Neural
networks. Number 124. Sage.
Bisong, E. and Bisong, E. (2019). Logistic regression.
Building Machine Learning and Deep Learning Mod-
els on Google Cloud Platform: A Comprehensive
Guide for Beginners, pages 243–250.
Bonaccorso, G. (2017). Machine learning algorithms.
Packt Publishing Ltd.
Chen, C. C. and Tseng, Y.-D. (2011). Quality evaluation of
product reviews using an information quality frame-
work. Decision Support Systems, 50(4):755–768.
Ghosh, S. and Gunning, D. (2019). Natural Language Pro-
cessing Fundamentals: Build intelligent applications
that can interpret the human language to deliver im-
pactful results. Packt Publishing Ltd.
Hatzivassiloglou, V. and McKeown, K. (1997). Predicting
the semantic orientation of adjectives. In 35th annual
meeting of the association for computational linguis-
tics and 8th conference of the european chapter of the
association for computational linguistics, pages 174–
181.
Hu, Z., Hu, J., Ding, W., and Zheng, X. (2015). Review sen-
timent analysis based on deep learning. In 2015 IEEE
12th International Conference on e-Business Engi-
neering, pages 87–94. IEEE.
Hutchins, W. J. (2004). The georgetown-ibm experiment
demonstrated in january 1954. In Conference of the
Association for Machine Translation in the Americas,
pages 102–114. Springer.
Kang, H., Yoo, S. J., and Han, D. (2012). Senti-lexicon and
improved na
¨
ıve bayes algorithms for sentiment analy-
sis of restaurant reviews. Expert Systems with Appli-
cations, 39(5):6000–6010.
Khurana, D., Koli, A., Khatter, K., and Singh, S. (2022).
Natural language processing: State of the art, current
trends and challenges. Multimedia Tools and Applica-
tions, pages 1–32.
Kitson, N. K., Constantinou, A. C., Guo, Z., Liu, Y., and
Chobtham, K. (2021). A survey of bayesian network
structure learning. arXiv preprint arXiv:2109.11415.
Ko, Y. and Seo, J. (2000). Automatic text categorization by
unsupervised learning. In COLING 2000 Volume 1:
The 18th International Conference on Computational
Linguistics.
Mammone, A., Turchi, M., and Cristianini, N. (2009). Sup-
port vector machines. Wiley Interdisciplinary Re-
views: Computational Statistics, 1(3):283–289.
ICEIS 2024 - 26th International Conference on Enterprise Information Systems
704