Towards the Optimal Use of Machine Learning Algorithms in Text Mining: A Quick Review

  • Syed Zafar Ali Shah, Department of Computer Software Engineering, University of Engineering & Technology, Mardan, Pakistan
  • Sadaqat Jan, Department of Computer Software Engineering, University of Engineering & Technology, Mardan, Pakistan
  • Ibrar Ali Shah, Department of Computer Software Engineering, University of Engineering & Technology, Mardan, Pakistan
Keywords: text mining, machine learning, sentiment analysis, support vector machine

Abstract

This paper provides a quick review to jump-start research in text mining, a field where Machine Learning (ML) algorithms have been applied and the research community has reported several accomplishments. Text mining has different categories, and the literature reports that ML algorithms and techniques give promising results in them. However, in this area of study, most research time and effort is consumed in the initial stages, where implementations and experiments are carried out to evaluate various combinations. The accomplishments in this field can be further advanced by presenting early investigations concisely and analytically. Thus, the benefits of this paper are threefold: first, it provides a platform for new researchers to start quickly, with a shorter literature review and more precise knowledge of the combinations of text mining and ML; second, it presents a clear analysis of the text mining categories in which ML algorithms have been reported to perform successfully; and lastly, it identifies the problems for which the algorithms were used in various studies. This will enable new researchers to target the problem directly instead of implementing the existing techniques. With the help of well-structured questions, the results are more analytical and present multidimensional views of this research issue. The main findings are that ML has been widely used in document classification and that Support Vector Machine (SVM) is the most successful algorithm reported.

Published
2022-09-30
How to Cite
Syed Zafar Ali Shah, Sadaqat Jan, & Ibrar Ali Shah. (2022). Towards the Optimal Use of Machine Learning Algorithms in Text Mining: A Quick Review. University of Sindh Journal of Information and Communication Technology, 6(3), 89-94. Retrieved from https://sujo.usindh.edu.pk/index.php/USJICT/article/view/6276
