Towards the Optimal Use of Machine Learning Algorithms in Text Mining: A Quick Review
This paper aims to provide a quick review to jump-start the research in the field of text mining where Machine Learning (ML) algorithms have been used and several accomplishments have been reported by the research community. There are different categories of text mining, and the implementation of ML algorithms and techniques have been supported in the literature to give promising results. However, in this area of study, most of the research activities in terms of time and efforts are consumed during the initial stages where implementations and experiments are carried out to evaluate various combinations. The accomplishments in this field can be further advanced by presenting early investigations concisely and analytically. Thus, the benefits of this paper are threefold: first, it will provide a platform for the new researchers to start quickly with a shorter literature review and knowing more precisely about the combinations of text mining and ML; secondly, clear analysis has been presented about the text mining categories where the performance of ML algorithms have been reported successful; and lastly, the problems have been identified for which the algorithms were used in various studies. This will enable the new researchers to directly target the problem instead of implementing the existing techniques. With the help of well-structured questions, the results are more analytical and present multidimensional views to this research issue. Main findings include that ML has been widely used in document classification and Support Vector Machine (SVM) is the most successful algorithm reported.
Copyright (c) 2022 University of Sindh Journal of Information and Communication Technology
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
University of Sindh Journal of Information and Communication Technology (USJICT) follows an Open Access Policy under Attribution-NonCommercial CC-BY-NC license. Researchers can copy and redistribute the material in any medium or format, for any purpose. Authors can self-archive publisher's version of the accepted article in digital repositories and archives.
Upon acceptance, the author must transfer the copyright of this manuscript to the Journal for publication on paper, on data storage media and online with distribution rights to USJICT, University of sindh, Jamshoro, Pakistan. Kindly download the copyright for below and attach as a supplimentry file during article submission