An Effective Way to Enhance Classifications for the Semi-Structured Research Articles

Ejaz Ahmed; Sumbal Ashraf; Waseem Shahzad

Keywords:

Structured research articles, semi-structured research articles, supervised technique, Gensim, TF-IDF

Abstract

Due to the sharp increase in research publications, numerous research articles are available electronically in online digital libraries. Some of these articles are not retrieved during online searches because they are poorly classified. Adequately structured research articles are easier to find than semi-structured and unstructured ones, and readers sometimes do not get accurate results from digital libraries because the articles are not classified properly. Neglecting semi-structured and unstructured published research not only creates gaps in coverage but also affects the results of proposed techniques and the citations of other articles. Researchers usually miss semi-structured and unstructured research articles during their online searches. Classification techniques have been applied to structured articles, but no significant work has been done on classifying semi-structured and unstructured research articles. Therefore, this research focuses on the classification of semi-structured research articles using different supervised classification techniques, so that more accurate and more numerous relevant search results can be obtained. For experimentation, a labeled dataset of semi-structured papers was used. The dataset comprises research articles that were manually gathered from the Santos repository dataset and labeled accordingly. The study used four supervised classification techniques: the Support Vector Machine (SVM) classifier, the Naïve Bayes classifier, the K-Nearest Neighbor classifier, and the Decision Tree classifier. These techniques were compared to see which classifier gives better accuracy. The measures selected to compare the classifiers are accuracy, recall, precision, and F-score.
The evaluation was based on the experimental results and their comparison. In the experiments, the K-Nearest Neighbor classifier performed better than the SVM, Decision Tree, and Naïve Bayes classifiers.
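
The four measures named in the abstract can be illustrated with a minimal sketch in plain Python. This is not the authors' code. The function name `evaluate`, the topic labels, and the two prediction lists are hypothetical, and macro-averaging across classes is an assumption, since the abstract does not say how per-class scores were combined.

```python
from collections import Counter


def evaluate(y_true, y_pred):
    """Return accuracy plus macro-averaged precision, recall and F-score.

    Macro-averaging (an assumption, not stated in the abstract) gives
    every class the same weight regardless of how many articles it has.
    """
    if not y_true or len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must be non-empty and equal in length")

    # Per-class true positives, false positives and false negatives.
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, guess in zip(y_true, y_pred):
        if truth == guess:
            tp[truth] += 1
        else:
            fp[guess] += 1   # predicted this class, but it was wrong
            fn[truth] += 1   # missed an article of the true class

    labels = sorted(set(y_true) | set(y_pred))
    precisions, recalls, f_scores = [], [], []
    for label in labels:
        predicted = tp[label] + fp[label]
        actual = tp[label] + fn[label]
        precision = tp[label] / predicted if predicted else 0.0
        recall = tp[label] / actual if actual else 0.0
        total = precision + recall
        f_score = 2 * precision * recall / total if total else 0.0
        precisions.append(precision)
        recalls.append(recall)
        f_scores.append(f_score)

    n = len(labels)
    return {
        "accuracy": sum(tp.values()) / len(y_true),
        "precision": sum(precisions) / n,
        "recall": sum(recalls) / n,
        "f_score": sum(f_scores) / n,
    }


# Hypothetical topic labels for six articles (illustrative only).
y_true = ["ml", "ml", "db", "db", "net", "net"]

# Hypothetical predictions from two classifiers, not results from the paper.
predictions = {
    "classifier_a": ["ml", "db", "db", "db", "net", "ml"],
    "classifier_b": ["ml", "ml", "db", "db", "net", "net"],
}

results = {name: evaluate(y_true, y_pred) for name, y_pred in predictions.items()}
for name, scores in results.items():
    print(name, {metric: round(value, 3) for metric, value in scores.items()})
```

Scoring each classifier's predictions against the same labeled articles, as in the loop above, is one way to put the classifiers side by side on all four measures.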

Published

2020-03-25

How to Cite

Ejaz Ahmed, Sumbal Ashraf, & Waseem Shahzad. (2020). An Effective Way to Enhance Classifications for the Semi-Structured Research Articles. University of Sindh Journal of Information and Communication Technology, 4(1), 17–23. Retrieved from https://sujo.usindh.edu.pk/index.php/USJICT/article/view/639

Issue

Vol. 4 No. 1 (2020): University of Sindh Journal of Information and Communication Technology

Section

Information Technology

License

University of Sindh Journal of Information and Communication Technology (USJICT) follows an Open Access Policy under the Attribution-NonCommercial (CC BY-NC) license. Researchers can copy and redistribute the material in any medium or format, for any purpose. Authors can self-archive the publisher's version of the accepted article in digital repositories and archives.

Upon acceptance, the author must transfer the copyright of the manuscript to the Journal for publication on paper, on data storage media, and online, with distribution rights to USJICT, University of Sindh, Jamshoro, Pakistan. Kindly download the copyright form below and attach it as a supplementary file during article submission.
