Pashto Isolated Character Recognition Using K-NN Classifier

Main Article Content

N. AHMAD
A. A. KHAN
S. A. R. ABID
M. YASIR
NASIM-ULLAH

Abstract

This paper presents the development of Optical Character Recognition (OCR) system for printed Pashto text. The problem of the unavailability of the standard database for Pashto language has also been addressed by developing a medium size database with 25 different variations with a total number of 1125 entries in the final database. In the proposed approach, individual Pashto characters are recognized utilizing both high and low level features. High level features are based on the structural information from the characters and the resulting binary trees uniquely classify each of the characters. The approach though quite robust is affected slightly by the variation in size, orientation and writing style. An alternative low level feature approach based on K-Nearest Neighbors has been used giving an overall word recognition of 74.8%.

Article Details

How to Cite
N. AHMAD, A. A. KHAN, S. A. R. ABID, M. YASIR, & NASIM-ULLAH. (2013). Pashto Isolated Character Recognition Using K-NN Classifier. Sindh University Research Journal - SURJ (Science Series), 45(4). Retrieved from https://sujo.usindh.edu.pk/index.php/SURJ/article/view/5626
Section
Articles