Dataset of Urduud1k from Natural Scenes

Main Article Content

U. ZAKI
D. N. HAKRO
M. MEMON
F. H. KHOSO
K. U. R. KHOUMBATI
M. A. ZAKI
M. HAMEED
G. NABI

Abstract

In latest years research has drawn attention on text analysis in natural scenes. Databases play a significant part in the efficiency assessment of the algorithm for text recognition. A data set of natural scene text images in six distinct languages have recently been released in an International Conference on Document Analysis and Recognition (ICDAR).This dataset is for multi-languages except Urdu. In the natural images of the Urdu scene, there is an absence of a conventional Urdu text database. This research therefore mainly aims to build a database for Urdu text in natural scenes. The dataset is very large because there are 10 distinct cameras with distinct resolution, distinct angles and distinct range requirements for each picture captured by distinct light zone. The dataset comprises of Urdu words, ligatures and characters in natural scenes. The dataset contains 16k images of words, 32k ligatures and characters images. This dataset contains 1kimagesincluding signboard, a name of the store, banners and so on. In addition, the Urdu dataset is contrasted with the current data set including ICRAR 2003, ARASTI, Chars 74k, etc. The dataset includes many images from the natural scene so it can be used in natural environments to identify Urdu text.

Article Details

How to Cite
U. ZAKI, D. N. HAKRO, M. MEMON, F. H. KHOSO, K. U. R. KHOUMBATI, M. A. ZAKI, M. HAMEED, & G. NABI. (2019). Dataset of Urduud1k from Natural Scenes. Sindh University Research Journal - SURJ (Science Series), 51(4). Retrieved from https://sujo.usindh.edu.pk/index.php/SURJ/article/view/282
Section
Articles

Most read articles by the same author(s)

Obs.: This plugin requires at least one statistics/report plugin to be enabled. If your statistics plugins provide more than one metric then please also select a main metric on the admin's site settings page and/or on the journal manager's settings pages.