中文说明:乌尔都语光学字符识别是个欠发达的地区和复杂的任务开发随着乌尔都语阿拉伯语脚本的家庭是草书,从右到左在性质和字符改变其形状和形式时它放置在初始、 中间或末尾的一个词。 强度衡量的拟议的系统像素中检测中一个句子和联接 / 连接的化合物的词为分割这些分段中的字符的单词字符是零到神经网络的分类。系统的原型已采用 Matlab,目前达到的平均精度 70%。
English Description:
Urdu Optical Character Recognition is a less developed area and a complex task to develop as Urdu being a family of Arabic script is cursive, right to left in nature and the characters change its shapes and forms when it is placed at initial, middle or at the end of a word. In the proposed system pixels strength is measured to detect words in a sentence and joins of characters in a compound/connected word for segmentation these segmented characters are feeded to Neural Network for classification. A prototype of the system has been developed using Matlab, currently achieves 70% accuracy on the average.