OCR for Classifying and Retrieving Different Syriac -

Preview:

Citation preview

International Journal of Advancements in Research & Technology, Volume 2, Issue 10, October-2013 177 ISSN 2278-7763

Copyright © 2013 SciResPub. IJOART

two other forms of “Taw” in Estrangela and Serta(West) can be retrieved using the same index .Fig 4 shows the forms of “Taw” letter.

Using 7 or 6 or 5 or 4 or 3 or 2 or 1 moment as extracted fea-ture for classification, the same results of recognition rate are achieved, and the rate of Recognition and retrieval accuracy is 100% for different Syriac alphabet forms , So the proposed OCR can use one moment instead of 2,3,4,5,6 or 7 moments , this will reduce the time needed to recognize the charac-ter. Table 4 shows the recognition and retrival time in millise-cond for one character by using moments between 1 to 7.

5 CONCLUSION A simple Syraic Alphabet Characters recognition system of the three different forms that uses Invariant moments as features, and that uses the index of the recognized letter to retrieve the other two forms of this letter is proposed. From the test results, it has been identified that using seven, six, five, four, three, two or one moment as a feature, the same results of recogni-tion and retrieval are achieved, Due to these results, one mo-ment can be used instead of two or three or four or five or six or seven moments to recognize the character, this leads to the reduction of time of the Syriac Alphabet Forms recognition system.

REFERENCES [1] Hangarge, Shashikala Patil and B.V.Dhandra “Multi-font/size Kan-

nada Vowels and Numerals Recognition Based on Modified Inva-riant Moments”, IJCA Special Issue on “Recent Trends in Image Processing and Pattern Recognition, RTIPPR, pp 126-130, 2010.

[2] S. Sardar, A. Wahab “Optical character recognition system for Urdu” International conference on Information and Emerging Technologies, June 2010.

[3] M.Soleymani and F.Razzazi,“An efficient front-end system for Iso-lated Persian/Arabic character Recognition ofhandwritten data-Entry Forms,” International Journal Of Computational Intelligence, vol 1, pp.193-196,2003.

[4] B. B. Chaudhuri and U. Pal, “A Complete Printed Bangla OCR Sys-tem”, Pattern Recognition. Vol. 31, pp.531-549, 1998.

[5] Abdul Monem S. Rahma, Basima Z.Yacob and Danny T. Baito "Moment invariants Based Features Extraction For Classification of Syriac Alphabet Language," International Journal of Advances in Engi-neering and Technology, Vol. 6, Issue 4, pp. 1442-1451 Sept. 2013.

[6] Rev. Shlemon I. Khoshaba, “Lessons in the teaching of the Syriac language”, AL-Mashrq House Cultural, Duhok – Iraq, 2010.

[7] http://learnaramaic.blogspot.com/2012/05/estrangelo-syriac-script.html#.UjxT-NLwl7I ,2012.

[8] http://www.omniglot.com/writing/syriac.htm [9] Syriac alphabet , http://en.wikipedia.org/wiki/Syriac_alphabet [10] Dhandra B.V., Malemath V.S., Mallikarjun H., Hegadi Ravindra, ”

Multi-font English Character Recognition based on Modified Inva-riant Moments “, Journal of Combinatorial Mathematics and Combina-torial Computing, Vol. 67, pp. 153-162 , 2008.

TABLE 4 THE RECOGNITION AND RETRIVAL TIME (MS) OF USING

MOMENTS BETWEEN 7 TO 1 Number of

Moment The recognition time(MS)for one character +

retrieval time of the two other forms 7 230.5 6 227.5 5 225.9 4 225.5 3 223.2 2 222.0 1 220.1

Fig. 4. Letter “Taw” has three different forms, (a): Estrangela form, (b) Serta (West) form, (c) Madnkḥaya(East) form

IJoART

Recommended