Tamil Characters Recognition and Retrieval

Abdol Hamid Pilevar


This paper considers the shape of vertical projection curves. The behavior of the edges of the vertical projection curve is used to create the feature vectors of the characters. The edges of the vertical projection curve are traced, and the direction of movement along the edges is mapped by the Eleven Direction Method (EDM). The direction codes are extracted and saved as the feature vectors of the characters. The method is tested on printed Tamil text documents. The test data are collected from various legal documents and contain alphabetic and special characters. EDM is also used to search and retrieve characters from Tamil text databases. The effectiveness and performance of the proposed algorithm have been tested with 10 separate data samples in 6 different fonts. The experiments show that more than 97% of the Tamil characters are recognized correctly; therefore, the proposed algorithm and the selected features perform satisfactorily.
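
As a rough illustration of the pipeline summarized above, the following Python sketch computes a vertical projection profile from a binary glyph and turns the steps between neighbouring columns into direction codes. The abstract does not define the eleven directions of EDM, so the quantization used here (clamping each step to the range -5 to 5 and shifting it to a code from 0 to 10) is purely an assumption made for illustration, as are the function names `vertical_projection` and `direction_codes` and the toy glyph.

```python
def vertical_projection(glyph):
    """Count foreground (1) pixels in each column of a binary glyph."""
    return [sum(column) for column in zip(*glyph)]


def direction_codes(profile, levels=11):
    """Quantize the step between neighbouring profile values into codes.

    Hypothetical scheme (not taken from the paper): each step is clamped
    to [-half, half] and shifted so that code `half` means "flat",
    smaller codes mean a falling edge and larger codes a rising edge.
    """
    half = levels // 2
    codes = []
    for left, right in zip(profile, profile[1:]):
        step = max(-half, min(half, right - left))
        codes.append(step + half)
    return codes


# Toy 3x4 binary glyph (rows of pixels, 1 = ink); illustrative data only.
glyph = [
    [0, 1, 1, 0],
    [1, 1, 1, 0],
    [1, 0, 1, 1],
]

profile = vertical_projection(glyph)   # ink count per column
features = direction_codes(profile)    # one code per pair of adjacent columns
print(profile, features)
```

A feature vector built this way has one entry fewer than the number of columns, so glyphs of different widths would need resampling or a sequence-matching step before comparison; how the paper handles this is not stated in the abstract.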


