Loading...
Please wait, while we are loading the content...
Similar Documents
The 4 th International Conference on Electrical Engineering and Informatics ( ICEEI 2013 ) Arabic Character Recognition System Development
| Content Provider | Semantic Scholar |
|---|---|
| Author | Supriana, Iping Nasution, Albadr |
| Copyright Year | 2014 |
| Abstract | We develop Arabic Optical Character Recognition (AOCR) system that has five stages: preprocessing, segmentation, thinning, feature extraction, and classification. In preprocessing stage, we compare two skew estimation algorithms i.e. skew estimation by image moment and by skew triangle. We also implemented binarization and median filter. In thinning stage, we use Hilditch thinning algorithm incorporated by two templates, one to prevent superfluous tail and the other one to remove unnecessary interest point. In segmentation stage, line segmentation is done by horizontal projection cross verification by standard deviation, sub-word segmentation is done by connected pixel components, and letter segmentation is done by Zidouri algorithm. In the feature extraction stage, 24 features are extracted. The features can be grouped into three groups: main body features, perimeterskeleton features, and secondary object features. In the classification stage, we use decision tree that generated by C4.5 algorithm. Functionality test showed that skew estimation using moment is more accurate than using skew triangle, median filter tends to erode the letter shape, and template addition into Hilditch algorithm gives a good result. Performance test yield these result. Line segmentation had 99.9% accuracy. Standard deviation is shown can reduce over-segmentation and quasi-line. Letter segmentation had 74% accuracy, tested on six different fonts. Classification components had 82% accuracy, tested by cross validation. Unfortunately, overall performance of the system only reached 48.3%. © 2013 The Authors. Published by Elsevier B.V. Selection and peer-review under responsibility of the Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia. |
| File Format | PDF HTM / HTML |
| Alternate Webpage(s) | http://dmr.in/iping_aocr.pdf |
| Language | English |
| Access Restriction | Open |
| Content Type | Text |
| Resource Type | Article |