Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.12188/23147
DC FieldValueLanguage
dc.contributor.authorGjoreski, Martinen_US
dc.contributor.authorZajkovski, Gorjanen_US
dc.contributor.authorBogatinov, Aleksandaren_US
dc.contributor.authorMadjarov, Gjorgjien_US
dc.contributor.authorGjorgjevikj, Dejanen_US
dc.contributor.authorGjoreski, Hristijanen_US
dc.date.accessioned2022-09-28T09:10:20Z-
dc.date.available2022-09-28T09:10:20Z-
dc.date.issued2014-04-
dc.identifier.urihttp://hdl.handle.net/20.500.12188/23147-
dc.description.abstractThe paper presents an approach to Optical Character Recognition (OCR) applied on receipts printed in Macedonian language. The OCR engine recognizes the characters of the receipt and extracts some useful information, such as: the name of the market, the names of the products purchased, the prices of the products, the total amount of money spent, and also the date and the time of the purchase. We used the publicly available OCR framework Tesseract, which was trained on pictures of receipts printed in Macedonian language. The results showed that it can recognize the characters with 93% accuracy. Additionally, we used another approach that uses the original Tesseract to extract the features out of the picture and the final classification was performed with k-nearest neighbor’s classifier using dynamic time warping as a distance metrics. Even though the accuracy achieved with the modified approach was for 6 percentage points lower than the original approach, it is a proof of concept and we plan to further research it in future publications. The additional analysis of the results showed that the accuracy is higher for the words which are prescribed for each receipt, such as the date and the time of the purchase and the total amount of money spent.en_US
dc.subjectOCR; Receipt digitalization; Tesseract; DTW;en_US
dc.titleOptical character recognition applied on receipts printed in Macedonian Languageen_US
dc.typeProceeding articleen_US
dc.relation.conferenceInternational Conference on Informatics and Information Technologies (CIIT)en_US
item.grantfulltextopen-
item.fulltextWith Fulltext-
Appears in Collections:Faculty of Computer Science and Engineering: Conference papers
Files in This Item:
File Description SizeFormat 
CIIT2014.59.pdf304.25 kBAdobe PDFView/Open
Show simple item record

Page view(s)

46
checked on Jul 18, 2024

Download(s)

51
checked on Jul 18, 2024

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.