Welcome to the newly upgraded BRAC University Institutional Repository! Following our recent system upgrade, we are actively organizing our collections. While the category counters on the homepage are currently syncing and may temporarily display low numbers, rest assured that our full repository of over 27,000 items remains safely intact. Please use the search bar above to easily access all scholarly outputs, theses, and institutional documents while we complete this categorization process.

Benchmarking and enhancing Bengali OCR: a hybrid OCR system with analytic hierarchy process-based evaluation

dc.contributor.advisorAlam, Md. Golam Robiul
dc.contributor.authorPathan, Md. Keum Uddin
dc.contributor.departmentDepartment of Computer Science and Engineering
dc.date.accessioned2026-03-01T06:29:35Z
dc.date.available2026-03-01T06:29:35Z
dc.date.copyright2025
dc.date.issued2025-10
dc.descriptionCataloged from PDF version of project report.
dc.descriptionIncludes bibliographical references (pages 71-73).
dc.descriptionThis project report is submitted in partial fulfillment of the requirements for the degree of Master of Science in Computer Science and Engineering, 2025.en_US
dc.description.abstractThis study benchmarks the performance of three OCR systems—Tesseract OCR, EasyOCR, and a hybrid approach combining Tesseract OCR, EasyOCR, and the Google Vision API—for Bengali text recognition. The evaluation was conducted on a diverse, real-world dataset comprising 216 images across nine categories of Bengali documents, totaling 19,064 words. Each OCR engine was independently assessed using multiple performance metrics, including Character Error Rate (CER), Word Error Rate (WER), Character-Level Accuracy (CLA), Word-Level Accuracy (WLA), and processing time. Among other preprocessing techniques, the pipeline employed grayscale conversion, resizing, noise removal, and adaptive thresholding; however, these steps did not consistently enhance the performance of standalone OCR engines. To address the limitations of single-engine systems, a hybrid OCR framework was developed that processes raw images and employs a multi-criteria decision-making approach based on the Analytic Hierarchy Process (AHP). A user study involving 41 participants was conducted to determine the relative importance of CER versus WER. Using Saaty’s scale, over 70% of participants assigned a value of 5 or higher in favor of CER. This resulted in a CER-to-WER importance ratio of 4.76, which was then used to compute AHP-based weights. For each image, OCR outputs were scored using a weighted combination of CER and WER, and the engine with the lowest score was selected as the optimal result. The hybrid system demonstrated strong performance under optimal conditions, achieving a Character- Level Accuracy (CLA) of 96.63% and a Word-Level Accuracy (WLA) of 80.34%, corresponding to a Character Error Rate (CER) of 3.37% and a Word Error Rate (WER) of 19.66%. This significantly outperformed Tesseract OCR (CLA: 88.54%, CER: 11.46%; WLA: 79.99%, WER: 20.01%) and EasyOCR (CLA: 90.98%, CER: 9.02%; WLA: 78.06%, WER: 21.94%). These results were obtained from specific document categories where OCR performance tends to be highest. While recognition accuracy may vary across different document types, the findings highlight the potential of the AHP-guided hybrid approach to substantially improve Bengali OCR performance in favorable scenarios and provide a strong foundation for further enhancement in more challenging, real-world conditions.en_US
dc.description.degreeM.Sc. in Computer Science and Engineering
dc.description.statementofresponsibilityMd. Keum Uddin Pathan
dc.format.extent92 pages
dc.identifier.otherID 16373002
dc.identifier.urihttp://hdl.handle.net/10361/27581
dc.language.isoenen_US
dc.publisherBRAC Universityen_US
dc.rightsBRAC University project reports are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
dc.subjectEasyOCRen_US
dc.subjectOptical character recognitionen_US
dc.subjectBengali OCRen_US
dc.subjectHybrid OCRen_US
dc.subjectGoogle vision APIen_US
dc.subjectBengali texten_US
dc.subjectText recognitionen_US
dc.subjectPerformance benchmarkingen_US
dc.subjectOCR evaluationen_US
dc.subjectWord error rateen_US
dc.subjectCharacter error rateen_US
dc.subjectReal-world dataseten_US
dc.subject.lcshOptical character recognition.
dc.subject.lcshBengali character sets (Data processing).
dc.titleBenchmarking and enhancing Bengali OCR: a hybrid OCR system with analytic hierarchy process-based evaluationen_US
dc.typeProject Reporten_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
16373002_CSE.pdf
Size:
973.21 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: