koksing456 / textrecognition


TextRecognition

This project aims to recognize handwritten characters in images with an accuracy of 80% or higher and to output the detected characters using different OCR tools: Pytesseract, EasyOCR, and KerasOCR. Image preprocessing is performed before text detection because image quality directly affects OCR accuracy: the lower the quality of the image, the lower the chance that a character is detected. A variety of image preprocessing techniques is therefore applied to improve image quality.
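As an illustration of the kind of preprocessing meant here, the sketch below binarizes a grayscale image with Otsu's threshold. The README does not say which preprocessing techniques the notebook applies, so the choice of Otsu binarization and the helper names `otsu_threshold` and `binarize` are assumptions made for this example. The image is represented as plain nested lists so the block runs without extra packages; in practice a library routine such as OpenCV's `cv2.threshold` with the `cv2.THRESH_OTSU` flag would normally replace the hand-written loop.

```python
from typing import List

# Rows of 8-bit grayscale pixel values (0-255). Hypothetical representation
# chosen for this sketch; a notebook would more likely use a NumPy array.
Gray = List[List[int]]


def otsu_threshold(image: Gray) -> int:
    """Return the threshold that maximizes the between-class variance."""
    hist = [0] * 256
    for row in image:
        for px in row:
            hist[px] += 1

    total = sum(hist)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w_bg = 0    # number of pixels at or below the candidate threshold
    sum_bg = 0  # sum of the intensities of those pixels
    for t in range(256):
        w_bg += hist[t]
        sum_bg += t * hist[t]
        if w_bg == 0:
            continue  # no pixels in the dark class yet
        w_fg = total - w_bg
        if w_fg == 0:
            break  # every pixel is already in the dark class
        mean_bg = sum_bg / w_bg
        mean_fg = (sum_all - sum_bg) / w_fg
        var_between = w_bg * w_fg * (mean_bg - mean_fg) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t


def binarize(image: Gray) -> Gray:
    """Map pixels above the Otsu threshold to 255 and all others to 0."""
    t = otsu_threshold(image)
    return [[255 if px > t else 0 for px in row] for row in image]
```

Binarization of this kind separates ink from paper before the image is handed to an OCR tool. Whether it helps on a given handwriting sample would have to be checked against the accuracy target.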

Train dataset

Tesseract provides three main sets of trained data: tessdata, tessdata_fast and tessdata_best. Together they cover over 100 languages and 35 scripts, including English (eng), Malay (msa), Chinese - Simplified (chi_sim), Chinese - Traditional (chi_tra) and Tamil (tam). This project uses tessdata to avoid incompatibility with the installed Tesseract version, because it contains both the legacy models (--oem 0) and the newer LSTM models (--oem 1). tessdata is faster than tessdata_best but slightly less accurate, and it is not retrainable. The tessdata LSTM models (--oem 1) have been updated on GitHub to integerized versions of tessdata_best, so they should be faster but probably still a little less accurate than tessdata_best.
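The engine mode and trained-data location described above can be passed to Tesseract through Pytesseract's `config` string. The following is a minimal sketch, not the notebook's actual code: the helper names `build_tesseract_config` and `recognize`, the default page segmentation mode (`--psm 7`, a single text line) and any directory path are assumptions for illustration. Only `pytesseract.image_to_string` and the Tesseract options `--oem`, `--psm` and `--tessdata-dir` are part of the real tools.

```python
from typing import Optional


def build_tesseract_config(oem: int = 1, psm: int = 7,
                           tessdata_dir: Optional[str] = None) -> str:
    """Build the extra command-line options handed to Tesseract.

    oem: 0 = legacy engine, 1 = LSTM engine, 2 = both, 3 = default.
    psm: page segmentation mode (7 treats the image as a single text line).
    tessdata_dir: optional folder holding the *.traineddata files.
    """
    if oem not in (0, 1, 2, 3):
        raise ValueError("oem must be 0, 1, 2 or 3")
    parts = [f"--oem {oem}", f"--psm {psm}"]
    if tessdata_dir is not None:
        parts.insert(0, f'--tessdata-dir "{tessdata_dir}"')
    return " ".join(parts)


def recognize(image_path: str, lang: str = "eng", **config_kwargs) -> str:
    """Run Tesseract on one image and return the recognized text."""
    # Imported here so the config helper can be used (and tested) on a
    # machine where the OCR stack is not installed.
    import pytesseract
    from PIL import Image

    config = build_tesseract_config(**config_kwargs)
    text = pytesseract.image_to_string(Image.open(image_path),
                                       lang=lang, config=config)
    return text.strip()
```

Calling `recognize(path, oem=0)` would select a legacy model and `recognize(path, oem=1)` an LSTM model, which is possible with tessdata because it ships both.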

Test dataset

test_v2

test_v2 is a public domain dataset obtained from Kaggle. It consists of more than four hundred thousand handwritten names collected through charity projects. Text recognition uses image processing to convert characters in scanned documents into digital form. It typically performs well on machine-printed fonts, but handwritten characters remain difficult for machines to recognize because individual writing styles vary enormously. The motivation is to explore the task of classifying handwritten text and converting it into digital format using a variety of approaches.
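One way to check OCR output against the labelled names is a character-level accuracy based on edit distance, sketched below. The README does not state how the 80% figure is measured, so this metric is an assumption. The column names `FILENAME` and `IDENTITY` are likewise assumed for the label file and should be adjusted to the actual CSV header. The helper names are hypothetical.

```python
import csv
from typing import Dict, TextIO


def load_labels(csv_file: TextIO, filename_col: str = "FILENAME",
                label_col: str = "IDENTITY") -> Dict[str, str]:
    """Read image-name -> ground-truth text pairs from an open CSV file."""
    labels = {}
    for row in csv.DictReader(csv_file):
        truth = (row.get(label_col) or "").strip().upper()
        if truth:  # skip rows without a usable label
            labels[row[filename_col]] = truth
    return labels


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: insertions, deletions and substitutions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def character_accuracy(labels: Dict[str, str],
                       predictions: Dict[str, str]) -> float:
    """Return 1 - (total edit distance / total label characters)."""
    errors = total = 0
    for name, truth in labels.items():
        # A missing prediction counts as an empty string (all errors).
        predicted = predictions.get(name, "").strip().upper()
        errors += edit_distance(truth, predicted)
        total += len(truth)
    if total == 0:
        return 0.0
    return max(0.0, 1.0 - errors / total)
```

With labels loaded from the CSV and a dictionary of OCR outputs keyed by the same file names, `character_accuracy(labels, predictions) >= 0.80` would express the project's target under this particular metric.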

System Flowchart

[Image: system flowchart]

For more information, see https://github.com/tesseract-ocr/tesseract and https://tesseract-ocr.github.io/tessdoc
