
Comments (6)

yasersakkaf commented on May 27, 2024

Nice question, I am wondering the same thing. Could you tell me how to train the model?

from handwriting-ocr.

Breta01 commented on May 27, 2024

Hi,

This depends on the model you want to train. All the notebooks for model training contain Classifier in their names. These notebooks load data from the data folder (if you haven't already, you have to download the data from the provided URL), process it, and train the model, which is then saved in the models folder.

You don't have to do much more than replace the original data with yours and train the model. Your data has to be in the right format, which depends on the type of model.
Often the data is stored as image files named in the format label_timestamp.jpg.
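
To make the naming convention concrete, here is a minimal sketch of reading the label back out of such a file name. The helper name label_from_filename and the sample file name are made up for illustration; this is not code from the repository.

```python
from pathlib import Path


def label_from_filename(filename: str) -> str:
    """Return the label part of a name like 'label_timestamp.jpg'."""
    stem = Path(filename).stem  # drop any directory and the extension
    # Split at the LAST underscore so labels containing '_' still work.
    label, sep, _timestamp = stem.rpartition("_")
    if not sep or not label:
        raise ValueError(f"expected 'label_timestamp' in {filename!r}")
    return label


# Hypothetical file name, used only to show the call.
print(label_from_filename("data/hello_12345.jpg"))
```

Splitting at the last underscore is a design choice of this sketch: it assumes the timestamp itself never contains an underscore.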

If you need more details, please specify the model you want to train.


yasersakkaf commented on May 27, 2024

I want to train the CTC word classifier.
How do I do it?


Breta01 commented on May 27, 2024

OK, that's the easy one.

The training code is in this notebook: WordClassifier-CTC.ipynb. Currently, the data is loaded from the folder data/words2/ (the location is a parameter of loadWordsData()). In this folder I have images of words that are already normalized (grayscale, with a height of 60 px). The word images are named word_timestamp.jpg (word is the correct label, and timestamp can be just a random number).
For example, the following image is named sell_15132719.jpg:
[image: sell_1513271957 8685486]
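
If you are preparing your own images, a small helper along these lines could generate names in that pattern. This is only a sketch: make_word_filename is a hypothetical name, and using time.time_ns() for the numeric suffix is my assumption (any number should do, as noted above).

```python
import time
from typing import Optional


def make_word_filename(word: str, timestamp: Optional[int] = None) -> str:
    """Build a file name in the word_timestamp.jpg pattern."""
    if not word or "/" in word or "\\" in word:
        raise ValueError(f"unusable label: {word!r}")
    if timestamp is None:
        # Assumption: nanoseconds since the epoch as a unique-enough suffix.
        timestamp = time.time_ns()
    return f"{word}_{timestamp}.jpg"


print(make_word_filename("sell", 15132719))  # sell_15132719.jpg
```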

loadWordsData() loads the grayscale images and returns numpy arrays of images and labels. The model is then trained and saved to the location defined by the save_location variable.
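
Before training, it can help to check which files in your folder would be picked up and with which labels. The sketch below only lists paths and labels; it does not decode images and is not the implementation of loadWordsData(). The function name list_word_images is hypothetical.

```python
from pathlib import Path
from typing import List, Tuple


def list_word_images(folder: str) -> Tuple[List[Path], List[str]]:
    """Collect .jpg files named word_timestamp.jpg with their labels."""
    paths: List[Path] = []
    labels: List[str] = []
    for path in sorted(Path(folder).glob("*.jpg")):
        # The label is everything before the last underscore in the stem.
        label, sep, _timestamp = path.stem.rpartition("_")
        if sep and label:  # skip files that do not follow the pattern
            paths.append(path)
            labels.append(label)
    return paths, labels
```

Calling list_word_images("data/words2/") and printing the first few labels is a quick way to spot misnamed files before starting the notebook.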

I hope this helps.


mhsamavatian commented on May 27, 2024

What are the .txt files in data/words2? I am going to retrain the char classifier and it needs the .txt files. How can I generate .txt files for my data?


Breta01 commented on May 27, 2024

This question is a duplicate of #44.

