Giter Club home page Giter Club logo

keras-image-ocr's Introduction

How to Run

The easies way is to run on this Google Colab notebook for free with GPU enabled.

Alternatively run on your local dev machine

It is suggested to run on a Linux machine since it could be quite challenging to install the dependencies cairocffi, or if you only have a Windows machine, running on its Windows Subsystem for Linux (WSL) is also a good choice.

Require Python 3.5+ and Jupyter notebook installed

Clone or download this repo

git clone https://github.com/Tony607/keras-image-ocr

Install required libraries

pip3 install -r requirements.txt

In the project start a command line run

jupyter notebook

In the opened browser window open

image-ocr.ipynb

keras-image-ocr's People

Contributors

tony607 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

keras-image-ocr's Issues

Training not resulting in proper predictions, also loaded weights problem

Hi, i've two issues with the code i hope you might help me with.

First: Training for the 25 (20+5) epochs does not result in accurate predictions, i ran this both locally and on the google collab notebook, not changing anything, and both times the predictions were really bad.
basically the output was maybe a few letter from the text repeated up to the max length.
did you change anything from the training to the notebook? or did you train longer?

Second: (this might be a bit more on my side, but you might have a clue)
Loading the provided weights on the google collab notebook results in accurate predictions, however loading them locally drops the accuracy. Meaning that it gets most words right, but with a letter being wrong, this applies to beam search as well, and the probabilities for the letters are a bit more "diffuse", not as sharp as the ones on the google collab version.
So basically i'm running the same code, loading the same weights, but having a decrease of accuracy locally. Understandable if you can't help me there, just thought you might have run into something like this before.

Locally I'm on a rather fresh install of Ubuntu, all prerequisites installed, using an anaconda virtualenv.

Thanks for the help,
Nic

ps: i also tried a fresh anaconda env in case something was interfering, with no change

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.