Comments (3)
The API for recognize was changed to support multiple images as of #9. In this case, if you change pipeline.recognize(image) to pipeline.recognize([image])[0], you'll be off to the races. I'm sorry for the breaking change -- I think the API is rapidly stabilizing thanks to the great feedback provided by yourself and others.
I made this change because going from one image to multiple images efficiently (i.e., with appropriate batching at each step) is non-trivial for users, so I wanted to make the API handle that for them. The examples all assume multiple images because doing so requires no additional complexity in the example code and is easily adapted to a single image, as the fix above shows.
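To make the shape of the change concrete, here is a minimal sketch of the wrap-and-unwrap pattern. It does not import keras-ocr: FakePipeline, recognize_one, the string "images" and the box tuples are all hypothetical stand-ins that only mimic the call shape described above (a list of images in, one list of predictions per image out).

```python
from typing import List, Sequence, Tuple

# Hypothetical stand-in types. The real library works with image arrays
# and box coordinates; strings and 4-tuples are placeholders here.
Image = str
Prediction = Tuple[str, Tuple[int, int, int, int]]


class FakePipeline:
    """Stand-in that mimics only the call shape of a batched recognize()."""

    def recognize(self, images: Sequence[Image]) -> List[List[Prediction]]:
        # One inner list of (word, box) predictions per input image.
        return [[(f"word-from-{image}", (0, 0, 10, 10))] for image in images]


def recognize_one(pipeline: FakePipeline, image: Image) -> List[Prediction]:
    """Wrap a single image in a list, then unwrap the single result."""
    return pipeline.recognize([image])[0]


pipeline = FakePipeline()
batch = pipeline.recognize(["a.jpg", "b.jpg"])  # multi-image call
single = recognize_one(pipeline, "a.jpg")       # single-image adaptation
print(len(batch))
print(single)
```

With the real pipeline, the same one-line wrapper should apply as long as recognize takes a list of images and returns one result list per image, as described above.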
With regard to examples, I agree with you that it's a good idea to include a detector-only example and a recognizer-only example along with the existing full pipeline example. That said, the examples should all use multiple images, again because it's easy for users to adapt a multi-image example to a single image but it is not as easy to conceptualize doing the reverse. Would you want to send a PR with the added examples in /docs/examples/using_pretrained_models.rst?
from keras-ocr.
I believe the above answer resolves the error you're seeing so I'm closing this issue for now. If it does not resolve your error, please reply here and I will reopen. Thanks!
Thank you very much for your fast response.
I tried to understand what's happening and it's quite clear now. Because of the change to support multiple images, I had to pass a list of images (pipeline.recognize(image) to pipeline.recognize([image])), and since the output is a list of lists, I had to extract the list with [0] before passing it to keras_ocr.tools.drawBoxes...
I would very much like to provide examples for the API, but I have never really worked with git and PRs and such :) I added this task to my todo list, but even if it's not a big deal I will need some time to get to it. Until now I have just played around when I had some free time (weekends) to see what's possible.