Giter Club home page Giter Club logo

lucs1590 / nkocr Goto Github PK

View Code? Open in Web Editor NEW
34.0 34.0 9.0 81.79 MB

๐Ÿ”Ž๐Ÿ“ This is a module to make specifics OCRs at food products and nutritional tables.

Home Page: https://medium.com/analytics-vidhya/how-did-the-machine-read-nutritional-facts-271a53893194

License: Apache License 2.0

Python 79.19% Jupyter Notebook 19.67% Makefile 1.14%
computer-vision east food-products hacktoberfest image-processing language ocr opencv opencv-python pytesseract python python3 specifics-ocrs spelling-correction symspell tesseract tesseract-ocr tesseract-ocr-engine tesseract-python

nkocr's Introduction

Lucas Brito

Twitter Badge Linkedin Badge Gmail Badge Facebook Badge

Hi guys ๐Ÿ‘‹, I'm Lucas, but everyone calls me Brito, and feel free to call too. I have a lot of dreams, and they move me. I'm a technologist in Big Data in Agribusiness and I'm working as a Machine Learning Engineer at Agi. I'm also doing a Masters in Computer Science at UNESP.

๐Ÿƒ As a hobby, I run every week and do some exercises.

Strava Badge

โœ๏ธ Sometimes I write a little about the things I learn.

Medium Badge Dev.To Badge

To more informations, access:

nkocr's People

Contributors

alvarocavalcante avatar dependabot[bot] avatar kevinah95 avatar lucs1590 avatar renanzulian avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

nkocr's Issues

shows file not found when using OcrTable()

text = OcrTable("https://foodnoms.com/static/blog/2020-04-04_better-nutrition-label-scanner/label-scanner.png")

error:
FileNotFoundError Traceback (most recent call las

Downloading model every test.

I noticed that in every test that I run the model is downloaded again. This doesn't happen when running the OCR, only when testing, that's a factor that impacts directly in execution time when testing.

Publish package on conda

Is your feature request related to a problem? Please describe.
An excellent alternative to pip is related to the use of conda, so we could add this package on conda.

Describe the solution you'd like
Publishing NKOcr Python Package on conda and conda-forge

Describe alternatives you've considered
Publish a docker image.

Additional context

Add Methods docs

Add to each method a documentation following the docstring patterns.

Access Denied

Access denied with the following error:

    Cannot retrieve the public link of the file. You may need to change
    the permission to 'Anyone with the link', or have had many accesses. 

You may still be able to access the file from the browser:

     https://drive.google.com/uc?export=download&id=1qGe5Zq8VzGxU90Kpt3fUb4noBBywUGuw 

Python refactoring

I'll implement the changes that I suggested in the previous PR. The scope is to reduce repeated code and improve patterns.

Class name convention python

Missing documentation

Hey,
great work and very interesting project. I try to develop a project for the same purpose as well. The only difference is that I like to read in german nutritional facts.

So I came across your project and tried to install and used it. The text results were kind of strange to me. I used it like this:

text = OcrTable(image, language='deu')
Should I also use:
spell_corrector: bool = False, show_performace: bool = False
Or should that be True?

I used text.text to get the results. So overall I think it needs a bit more of documentation.
Thanks in advance

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.