Giter Club home page Giter Club logo

go.tesseract's Introduction

##go.tesseract go.tesseract is a wrapper for the tesseract OCR library (text-recognition from image/pdf).

Installation and dependencies

go.tesseract has two direct dependencies; go.leptonica and libtesseract

Make sure you have installed go.leptonica. go.leptonica has a C library dependency, please read the go.leptonica/README.md.

You are required to install the tesseract library including development headers at version 3.04.00 or later. You absolutely need 3.04.00 (or later) as go.tesseract can not compile with earlier versions of tesseract. At time of writing this version of tesseract is not in the ubuntu/debian stable repository yet.

go.tesseract uses gopkg.in for versioned releases:

go get gopkg.in/GeertJohan/go.tesseract.v1

Debian testing (stretch) package

sudo apt-get install -t testing libtesseract3 libtesseract-dev

OSX with Homebrew

Do the following before trying to go get this package:

$ brew install leptonica
$ brew install tesseract
$ export CGO_LDFLAGS="-L$(brew --prefix leptonica)/lib -L$(brew --prefix tesseract)/lib"
$ export CGO_CFLAGS="-I$(brew --prefix leptonica)/include -I$(brew --prefix tesseract)/include"

Note: this assumes you are using the standard Brew path of /usr/local/Cellar

Manual installation

Download, configure, make and install

git clone https://github.com/tesseract-ocr/tesseract
cd tesseract
git checkout tags/3.04.00
./autogen.sh
./configure
make
sudo make install
sudo ldconfig

Language files

If you have installed from debian testing (jessie):

sudo apt-get install -t testing tesseract-ocr-YOUR-LANGUAGE-SHORTCODE

# example, this installs dutch and english
sudo apt-get install -t testing tesseract-ocr-nld
sudo apt-get install -t testing tesseract-ocr-eng

If you have installed manually; copy language files (do this for any language you require)

sudo cp tessdata/YOUR-LANGUAGE-SHORTCODE.* /usr/local/share/tessdata/

# example for english and dutch:
sudo cp tessdata/eng.* /usr/local/share/tessdata/
sudo cp tessdata/nld.* /usr/local/share/tessdata/

For more information, view the tesseract compilation guide.

go.tesseract's People

Contributors

gwitmond avatar pitpit avatar robertmeta avatar tleyden avatar tsudoko avatar

Watchers

 avatar  avatar  avatar

Forkers

dibiasefrn

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.