ocr-gt-tools

A web interface for creating ground truth for evaluating and training OCR.

Summary

ocr-gt-tools allows editing hOCR files, such as those produced by the tesseract or ocropy OCR frameworks.

Screenshot

Features

  • Editing transcriptions of lines
  • Commenting on line and page level
  • Use standardized comment tags to mark common problems
  • Cheatsheet
  • Zoom in / Zoom out
  • Filter visible elements
  • Select multiple lines and apply tags

Installation

See INSTALL.md.

About the code

The server-side code is written in Perl.

The frontend is written in HTML and JavaScript.

Usage

  • Open 'ocr-gt-tools/index.html' in a browser
  • Open the 'Page Previews' from Kitodo in a second window
  • Search for the book from which you created the hOCR file
  • Drag and drop an image from the Kitodo 'Page Preview' window onto the window showing 'ocr-gt-tools/index.html'
  • The Perl script ocr-gt-tools.cgi creates all required files in the background, which takes a few seconds
  • A JSON object is returned to index.html via Ajax
  • index.html then loads the generated 'correction.html' and 'anmerkungen.txt' inline via Ajax
  • The 'Speichern' (Save) button becomes active once you have edited a text line or written a comment
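
The request flow in the steps above could look roughly like the following sketch. It is not the project's actual code: the query parameter `imageUrl` and the JSON fields `correctionUrl` and `commentsUrl` are hypothetical names chosen for illustration; only the script name ocr-gt-tools.cgi comes from this README.

```javascript
// Hypothetical sketch: the parameter name `imageUrl` and the JSON fields
// `correctionUrl` / `commentsUrl` are assumptions, not the project's real API.

// Build the request URL for the CGI script from a dropped image URL.
function buildRequestUrl(cgiPath, imageUrl) {
  return cgiPath + '?imageUrl=' + encodeURIComponent(imageUrl);
}

// Ask the server to create the files, then load the two documents it reports.
// `fetchImpl` is injectable so the flow can be tried without a running server.
async function loadPage(imageUrl, fetchImpl = fetch) {
  const response = await fetchImpl(buildRequestUrl('ocr-gt-tools.cgi', imageUrl));
  const info = await response.json();
  // Fetch the correction page and the comments file in parallel.
  const [correction, comments] = await Promise.all([
    fetchImpl(info.correctionUrl).then((r) => r.text()),
    fetchImpl(info.commentsUrl).then((r) => r.text()),
  ]);
  return { correction, comments };
}
```

In a browser, a `drop` event handler could read the dragged URL with `event.dataTransfer.getData('text/uri-list')` and pass it to `loadPage`, then insert the returned text into the page.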

Contributing

Expand the wiki

We are using the wiki to collect transcription hints for unusual glyphs and frequent errors.

Pull Requests

Bug fixes, new functions, suggestions for new features and other user feedback are appreciated.

The source code is available from https://github.com/UB-Mannheim/ocr-gt-tools. Please submit code contributions as pull requests on GitHub.

Bug reports

Please feel free to open issues for any bug you encounter and features you'd like to have.

Acknowledgments

This is free software. You may use it under the terms of the GNU General Public License Version 3 (GPLv3). See LICENSE for details.

This project bundles other free software:
