ocrinaction's Introduction

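Companion code for the book 《深度实践OCR:基于深度学习的文字识别》 (Deep Practice of OCR: Text Recognition Based on Deep Learning)

This book is aimed mainly at beginner- and intermediate-level learners and practitioners of text recognition. It introduces a range of OCR methods, covering both fundamentals (image preprocessing, data generation and augmentation, text detection and recognition) and material that combines them with neural networks and object detection. As this is the first edition, many problems remain. We hope to improve the book and give beginners in China a systematic path into the field, so we very much welcome suggestions for improvement from readers.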

Environment

The code in this book runs on Ubuntu by default and depends mainly on PyTorch, NumPy, OpenCV, and Pillow.

1. Install dependencies

pip install -r requirements.txt
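
After installation, a quick check that the four main dependencies can be found may save debugging time later. The following is a minimal sketch and not part of the book's code. The import names (`torch`, `numpy`, `cv2`, `PIL`) are the usual ones for these packages but are assumptions here, as is the helper name `check_dependencies`.

```python
import importlib.util

# Assumed mapping from the packages named in this README to their import names.
REQUIRED = {
    "PyTorch": "torch",
    "NumPy": "numpy",
    "OpenCV": "cv2",
    "Pillow": "PIL",
}


def check_dependencies(modules=REQUIRED):
    """Return {label: bool}, True when the module can be located without importing it."""
    return {
        label: importlib.util.find_spec(name) is not None
        for label, name in modules.items()
    }


if __name__ == "__main__":
    for label, found in check_dependencies().items():
        print(f"{label}: {'found' if found else 'MISSING'}")
```

Any package reported as MISSING can be installed individually with pip, or by re-running the command above against requirements.txt.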

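Chapter layout

  • chapter-2 Image preprocessing methods
  • chapter-3 Traditional machine learning methods
  • chapter-4 Deep learning fundamentals
  • chapter-5 Data generation
  • chapter-6 Advanced deep learning methods
  • chapter-7 Text detection techniques
  • chapter-8 Text recognition
  • chapter-9 OCR post-processing methods
  • chapter-10 Layout analysis methods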

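Other notes

A section on datasets has been added to chapter 5; readers can use them to train detection and recognition networks. Chapters 7 and 9 are still incomplete, and the authors of those chapters have been asked to finish the code. Because the detection chapter has no hands-on examples in the book, two cases will be added soon, one using a one-stage method and one using a two-stage method. Thank you for your patience!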

Contributors

mjq11302010044, liushuchun, ccmolly