Giter Club home page Giter Club logo

Comments (1)

hiroi-sora avatar hiroi-sora commented on July 17, 2024 1

全屏长图怎么处理

实际上应该拆分为两个问题:超大分辨率图片如何添加忽略区域?超大分辨率图片如何准确地识别?

1. 超大分辨率图片如何添加忽略区域?

答:很遗憾,忽略区域窗口的UI不支持滚动,所以目前可能难以操作。未来有可能改善这个功能。

2. 超大分辨率图片如何准确地识别文字?

可能你也发现了,直接将超大图片扔处理列表转文字,效果奇差,甚至一个文字都转不出来。这是因为 为了提高效率,PaddleOCR预先将长度超标的图片进行压缩,再执行文字识别。想临时关闭这个功能,只需将PaddleOCR_json_config.txt中的max_side_len 960改写到比较大的数(要处理的图片的高度)即可。不过,处理完图片后,最好将参数改回来,否则以后处理正常尺寸图片时会很慢。

或者可以参照下面的步骤生成一个专门应对大图片的识别器:

  1. 复制一份PaddleOCR_json.exe,重命名为PaddleOCR_json_大分辨率.exe。复制一份PaddleOCR_json_config.txt,重命名为PaddleOCR_json_大分辨率_config.txt
  2. PaddleOCR_json_大分辨率_config.txt中的max_side_len一行改为max_side_len 7000
  3. 打开Umi-OCR软件,转到设置,将识别器路径改为PaddleOCR-json\PaddleOCR_json_大分辨率.exe
  4. 正常导入大分辨率图片并识别即可。

16577831063401.png

1657783221440.png

from umi-ocr.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.