<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

Hi, There you have two main options: Write short to

TextLine region about p2pala HOT 4 CLOSED

mrocr commented on June 14, 2024 1

TextLine region

from p2pala.

Comments (4)

lquirosd commented on June 14, 2024

Hi, P2PaLA is developed to avoid that kind of cumbersome process. TextLine polygons most of the time are very time demanding to sketch for handwritten text documents (printed documents are easier but not always). In P2PaLA you can train the model for baselines and use the --line_offset argument to define height of the TextLine, then the system will automatically extract a TextLine around each baseline.

For example, on the sample.zip PAGE-XML file you can see the results using the example model and default parameters (--line_offset 50) for your sample page.

from p2pala.

mrocr commented on June 14, 2024

from p2pala.

lquirosd commented on June 14, 2024

Hi,
There you have two main options:

Write short script to create a baseline given the TextLine polygon (for printed documents something simple like the poly-line defined by the bottom vertexes of the TextLine-polygon plus some pixels on the Y-axis should work).
Update imgprocess.py and xmlPAGE.py scripts to create the GT mask using the TextLine node instead of Baseline.

Best Regards,

from p2pala.

mrocr commented on June 14, 2024

@lquirosd hmmmm...

A script that would create baselines from textlines page-xml is great. You will upload such script so that I test.
I tried this option, but instead of modifying imgprocess.py and xmlPAGE.py I just copied the TextLine regions and renamed into Baseline, download sample.zip, and trained using this config.zip the p2pala detection results were not good. I either get no results, or the results are bad.

from p2pala.

Recommend Projects

TextLine region about p2pala HOT 4 CLOSED

Comments (4)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent