Comments (4)
Hi,
dhSegment takes as input a pair of images : the original image and the labelled image where the regions you want to extract are annotated with different 'colors'. It is not restricted to any format of annotation, as long as you are able to convert it to the above-mentioned labelled image.
So to answer your question, if you want to input directly XML files to dhSegment, no it will not work, but if you generate the corresponding labelled images, then yes, you'll be able to train a model.
There are already some implemented functions to parse files with PAGE-XML format and generate the corresponding masks in the PAGE.py file. You can also have a look at the exps/diva/utils.py
file that may give you some hints on how to adapt it to your specific experiment (the Layout Analysis example is the DIVA experiment with DIVA-HisDB data).
from dhsegment.
Ok, thanks!
Right now I'm using the page.py functions to analyze de XML files I have currently, to labeled image that dhSegment takes as input. After that, I should be able to train the system to recognize the type of documents I need to analyze.
But what about extracting the text to postprocess it and analyze what is written? Is that possible?
from dhsegment.
After thinking about the last question I made, I think I have the solution.
After training dhSegment, the output will be the page regions classified by different colours. After that, I have to analyze that image. Having known beforehand which colour corresponds to which element, I can take the coordinates and extract it from the original image. Only then I can analyze it properly because I know exactly what type of information is in that region (table, image, text...)
from dhsegment.
how train dhsegment using own dataset?
from dhsegment.
Related Issues (20)
- pages_sample.zip link not working HOT 1
- ValueError: too many values to unpack (expected 2) HOT 1
- No box found : Is dhSegment fit for my problem ? HOT 1
- Does it work without GPU? HOT 2
- Need a short guide of layout detection and line detection HOT 3
- Getting error "tensorflow.python.framework.errors_impl.InternalError: Failed to create session"
- OOTB syntax error with demo.py HOT 1
- It is not working with TF 2.0, HOT 2
- Which CUDA, cudnn and tensorflow versions are meant to be used together HOT 2
- Article Segmentation HOT 2
- Layout Analysis Use Case: DIVA-HisDB HOT 3
- Read Tag Values from XML file HOT 1
- Baselines to Textlines HOT 1
- Mulilabel limitation should be documented HOT 1
- PredictionType.CLASSIFICATION and extracting rectangles HOT 3
- Speed of Inference on GeForce GTX 1080 HOT 1
- Reproducing baseline detection results
- Tensorflow 2.4 (request for permission to upgrade this repo to this) HOT 3
- Performance issue in the definition of model_fn, dh_segment/estimator_fn.py(P1)
- Suggest to loosen the dependency on sacred
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from dhsegment.