xiaoyu258 / docproj Goto Github PK

View Code? Open in Web Editor NEW

311.0 13.0 85.0 2.01 MB

Document Rectification and Illumination Correction using a Patch-based CNN

License: MIT License

Python 100.00%

distorted-images document-rectification document-database

docproj's Introduction

DocProj

Paper

The source code of Document Rectification and Illumination Correction using a Patch-based CNN by Xiaoyu Li, Bo Zhang, Jing Liao, Pedro V. Sander, SIGGRAPH Asia 2019.

Prerequisites

Linux or Windows
Python 3
CPU or NVIDIA GPU + CUDA CuDNN

Geometric Rectification

Dataset Generation

We use Blender to automatically generate synthetic distorted document image and the corresponding flow.

You can download a small dataset with 20 samples (438MB) from here for fun and the full dataset with 2450 samples (65GB) from BitTorrent or from OneDrive.

The dataset includes three folders:

img (the distorted images, with the shape of [2400, 1800, 3])
img_mask (the mask of background, with the shape of [2400, 1800])
flow (the forward flow of the distorted images, with the shape of [2, 2400, 1800])

The first thing you need to do is to crop the dataset to patches for training. Change arguments to your own and run the following commands. For help message about optional arguments, run python xxx.py --h

python local_patch.py   # crop images and flows to local patches and local patch flows
python global_patch.py  # crop images to global patches

Training

Run the following command for training and change the optional arguments like dataset directory, etc.

python train.py

Use a Pre-trained Model

You can download the pre-trained model here.

Run the following command for resizing and cropping the test document image to local and global patches and estimating the patch flows:

python eval.py [--imgPath [PATH]] [--modelPath [PATH]]
               [--saveImgPath [PATH]] [--saveFlowPath [PATH]]
               
--imgPath             Path to input image
--modelPath           Path to pre-trained model
--saveImgPath         Path to saved cropped image
--saveFlowPath        Path to saved estimated flows

Stitch flow

Download the Windows executable program here to stitch the patch flows to get the image flow.

Run the following command:

Graphcut.exe [Input Path] [Output Path]

[Input Path] is the path to input patch flows with the shape of [yNum, xNum, 2, patchH, patchW], where yNum and xNum are the number of patch in y and x direction, patchH and patchW are the height and width of a local patch.

[Output Path] is the path to the output stitched flow with the shape of [2, H, W].

Notes: The path should be absolute path with "//" due to the path parse function in the program. e.g. "H://Release//test data//2_patchFlows.npy"

Resampling

Import resampling.rectification function to resample the distorted image by the stitched flow.

The distorted image should be a Numpy array with the shape of [H, W, 3] for a color image or [H, W] for a greyscale image, the stitched flow should be an array with the shape of [2, H, W].

The function will return the resulting image and a mask to indicate whether each pixel will converge within the maximum iteration.

To help you follow all these steps, we also give an example with all the intermediate results here in the test data folder.

Illumination Correction

Training

Run the following command and change the optional arguments for training.

python train_illumination.py

Use a Pre-trained Model

You can download the pre-trained illNet model here and pre-trained vgg model here.

Run the following command for testing:

python eval_illumination.py [--imgPath [PATH]] [--savPath [PATH]] [--modelPath [PATH]]
                            
--imgPath             Path to input image
--savPath             Path to saved output
--modelPath           Path to pre-trained model

docproj's People

Contributors

Stargazers

Watchers

Forkers

fendaq kapitsa2811 storms0 cqray1990 wuyunxiangwyx wxk2008 cuimiao187561 10183308 jingmouren templeblock labimage wwwanghao peterzs kuan-li lxyzler happog peterzhousz liuzhuang1024 fxwfzsxyq liuguoyou hongchow lele-xie lhwcv hciilab rkshuai ljwdust syshensyshen jingwanli6666 linnawang76 xqyd yanqi1811 teresasun sporterman wizaron xiaoyubing challenging6 yangtong1989 yjingyu kien-nguyen-ngoc shenshenzhanzhan wyc2015fq wenjiawang0312 buyanfangqi aishmittal gpb123q yuhengfdada alwc marvis kiruthikaadhi martinhoang11 tanapol-aigen toydogcat gitwithmch ajeffries0492 wanghuogen yangyin2016 manojyasaswi yangsuhui yuansky amseej apinzonf ashwin580 huyutao3550346 lyqsr c-song mfkiwl rococostudio spyt2h xgmiao tonyyuanmd husterrc elijahahianyo tinyriver jackzhousz solee7650 yinlu6 webstorage119 wujushan williamqf-ai iamxd666 thibautsauv

docproj's Issues

pre_trained model dont work well on your paper data

Hello! I used your geometry_model to correct distorted images. I followed your doc and tried the test_data, everything is just right.
But then I used images from your paper, it cant get the similar result. It looks like just resized it and changed little.

input

output

GPU个数

请问那个光照矫正模型训练最少需要几个GPU？？

patchFlows to stitchedFlow

I dont know how to run this graphcut,can you give me some suggestions plz

Click .exe directly come up a problem like this.

Generate dataset with Blender

Hello, I'm trying to generate dataset for my own images, but got stuck on extracting the texture coordinates when given camera view coordinates. So I wonder if you are planning to open source your scripts for dataset generating?
If not, could you please give some hints on how to transform the pixel coordinates in rendered image to the UV coordinates?
Thank you !

请问光照校正的那个模型具体怎么训练？

Source code

Hi!

Do you plan you open up the source code before conference?

Thanks!

about the train dataset

The dewarp result in your paper is excellent. However, the links of pre-trained model and dataset are all invalid, i also try to use Blender to generate synthetic distorted document image, it is hard for me to master the software. so, could you please provided the dataset and pre-trained model again, very appreciated for your kindly reply !

torchvision import _C ERROR

Hello every one i trained the illumnation model on my dataset and when i tried to run the evaluation i am getting this error:

from torchvision import _C
ImportError: DLL load failed: Le module spécifié est introuvable.

torch version 1.1.0

The link to the dataset on OneDrive is missing

I can't access to that.
The message is 'This link has been removed.'
Can you restore the file? [OneDrive]

I am sorry but it is not allowed to use BitTorrent on my environment :(

The dataset doesn't have the original scanned image

Hi,
when I follow your work, I find that the dataset you provide donesn't incorporate the scanned images, which are used as gorund truth to train the illumination correction network.
Could you release this datavset

Thank you！

_

Stitching The Flows

Hi,
I tried the stitching the tool in windows. It worked. However, I went thoroughly through the paper for how the stitching works. Can you suggest some good direction on how that can be implemented efficiently like you had the binaries in windows? This project is really intriguing. Thanks.

Here is my google colab impimentation of the project, it dosent seem to work at all even on the documents used by this project

https://colab.research.google.com/drive/1NmKyW8jceBMpfkcfrozWxdwK7JduxBrX?usp=sharing
Please tell me if there is something I'm doing wrong in the notebook

resampling of sample data

I just used the sample data with ground truth npy, and use the resampling.py to unwarp the imgs. The results are not good. The background cannot be cropped.

Graphcut.exe can not work.

Excellent work for document rectification!

But I got some problem when using the Graphcut.exe to get the stitchedFlow.npy
terminal as this:
PS D:\abpycharm\DocProj\Stitching> .\Graphcut.exe "\test data\2_patchFlows.npy" "\test data\5.npy"
\test data\2_patchFlows.npy
\test data\5.npy
flow dir: \test data\2_patchFlows.npy

But there is no 5.npy generated. I confirmed that 2_patchFlows is regulatory size as[11,8,2,256,256]
Please help me !

numba cann't run

Thanks for your work.
Can you share the version of numba, numpy and skimage?
my local version:
numpy:1.14.0
skimage:0.13.1
numba:0.38.0

Using @cuda.jit, a error occurred.
`numba.cuda.cudadrv.error.NvvmError: Failed to compile

libnvvm : error: -arch=compute_61 is an unsupported option
NVVM_ERROR_INVALID_OPTION`
I try to update numba to the lasest, but it requires the version of numpy laster than 1.15.0, so skimage is not compitable

The dataset torrent has no peers to download from.

Hi @xiaoyu258, the dataset torrent provided in the README has no peers detected to download from.

The model does not work well

   I use your pre_trained model to inference the my data and the small dataset,the results are not so good.And I use the small dataset to train a new model,I use the new model to infenrence one of the imgs in the dataset,it is not effective ether.Can you tell me the reason,What do i need to pay attention to？
  Looking forward to your reply,thanks a lot!

Can you provide the version of each library in your local environment? Or can you provide your local environment docker？

illumination network dataset

Thank you for sharing such wonderful work. Where can I get illumination network dataset?

Blender instruction

Hi @xiaoyu258

Thanks for sharing the code. Could you share more details of how to use blender to render the surface flow? so that we can introduce our own dataset for training, thanks

bishe

which version used for numba

we face issue in executing: resampling.py
my numba lib version is 0.51.2
OS: Windows 10
Python 3.7

Error :
Traceback (most recent call last):
File "resampling.py", line 203, in
resImg, resMsk = rectification(distortedImg, flow)
File "resampling.py", line 194, in rectification
iterSearch[blockspergrid, threadsperblock](padu, padv, paddistorted, resultImg, maxIter, precision, resultMsk)
File "C:\Users\testing\AppData\Local\Programs\Python\Python37\lib\site-packages\numba\cuda\compiler.py", line 770, in call
self.stream, self.sharedmem)
File "C:\Users\testing\AppData\Local\Programs\Python\Python37\lib\site-packages\numba\cuda\compiler.py", line 861, in call
kernel = self.compile(argtypes)
File "C:\Users\testing\AppData\Local\Programs\Python\Python37\lib\site-packages\numba\cuda\compiler.py", line 935, in compile
kernel.bind()
File "C:\Users\testing\AppData\Local\Programs\Python\Python37\lib\site-packages\numba\cuda\compiler.py", line 576, in bind
self.func.get()
File "C:\Users\testing\AppData\Local\Programs\Python\Python37\lib\site-packages\numba\cuda\compiler.py", line 446, in get
ptx = self.ptx.get()
File "C:\Users\testing\AppData\Local\Programs\Python\Python37\lib\site-packages\numba\cuda\compiler.py", line 414, in get
arch = nvvm.get_arch_option(*cc)
File "C:\Users\testing\AppData\Local\Programs\Python\Python37\lib\site-packages\numba\cuda\cudadrv\nvvm.py", line 345, in get_arch_option
return 'compute%d%d' % arch
TypeError: not enough arguments for format string

Looks to me this might be a version mismatch issue
can someone help with which version we need to use in the window and Linux?
Batter if provided all prerequisite lib name and version that needed that might help.

We check the code is right：

I don‘t know what is the problem.
Thank you!

Can you provide a e-mail for asking more details？

DocProj blender file

@xiaoyu258
it's better that you upload the blender file, so that we can generate using our own images.
So that you wont need to upload the entire 65GB dataset. The blender file will be enough.

xiaoyu258 / docproj Goto Github PK

docproj's Introduction

DocProj

Prerequisites

Geometric Rectification

Dataset Generation

Training

Use a Pre-trained Model

Stitch flow

Resampling

Illumination Correction

Training

Use a Pre-trained Model

docproj's People

Contributors

Stargazers

Watchers

Forkers

docproj's Issues

Excellent work for document rectification!

Recommend Projects

Recommend Topics

Recommend Org