
bcmi / cdtnet-high-resolution-image-harmonization

115 stars, 12 forks, 9 open issues, 30.84 MB

[CVPR 2022] We unify pixel-to-pixel transformation and color-to-color transformation in a coherent framework for high-resolution image harmonization. We also release 100 high-resolution real composite images for evaluation.

Python 84.67% C++ 6.78% Cuda 7.22% Shell 0.53% C 0.60% Objective-C 0.20%
image-harmonization high-resolution-image-harmonization image-composition

cdtnet-high-resolution-image-harmonization's People

Contributors

mia-cong, taoxinhao13, ustcnewly


cdtnet-high-resolution-image-harmonization's Issues

About the training results?

Following iSSAM, I checked the training visualizations and did not see any change. I set 120 epochs; the results below are from epoch 12:
[screenshots: reconstruction visualizations at iterations 249000, 195000, 158000, 197000]
My loss is L = L_pix + L_rgb + L_ref, all L1 losses. I did not add the 3D LUT tv_cons and mn_cons regularization terms (computed as below):
loss = mse + opt.lambda_smooth * (weights_norm + tv_cons) + opt.lambda_monotonicity * mn_cons
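
For reference, a rough sketch of how the 3D LUT smoothness (tv_cons) and monotonicity (mn_cons) terms are typically computed on the LUT lattice; this is written in the spirit of the image-adaptive 3D LUT approach, and the function name and the (3, D, D, D) layout are assumptions, not this repository's exact implementation:

    import torch
    import torch.nn.functional as F

    def lut_regularizers(lut):
        # lut: (3, D, D, D) lattice mapping input RGB coordinates to output colors.
        # Differences between neighboring lattice entries along each input color axis.
        dif_r = lut[:, :, :, 1:] - lut[:, :, :, :-1]
        dif_g = lut[:, :, 1:, :] - lut[:, :, :-1, :]
        dif_b = lut[:, 1:, :, :] - lut[:, :-1, :, :]
        # Smoothness (total variation): adjacent entries should change gradually.
        tv_cons = (dif_r ** 2).mean() + (dif_g ** 2).mean() + (dif_b ** 2).mean()
        # Monotonicity: penalize entries whose output decreases as the input increases.
        mn_cons = F.relu(-dif_r).mean() + F.relu(-dif_g).mean() + F.relu(-dif_b).mean()
        return tv_cons, mn_cons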
The final output of the refinement module follows the design of the low-resolution output:
output = attention_map * image + (1.0 - attention_map) * self.to_rgb(conv_2)

I am not sure whether the problem is in my design or somewhere else. Below are the training curves of the three L1 losses:
[screenshot: training loss curves, 2022-03-26]

Strange results when reproducing with HAdobe5k_2048.pth

Hello, great work!

I want to use this model to run inference on custom images, so I followed

python3 evaluate_model.py CDTNet ./HAdobe5k_2048.pth --gpu 0 --datasets HAdobe5k --hr 2048 --lr 512 --save_dir ./CDTNet_2048_result

to run the test.

I only tested one image; that is, HAdobe5k_test.txt contains only:

a3630_1_5.jpg
a3630_1_1.jpg
a3630_1_2.jpg
a3630_1_3.jpg
a3630_1_4.jpg

The metric results after testing are shown below:
[screenshot: metric results]
The visual results also look quite poor.

I feel like I must have set something up wrong. Is the model I loaded incorrect?

About network details

[screenshot: architecture figure from our paper]
[screenshot: architecture figure from the iSSAM paper]

I would like to ask: does the pixel-to-pixel transformation only include the Encoder and Decoder from the iSSAM project (the second figure), or does it also include the front part (HRNet + OCR) that is simply not drawn? In your paper (the first figure) I only see the Encoder and Decoder, so is it only the encoder-decoder part, without the HRNet + OCR part?

Request test results

Dear authors, I am currently running a comparison experiment and need the test results of your model, but I noticed that the cloud-disk link for the 256×256 results on the iHarmony4 test set is broken. Could you update it? Thank you very much; I look forward to your reply!

About the memory cost described in the Introduction.

Hello there,

I noticed that the introduction of your paper states that iDIH costs more than 20 GB of memory when harmonizing a 2048×2048 image. However, in our test it seems to cost only about 2.5 GB. We conducted the test as follows:

        with torch.no_grad():
            # Dummy 2048x2048 composite image and mask as network inputs.
            input_tmp = torch.randn(1, 3, 2048, 2048).cuda()
            mask_tmp = torch.randn(1, 1, 2048, 2048).cuda()
            # Memory already allocated before the forward pass, in MB.
            start = torch.cuda.memory_allocated() / 1024 / 1024
            self.output = self.net(input_tmp, mask_tmp)
            # Peak memory allocated so far, in MB.
            end_max = torch.cuda.max_memory_allocated() / 1024 / 1024

            print("Max_memory:", (end_max - start))

Is there anything wrong with the above? I also found that if I enable gradients, the memory cost is about 20 GB. So should I run the test without the line "with torch.no_grad():"?
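
For what it's worth, a minimal sketch comparing the two settings (the helper name and the generic net(image, mask) call signature are assumptions, not this repository's evaluation code):

    import torch

    def peak_memory_mb(net, size=2048, use_no_grad=True):
        # Reset the peak counter so the measurement only covers this forward pass.
        torch.cuda.reset_peak_memory_stats()
        image = torch.randn(1, 3, size, size).cuda()
        mask = torch.randn(1, 1, size, size).cuda()
        if use_no_grad:
            with torch.no_grad():  # inference: activations are not kept for backward
                net(image, mask)
        else:
            net(image, mask)       # gradients enabled: activations are stored, so memory grows
        return torch.cuda.max_memory_allocated() / 1024 / 1024

Under this kind of measurement, the no_grad number reflects pure inference memory, while the gradient-enabled number is closer to a training step, which may explain the gap between roughly 2.5 GB and 20 GB.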

Looking forward to your reply, many thanks.

Where to enter the commands

Where should the commands given in the README be entered? I mean the python3 ... commands; should they be typed in Git Bash?

About the training settings

During training, what were the LUT-related hyperparameter settings and the number of epochs, and roughly how long did training take? Also, for the Light-weighted Refinement module, I implemented a simple two-layer convolution, but at test time the images no longer look like normal photographs: the content is unchanged, but the colors are distorted. Could you share more details about this module? Thanks.
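
For context, a rough sketch of one possible lightweight refinement head with attention-based blending, in the spirit of the output formula quoted in the first issue above; the class name, channel counts, and layer layout are assumptions, not the authors' actual module:

    import torch
    import torch.nn as nn

    class LightRefineHead(nn.Module):
        # Hypothetical two-convolution refinement head: predicts harmonized colors and an
        # attention map, then blends the predicted colors with the input composite.
        def __init__(self, in_channels=32, mid_channels=32):
            super().__init__()
            self.conv_1 = nn.Sequential(nn.Conv2d(in_channels, mid_channels, 3, padding=1), nn.ReLU(inplace=True))
            self.conv_2 = nn.Sequential(nn.Conv2d(mid_channels, mid_channels, 3, padding=1), nn.ReLU(inplace=True))
            self.to_rgb = nn.Conv2d(mid_channels, 3, 1)                                     # predicted colors
            self.to_attention = nn.Sequential(nn.Conv2d(mid_channels, 1, 1), nn.Sigmoid())  # blend weights in [0, 1]

        def forward(self, features, image):
            x = self.conv_2(self.conv_1(features))
            attention_map = self.to_attention(x)
            # Keep the original pixels where attention is high; use predicted colors elsewhere.
            return attention_map * image + (1.0 - attention_map) * self.to_rgb(x)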

There is no pre-trained model on 1024×1024 HAdobe5k

Or should I also use the model HAdobe5k_2048.pth? I want to look at data such as running time and memory cost, and I think it should be measured on the same device to make a fair comparison. Is that right?

The download link for the results seems to be broken

Hi,

This is nice work. I would like to use your results for visual comparisons, but the current download link seems to be broken. Would you mind re-sharing the download links for the test results on both HAdobe5K and the 100 real composite images?
