shangbuhuan13 / so-pose Goto Github PK

View Code? Open in Web Editor NEW

66.0 66.0 10.0 39.72 MB

This repository contains codes of ICCV2021 paper: SO-Pose: Exploiting Self-Occlusion for Direct 6D Pose Estimation

License: Apache License 2.0

Python 98.64% Shell 0.18% C 0.97% C++ 0.21%

so-pose's People

Contributors

Stargazers

Watchers

Forkers

thu-da-6d-pose-group wx-b mfkiwl ericzhengyx codest4ck pamyuu zengletian1491 lyltc1 larsoncs

so-pose's Issues

LINEMOD results

Hi @shangbuhuan13 , I have seen HERE that you have uploaded the metrics for LMO and YCB-V. Can you also upload the results for LM?

Best,
D

Hi! Thanks for opensourcing the code! I cannot download models by the link: https://drive.google.com/file/d/136ExcMykxsVVSzOiGQVYspq1fx9Hjd6R/view?usp=sharing (error is: "Sorry, the file you have requested does not exist."). Does it work?

缺失lib.egl_renderer

您好，请问作者可以再提供一次lib.egl_renderer吗，其他以前的网盘都过期了，十分感谢。
https://github.com/shangbuhuan13/SO-Pose/blob/b1cfa9c20bfbb0b4ccd1e8c421e392958628aa4f/core/gdrn_selfocc_modeling/tools/lmo/lmo_2_vis_poses.py#:~:text=from%20lib.egl_renderer.egl_renderer_v3%20import%20EGLRenderer

配置文件中`TRAIN2=("lmo_pbr_train")`是否必要

您好!
我准备使用gdrn_selfocc_multistep_40E.py配置文件复现实验，但是我发现在生成xyz后lm/train_pbr/xyz_crop已经很大，并且准备生成train_pbr/Q0,但是这个我估计要3T的存储空间，所以想问下配置文件中TRAIN2=("lmo_pbr_train",),的这个是否必要?

linemod results with resnet50 or resnet34

Hi, I have a small question about the results on linemod dataset.

I see that in the paper, the backbone should be ResNet34, however, in the codebase, it seems like ResNet50 (

SO-Pose/configs/gdrn_selfocc/lm/gdrn_2rothead_multistep_02CT.py

Line 53 in a3a61d2

PRETRAINED="mmcls://resnet50_v1d",

I also run the linemod experiments with exact the config file in the repo, getting result ADI.10 ~95.5 with ResNet50. So I would like to confirm with you what is the Backbone used in linemod results?

question about backbone in experiment configs for LM dataset

Hi,

Thanks for your great work! I have a question about the backbone in LM datasets. In your paper, you said that "As backbone we leverage ResNet34 [6] for all experiments on the LM dataset", but the config files in this repo seems different:

       BACKBONE=dict(
            FREEZE=False,
            PRETRAINED="mmcls://resnet50_v1d",
            INIT_CFG=dict(
                _delete_=True,
                type="mm/ResNetV1d",
                depth=50,
                in_channels=3,
                out_indices=(3,),
            ),
        ),

and the output feature dimension is 2048, not 512 as in GDR-Net.

Could you help to clarify this? Thanks!

生成xyz_crop时碰到的一些问题

您好，我之前在研究gdrnet的相关工作，但一直无法成功运行生成xyz_crop的程序。而我在您开源的这份代码的readme中看到您提及generate p.py对应的是2d-3d matching的groundtruth，这里生成的是否就是gdrnet中需要的xyz_crop呢。如果是的话，我是应该运行generate_pbr_P.py还是generate_pbr_P_fast.py呢？生成的xyz_crop和gdrnet中要求的是否一致呢？

关于configs/gdrn_selfocc/lm参数设置

在模型文件GDRN.py中用到了"Q0_DEF_LW"和"HANDLE_SYM"这两个参数，但在configs/gdrn_selfocc/lm 文件夹中的配置文件没有设置这两个参数，请问这两个参数应该设置什么数值？

请问在训练过程中，cpu无法跑满怎么办。

如图所示，cpu无法跑满，导致速度很慢，请问这个有解决办法吗。不知道是不是因为你们基于detectron2的问题还是别的。
感谢解答。
我跑了很多天了，设置多个num_workers后，cpu无法跑满，导致我的速度非常慢，这个怎么办呢。是和你们依托detectron有关吗，还是你别的原因，我看你们在代码中，好像是也有遇到相关问题吗

question about implementation of 2D cross layer consistency

Hi,

Thanks again for your great work. I have a question about the implementation of 2D consistency loss

SO-Pose/core/gdrn_selfocc_modeling/losses/crosstask_projection_loss.py

Line 158 in a3a61d2

loss = ((loss_x + loss_y + loss_z) / (z_mask_sum.sum())) / 572.3 # depends on K

I am confused why the loss is divide by 572.3. In datasets/BOP_DATASETS/lm/camera.json I see the camera information is

{
  "cx": 325.2611,
  "cy": 242.04899,
  "depth_scale": 1.0,
  "fx": 572.4114,
  "fy": 573.57043,
  "height": 480,
  "width": 640
}

Also, will this impact YCBV dataset, since it has different camera intrinsic parameters? Thanks!

Since we need ground truth 2D-3D matching and self-occlusion results, we provide generation methods in .gdrn_selfocc_modeling/tools. Please refer to generate_*.py.

您好！
您的工作真的太棒了。
其次想问下您，“ Please refer to generate_*.py.” 真实的自遮挡坐标需要我们运行什么指令才可以生成吗？还是说在训练的过程中，自己就会生成呢，

请问在可视化结果时可以提供一下你们的lib.egl_renderer吗，感谢。

如题目所示，请问作者可否提供下可视化结果时用到的lib.egl_renderer，无比感谢。
https://github.com/shangbuhuan13/SO-Pose/blob/b1cfa9c20bfbb0b4ccd1e8c421e392958628aa4f/core/gdrn_selfocc_modeling/tools/lmo/lmo_2_vis_poses.py#:~:text=from%20lib.egl_renderer.egl_renderer_v3%20import%20EGLRenderer