Comments (5)

DC1991 commented on September 17, 2024

Hi @gachiemchiep, thanks for your interest in the paper. The output of R is the coordinates of the 3D bounding box, which is a 24D vector in the paper, and the output of T is [x, y, z]. The labeling process is not available yet, but we use the implementation in this repository (https://github.com/thodan/bop_toolkit) to transfer the 3D object model to the scene.
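
A 24D vector is consistent with 8 box corners times 3 coordinates each. The following is a minimal sketch of unpacking such a vector for visualization, assuming the corners are stored consecutively as (x, y, z) triples. The storage order and the name `split_corners` are assumptions for illustration and are not confirmed in this thread.

```python
def split_corners(vec24):
    """Split a flat 24-element vector into 8 (x, y, z) corner tuples.

    Assumes the layout [x0, y0, z0, x1, y1, z1, ...]; check the actual
    network output layout before relying on this.
    """
    if len(vec24) != 24:
        raise ValueError("expected 24 values (8 corners x 3 coordinates)")
    return [tuple(vec24[i:i + 3]) for i in range(0, 24, 3)]


# Hypothetical stand-in for a network output: the corners of a unit cube,
# flattened into 24 numbers.
flat = [float(c)
        for x in (0, 1) for y in (0, 1) for z in (0, 1)
        for c in (x, y, z)]
corners = split_corners(flat)
```

Each entry of `corners` can then be drawn or projected individually.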

from g2l_net.

gachiemchiep commented on September 17, 2024

@DC1991
Thank you for your reply.
I understand the meaning of R. What do the [x, y, z] values of T mean?
I will look into bop_toolkit for more details on creating the training dataset.

DC1991 commented on September 17, 2024

@gachiemchiep Sorry for the unclear description. [x, y, z] are the 3D coordinates of T, which is the translation vector.
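
To make the role of a translation vector concrete: under the usual rigid-pose convention, a point p on the object model maps to the camera frame as R p + T, so T is where the model origin ends up in the camera frame. The sketch below assumes that convention, with R given as a 3x3 rotation matrix. The function name and the numeric values are illustrative and do not come from the repository.

```python
def transform_point(R, T, p):
    """Return R @ p + T for a 3x3 rotation R (list of rows), translation T, point p."""
    return tuple(
        sum(R[i][j] * p[j] for j in range(3)) + T[i]
        for i in range(3)
    )


# Illustrative values only: a 90-degree rotation about the z axis and a
# translation of 0.5 units along the camera's z axis.
R = [[0.0, -1.0, 0.0],
     [1.0,  0.0, 0.0],
     [0.0,  0.0, 1.0]]
T = (0.0, 0.0, 0.5)

# The model origin lands exactly at T in the camera frame.
origin_in_camera = transform_point(R, T, (0.0, 0.0, 0.0))
# A point 0.1 units along the model's x axis is rotated, then shifted by T.
corner_in_camera = transform_point(R, T, (0.1, 0.0, 0.0))
```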

gachiemchiep commented on September 17, 2024

@DC1991 Thank you for your explanation.
I'm trying to visualize the detection result.

Do the depth data and the RGB image share the same coordinate origin? If so, can [x, y, z] be understood as follows?

  1. (x, y) = coordinates of the detected point in image space
  2. z = the depth value

Sorry, I'm totally lost on the meaning of T (translation). Is T the translation between the image and the depth data?

DC1991 commented on September 17, 2024

@gachiemchiep We use the RGB image to locate the 2D bounding box of the object, and we transform the depth image into a point cloud using the known camera parameters. So [x, y, z] are the 3D coordinates of the points.
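
The depth-to-point-cloud step described above can be sketched with the standard pinhole camera model. In this model x and y are camera-frame coordinates in the same unit as the depth, not pixel positions; to draw a 3D point on the image it has to be projected back to pixels. The sketch assumes the depth image is already registered to the RGB image and ignores lens distortion. The function names, the `box` parameter, and the intrinsics (`fx`, `fy`, `cx`, `cy`) are hypothetical and not taken from the repository.

```python
def backproject(depth, fx, fy, cx, cy, box=None):
    """Convert a depth image into camera-frame 3D points (pinhole model).

    depth: list of rows of depth values; values <= 0 are treated as missing.
    box:   optional (u_min, v_min, u_max, v_max) 2D bounding box in pixels,
           with u_max and v_max exclusive; defaults to the whole image.
    """
    height, width = len(depth), len(depth[0])
    u0, v0, u1, v1 = box if box is not None else (0, 0, width, height)
    points = []
    for v in range(v0, v1):
        for u in range(u0, u1):
            z = depth[v][u]
            if z <= 0:
                continue  # skip pixels without a valid depth reading
            points.append(((u - cx) * z / fx, (v - cy) * z / fy, z))
    return points


def project(point, fx, fy, cx, cy):
    """Project a camera-frame 3D point back to pixel coordinates (u, v)."""
    x, y, z = point
    return (fx * x / z + cx, fy * y / z + cy)


# Tiny illustrative depth image and made-up intrinsics.
depth = [[1.0, 0.0],
         [2.0, 2.0]]
fx = fy = 2.0
cx = cy = 0.5

cloud = backproject(depth, fx, fy, cx, cy)
pixel = project(cloud[1], fx, fy, cx, cy)  # round trip back to the source pixel
```

For visualization, the same `project` function could be applied to a predicted translation or to each predicted box corner to obtain the pixel at which to draw it.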
