Giter Club home page Giter Club logo

3dfasterrcnn's Introduction

This is the code for our TMI paper "Fast and Accurate Craniomaxillofacial Landmark Detection via 3D Faster R-CNN". Only coarse stage code is thoroughly tested. The stage 2 code was released in Nov 2023, but there might be issues when using it. For more details, please check out

https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=9494574

Our code was tested with Python 3.8, Tensorflow 2.5, CUDA 11.2 and cuDNN 8.1. To use the algorithm without recompiling, the same environment as ours is required to train/test models. Note that even slightly different environment from ours may lead to unexpected training/inference errors (make sure all dynamic libraries are loaded successfully!). If you want to use the code in a different environment, you MUST recompile the CUDA source codes in folders CropAndResize3D and non_max_suppression to obtain the new crop_and_resize_op_gpu.so and non_max_suppression_op.so! We have provided instructions in ReadMe.txt inside both CropAndResize3D and non_max_suppression for recompiling. After this, you should either change the paths in landmark_detection_group1_size24.py (lines 48 and 51) or replace the old ones in the current folder with the new ones (the latter is recommended).

The algorithm is designed to be general. That is, users can use any imaging modalities that are prefered in their applications. To use our code in a different application, an user should prepare the data (i.e., images and ground truth landmark locations) the same way as ours (see folders images_1.6 and coordinates_1.6; the coordinates are in the order of x, y and z) and save them in folders for images and ground truth respectively. Note that users should carefully choose the down-sampling rate, as a too large image spacing may lose a lot of image details that are essential for accurate landmark localization. Depending on the sizes of your data, users should choose the smallest down-sampling rate possible. Please also note that the default hyper-parameters should give some reasonable predictions but they are not guarantteed to be optimal in different datasets and applications. We encourage the users to tune the parameters when using this code.

Training command:

python landmark_detection_group1_size24.py

Inference command (this is only an example; specific path to model varies):

python landmark_detection_group1_size24.py --mode=inference --weight-path=./models/3d_faster_rcnn20210810T1758/3d_faster_rcnn.18.h5

If you find our work useful, please consider citing our paper:

@article{chen2021fast,
title={Fast and Accurate Craniomaxillofacial Landmark Detection via 3D Faster R-CNN},
author={Chen, Xiaoyang and Lian, Chunfeng and Deng, Hannah H and Kuang, Tianshu and Lin, Hung-Ying and Xiao, Deqiang and Gateno, Jaime and Shen, Dinggang and Xia, James J and Yap, Pew-Thian},
journal={IEEE Transactions on Medical Imaging},
year={2021},
publisher={IEEE}
}

3dfasterrcnn's People

Contributors

xychen2022 avatar xychenunc avatar

Stargazers

 avatar  avatar  avatar  avatar Shine avatar

Watchers

 avatar

3dfasterrcnn's Issues

Problem in training process

Hi, I'm tring to reproduct your work code after recompiling the NMS and 3D Crop files with CUDA =11.3, tf = 2.5.0. Python = 3.8. However I still getting the error messages. Could you help with this problem, thanks a lot!
image
image
image
image

Problems with Training

Hi, I got a problem when I start training, here is the error message:
image
Im using Python3.8, tensorflow2.5 with cuda 11.8(Changing the version doesnt seem possible since im using a public server). Could you tell me how this error message may come from?

Ask for code

Sorry to disturb you. I submit a paper regarding landmark localization on medical images, one of the reviewers asked me to compare ours with your method. Could you send me the whole code to my email ([email protected]). I have to contact you in this way since I can't find your email in your paper. I'm looking forward to your reply, thanks.

Require full code

Thank you for your great work. We are very interested in your project.
Our current project is related to medica image landmark prdiction. We want to compare different landmark detection method. However, the code seems only contain the coarse stage as you described, which may lead to an unfair comparison.
I'd appreciate it if you could share your fine stage code. This is my email: [email protected].
Looking forward to your reply.

Request for full code

Hi, there!
Nice work! I'm working on landmark detection on 3D CT image, and your work really helps me a lot. However, I found only coarse stage code here, could you plz provide the full code? my email: [email protected].
Many thanks! :D

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.