Comments (6)
Hello.. candidatesv2 are also negative examples. (there are around 400.000 negatives there)
Basically that is the most important source of negatives. The edge examples only let the network know that non-lung-tissue is also not a lung nodule.
Another (small) source of negatives are the false positives that were predicted after one round of training on LUNA16.
The networks learns 2 things at once
1: Lung nodule y/n.. (non lung tissue should always be n)
2: Malignancy (0 if not a lung nodule, 0.1-25 if lung nodule).
I train/predict 32x32x32 cubes. The prediction is nodule Y/N.
If Yes then I also look at the malignancy..
Malignancy is the only thing I work with for the final prediction.
I hope that makes things clearer.
It's quite a complex solution with all the different label sources.
from kaggle_ndsb2017.
Quite clear and really a complicated and refined work..
But the question is where is candidate v2 from?
In step1_preprocess_luna16.py, seems that you generate your negative samples from two files: lidc.xml and annotation_excluded.csv(it's candidate.csv?).
So, where they are from?
If I don't have such files in my case, I should cut lung-tissue cubes randomly (does not contain a nodule) manually as negative samples?
from kaggle_ndsb2017.
In the resources folder there is a link to "resources.rar" in the readme.md.
This file contains all the data you need and even more.
In the resources.rar there is a folder "luna16_annotations".
In that folder there is candidatesv2.csv.
This file is directly taken from the LUNA16 competition.
Look here for more:
LUNA16 data
from kaggle_ndsb2017.
Got it.
I am in Tianchi (a competition held by Alibaba, China). In my case, only nodules information were given.
Seems that I need train a 3d unet to generate false positive samples firstly.
from kaggle_ndsb2017.
Hi I looked at the competition..
My chinese is not too good :S
I do think this approach can be translated to that competition since the #6 team of the datascience bowl is #1 now at your competition.
Good luck!
from kaggle_ndsb2017.
Tianchi's english version lacks important information-_-||
4th place in kaggle is now the first place in Tianchi.
So we need to do more.
Thanks.
from kaggle_ndsb2017.
Related Issues (20)
- settings.MANUAL_ANNOTATIONS_LABELS_DIR and pos_labels_dir = settings.LUNA_NODULE_LABELS_DIR HOT 2
- Accuracy calculation HOT 3
- why do you select "sigmoid" and "None" rather than "Relu" or "LeakyReLU" in last layer of CNN? HOT 3
- Getting confused about where to keep the dataset HOT 10
- Cam we get the CT viewer? HOT 7
- Kaggle dataset not available. Can we skip the ndsb dataset? HOT 3
- Where is NDSB data ? HOT 1
- One question about helpers.py HOT 2
- Testing Data HOT 1
- directory missing HOT 1
- about the step2_train_noudle_detector HOT 1
- Can I get the ensemble model? HOT 1
- Train model takes too many hours, how to optimize it. HOT 1
- test data HOT 1
- some questions about evaluating the canditate nodules of the luna16_nodule_predictions HOT 3
- some questions about the evaluation of the malignancy of the nodules HOT 1
- the transformation of nodule coordinate HOT 2
- Some question about the label preprocess HOT 1
- imformation about malignancy HOT 1
- error with the code
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from kaggle_ndsb2017.