Comments (10)
Sorry for the confusion. Actually, a more accurate description would be inside/outside likelihood, which measures the likelihood of a pixel that is inside/outside of the category-specific foreground mask in a region of interest. There is an additional ``null" category for normalizing with the object categories.
from fcis.
I've got a similar question. How do you produce final class-specific mask from these inside/outside maps? Do you explicitly subtract the inside and outside values and then apply sigmoid for every cell?
You mention " For mask prediction, a softmax operation produces the per-pixel foreground probability ([0 ;1])." and "For a positive ROI, for each (inside, outside) score pair, one should be high and the other should be low, depending whether the corresponding pixel is inside or outside the object boundary", but I failed to understand how exactly this conversion is done.
Probably I am missing something very obvious.
from fcis.
Hi, basically, a softmax is applied on the 2-d inside/outside maps of the corresponding ground-truth category, to produce the instance mask.
from fcis.
Please let me rephrase the question. If I get the report well, we have a tensor of shape 2 * k * k * (C + 1)
, i.e. for a given category it's of shape 2 * k * k
and we need to regress a single k * k
one (the foreground mask), correct?
In your response I don't understand how you merge two softmax'd inside and outside maps into a single mask. Sorry for re-asking.
from fcis.
The 2-d score maps are pre softmax. They are fed into a softmax layer to produce 1-d foreground probability.
from fcis.
And then you apply softmax for every spatial location separately, right?
(I am asking this because from my understanding, the softmax layer SoftMax(X_i) = exp(X_i) / [sum_j exp(X_j)] does not change the tensor dimensions)
from fcis.
I think @daijifeng001 says that apply softmax over inside and outside map and calculate the likelihood of inside:
softmax(F_ij) = exp(F_ij) / (exp(F_ij) + exp(B_ij)),
where F_ij denotes a pixel at i,j on Foregroud(Inside) Map, and B for Backgroud(Outside) Map.
In this case, for each pixel on both maps you get a single value so 2*k*k
reduced to 1*k*k
from fcis.
We have updated the description in our Arxiv paper https://arxiv.org/abs/1611.07709
And we have just released the code. We sincerely apologize for the delay. This is due to switching from our internal Caffe version to the public MXNet, which provides good support of fast multi-GPU training & inference.
Enjoy! It is worth noting that:
-FCIS provides a simple, fast and accurate framework for instance segmentation.
-Different from MNC, FCIS performs instance mask estimation and categorization jointly and simultaneously and estimates class-specific masks.
-We did not exploit the various techniques & tricks in the Mask RCNN system, like increasing RPN anchor numbers (from 12 to 15), enlarging the image (shorter side from 600 to 800 pixels), utilizing FPN features and aligned ROI pooling. These techniques & tricks should be orthogonal to our simple baseline.
from fcis.
Thanks! Sorry for asking too many questions by email!
from fcis.
@vadimkantorov Thanks a lot for your interest!
from fcis.
Related Issues (20)
- TypeError: _update_params_on_kvstore() takes exactly 4 arguments (3 given) HOT 1
- there is an error when trained on ResNet-50 model HOT 1
- Index out of bounds error in line 143 of proposal_annotator.py HOT 1
- Some problems encountered when using deconvolution
- mxnet.base.MXNetError: [15:20:32] src/engine/threaded_engine.cc:320: Check failed: device_count_ > 0 (-1 vs. 0) GPU usage requires at least 1 GPU HOT 1
- The result finetune with cityscapes is very poor
- Network structure
- When I train my own coco-like datasets,I successfully train and test the model,but the result of all AP is 0. HOT 5
- If my data is small,such as 6000 images,is it difficult to train the model? HOT 3
- my own data set ,test map is 0
- 有哪位遇到过训练成功,测试map=0的情况吗 HOT 3
- mxnet.base.MXNetError: [16:31:18] src/operator/nn/./cudnn/cudnn_convolution-inl.h:449: Check failed: e == CUDNN_STATUS_SUCCESS (3 vs. 0) cuDNN: CUDNN_STATUS_BAD_PARAM HOT 1
- pip install -r requirements.txt HOT 3
- unable to install ./init.sh error HOT 3
- Problems when launching demo.py
- Is it possible to run FCIS on AMD?
- Trouble building on Windows 10 (TypeError: expected str, bytes or os.PathLike object, not NoneType) HOT 2
- In-depth guide to getting FCIS working on Windows 10 HOT 3
- [HELP] problem with cpu_nms HOT 1
- Result file for COCO test-dev instance segmentation
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from fcis.