Convolutional Neural Network (CNN) for Brain Tumor Detection & Location Classification from MRI images via OpenCV & PyTorch

Significance

According to research study conducted in 2015, based on 255 uninterrupted eight-hour workdays per year, radiologists are needing to review one image every three to four seconds to meet workload demands. However, the overall workload per radiologist is still increasing as MRI and CT scans are becoming more prevalent and the use of imaging contrasts/agents are increasing the complexity of information potentially gleaned from these scans. As a result, there are increasing cases of burnout amongst diagnostic radiologists. The aim of this project is to develop a deep learning approach that radiologists can potentially rely upon for to reduce their workloads. During this process, I hope to establish expertise in using PyTorch to develop deep learning models in a clean, and efficient manner with the hopes of being able to apply these skills to other similarly framed problems in the future.

Overall Dataset Characteristics

3 tumor-associated classes (Glioma, Pituitary, Meningioma) of mostly 512 x 512 sized images
Non-tumor class has randomly sized images
Scans are taken at different orientations: Axial, Coronal, and Sagittal

Training Dataset & Class Breakdown

Composed of 5712 Images -> Split into ⅔ Training & ⅓ Validation

Glioma	Meningioma	Pituitary	No Tumor
1321	1339	1457	1595

It is clear that there is a fairly even distribution amongst the respective classes, alleviating class imbalance concerns. Otherwise, I would need to incorporate techniques such as minority class penalties or a weighted random sampler to curate batches with evenly distributed classes.

Test Dataset & Class Breakdown

Composed of 1311 images

Glioma	Meningioma	Pituitary	No Tumor
300	306	300	405

Model Objective

The goal is to accurately classify an MRI Brain image into one of these 4 classes, with the tumor classes varying based on brain location.

Data Processing Workflow

Converting Image to Grayscale (Eliminates 2 image channels which were unnecessary & simplifies processing)

2. Apply a binary mask to separate brain from background and aid identification of Region of Interest (ROI) where contour is drawn

Use coordinates of drawn contour to crop to ROI

Above figure provides cropped ROI image on the left and the original image on the right

Standardize images to fixed size of (256, 256) for training convolutional neural network

On the left: final standardized, processed CNN input & On the right: original image if simply resized to (256,256)

Image processing resulted in heavy elimination of background, amplifying area occupied by ROI in (256,256) frame

Hyperparameter Optimization Results

Optimizing number of layers, filters per layer, kernel sizes, & number of training epochs, batch size, learning rate, L2 Regularization

This was not a true gridsearch as it is typically a pseudo outer product to yield all combinations of all realistic values chosen for each of the tunable hyperparameters. Due to time constraints, some of the hyperparameters (batch size, learning rate, L2 regularization) were established over fewer tests compared to other hyperparameters. However, further optimization of those hyperparameters would not yield as much of a performance gain in comparison to optimizing the other hyperparameters. Also, this is a sample of the entire hyperparameter optimization testing results, but highlights the final chosen values for the tunable hyperparameters.

Convolutional Neural Network Architecture

Used Batch Normalization & ReLU as the Activation Function throughout the Convolution Layers

Classification Metrics on Test Dataset

Overall Accuracy: 90.5%

It is clear that the model is great at classifying the no tumor and pituitary class. However, it is clear the decision boundary the model is using to distinguish between the meningioma and glioma is not as robust as the decision boundaries used between the other classes. This is reflected in the precision and recall for the meningioma and glioma classes, but clearly evident from the confusion matrix revealing an overall 60 misclassifications where the meningioma image is actually a glioma image and vice versa.

Future Steps

Leverage PyTorch Dataset Class & DataLoaders to increase code efficiency, enable adaptability in data processing, and enable CPU parallelization to expedite batch generation and overall training process
Try a transformer-based approach and see how a more advanced architecture compares to current model generated from scratch

krajshuffle / mri_brain_cnn_classification Goto Github PK

mri_brain_cnn_classification's Introduction

Convolutional Neural Network (CNN) for Brain Tumor Detection & Location Classification from MRI images via OpenCV & PyTorch

Significance

Overall Dataset Characteristics

Training Dataset & Class Breakdown

Test Dataset & Class Breakdown

Model Objective

Data Processing Workflow

Hyperparameter Optimization Results

Convolutional Neural Network Architecture

Classification Metrics on Test Dataset

Future Steps

mri_brain_cnn_classification's People

Contributors

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent