Giter Club home page Giter Club logo

kaggle-histopathologic-cancer-detection's Introduction

Kaggle Histopathologic Cancer Detection Competition

Code used for competition submission. This solution got me in the top 14% on the private leaderboard (out of 1157 competitors). Unfortunately there was a data leak near the end of the competition. By exploiting the leak, some participants were able to get a perfect score.

Overview

The goal is to detect the presence of metastatic cancer in image patches taken from pathology scans (i.e. binary classification task).

Approach

I tried different CNN architectures and trained with weights initialized based on ImageNet. The main steps leading to performance gain were:

  • data augmentation (using the imgaug library)
  • test time augmentation
  • ensembling

Training took about 4-6 hours (depending on CNN architecture) on a P4000 GPU for 12 epochs. Increasing the number of epochs might lead to performance improvements (although I saw signs of overfitting after about 9-10 epochs already).

The final ensemble yielded a 0.9696 AUC on the private leaderboard.

Dependencies

  • tensorflow 1.5.0
  • keras 2.2.4
  • opencv-python
  • imgaug
  • sklearn
  • pandas

kaggle-histopathologic-cancer-detection's People

Contributors

demonictoaster avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.