Giter Club home page Giter Club logo

amld20-anomaly-detection's Introduction

AMLD20 - Anomaly Detection

This repository provides the resources for the talk and accompanying hands-on exercises on Anomaly Detection at the EPFL Extension School Workshop - Machine Learning and Data Visualization at the Applied Machine Learning Days 2020.

Slides for the workshop are available here.

Dataset

The data is based on the KDD-CUP 1999 challenge on network intrusion detection. A description of the original task can be found here. The data provided for this workshop has been adapted from the NSL-KDD version.

Anomaly detection

Anomaly detection can be treated as a supervised classification task. However this approach struggles when the portion of anomalies (here network attacks) is small. Instead we showcase an approach using Isolation Forests.

The user can select the size of training dataset and vary its contamination rate, including a dataset without any anomalies. The model is then trained on this dataset and used to predict anomalies on a separate test set and evaluate the performance.

Hands-on exercises

The simplest way to run the hands-on exercises with Google's Colab or Binder in the cloud and interacting with them through your browser. Alternatively, you can choose to take a look at the already executed notebook in the Offline View.

Open In Colab Binder Generic badge

Getting started:

If you are using Colab you need to execute the first cell. Otherwise you can skip this and start with loading settings and functions. If you want to execute a cell, make sure it is selected and then press SHIFT+ENTER or the 'Play' button.

amld20-anomaly-detection's People

Contributors

christianluebbe avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.