Giter Club home page Giter Club logo

cyberbotnet3's Introduction

Cyber Botnet assignment 3 ๐Ÿ‘บ

Code for Group 66 python implementation of Cyber Data Analytics assigment 3 CS4035. ๐Ÿ”’

Team members:

Project structure ๐Ÿ“‚

The structure of the project is presented per task:

Sampling

  • reservoir_sampling.py - implementation for reservoir sampling with testing for multiple reservoir sizes

Sketching

  • countminsketch.py - contains the CountMinSketch class.
  • CountMinSketch.ipynb - the actual analysis and plots.

Flow data discretization

  • Flow visualization.ipynb - notebook for visualizing different features for the infected host
  • flow_visualization_utils.py - helper functions for generating the plots from notebook
  • flow_discretize.py - implementation of the discretization of flags and bytes followed by combining into a single discrete feature

Botnet profiling

  • Profiling.ipynb - contains the full analysis. note that it takes some time and ram to run

Flow classification

  • flow_classification.py - train and test Random Forest classifier for identifying a netflow probability of being a botnet

Bonus

  • bonus.py - implementation of the generation method for adversarial data

โ— The actual testing using adversarial data is in the files corresponding to profiling and classification tasks

Others

  • logger.py - logging system for generating folders initial structure and saving application logs to HTML files
  • utils.py - helper functions used for multiple tasks
  • config.txt - configuration file

Additional folders

  • data\ - for storing data files with BATADAL datasets
  • output\ - for storing plots at high resolution (Better to be inspected if the ones from the report are too small due to page limit)
  • logs\ - for storing a couple of logs files referred in the report

Data ๐Ÿ’พ

If unable to clone the repository, download the CTU-13 dataset as follows:

  • for Task 1+2 Scenario 6 file capture20110816.pcap.netflow.labeled
  • for the other tasks Scenario 10 file capture20110818.pcap.netflow.labeled

โ— After downloading the files, place them into the data\ folder

Instructions for cloning ๐Ÿ“

The data files were uploaded using Git LFS being over 100MB. Git LFS is needed to clone the repository. Install it manually or try to use downlopad_data_files.sh.

Installation ๐Ÿ’ป

The scripts can be run in Anaconda Windows/Linux environment.

You need to create an Anaconda ๐Ÿ python 3.6 environment named cyber3. Inside that environment some addition packages needs to be installed. Run the following commands inside Anaconda Prompt โŒจ:

(base) conda create -n cyber3 python=3.6 anaconda
(base) conda activate cyber3
(cyber3) conda install -c conda-forge tqdm 
(cyber3) conda install -c conda-forge mmh3

cyberbotnet3's People

Contributors

simionandrei avatar mihai1voicescu avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.