Giter Club home page Giter Club logo

dbscan-pycompss's Introduction

DBSCAN 4 PyCOMPSs

1. Introduction

DBSCAN for PyCOMPSs is a distributed approach to the well known clustering algorithm proposed for the first time in 1996 here. The application is implemented in Python and its parallelisation is performed by the COMPSs framework.

2. Files

In this repository you will find the following files and directories:

  • DBSCAN.py contains the main algorithm and task invokation. It requires however classes included in the /classes/ folder.
  • /classes/ contains two modules imported by DBSCAN.py
    • One of them is a custom-built data class.
    • The second one is a disjoint-set data structure (merge-find set) found here.
  • run.shshell scripts to run the algorithm both in localhost and in a cluster with COMPSs installed.
  • launchDBSCAN.py script to run a batch of executions using launch.sh as launcher.
  • launch.sh launcher for a slurm based cluster.
  • Gen_Data_DBSCAN.py python script to generate randomly shaped clustering datasets as the ones in /data/.
  • /data/bunch of datasets to test the algorithm in.
  • /ext_versions/ contains other DBSCAN implementations that might be useful for benchmarking.
    • DBSCAN_Seq.pySequential naive (all vs all) implementation of the algorithm.
  • /kmeans/contains an implementation of the k-means algorithm, in PyCOMPSs, used as well for benchmarking.
  • script_times.py post-processing script to gather times from a big batch of executions.

3. Requirements

  1. Python 2.7.x (with NumPy) COMPSs won't work with Python 3.x
  2. COMPSs Latest, if you are trying to install it this manual might be useful.
  3. Pandas 0.21 (this is the one I use, older versions may work as well but they need to support callables as arguments to the pd.skip_rows method.

4. Inquires and Contact

For any inquires or problems when trying to run the algorithm or COMPSs itself, don't hesitate to contact me at: a=carlos.segarra b=bsc.es mailto: a @ b

dbscan-pycompss's People

Contributors

csegarragonz avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.