Giter Club home page Giter Club logo

clustering_crypto's Introduction

Clustering_Crypto

Utilizes k-means clustering, the elbow method and principal component analysis to find significant clustering in crypto currency data before and after dimensional reduction


Technologies

Language: Python 3.9.12

Libraries used:

Pandas - For the creation and visualization of Data Frames

Jupyter Labs - An ipython kernel for interactive computing in python

PyViz hvPlot - A high level python library for interactive data visualization

SKLearn - Simple and effective python library for predictive data analysis


Installation Guide

If you are using an anaconda or a conda environment chances are pandas, hvplot and jupyter labs are already installed in your virtual environment. In addition, installing scikit learn will be necessary

For a full install activate a conda development environment and run in GitBash if not already installed:

    conda install pandas
    conda install jupyterlab
    conda install -c pyviz hvplot

To install the other dependencies not included in the anaconda environment run:

    pip install -U scikit-learn

Check the to make sure everything has been installed properly

    conda list pandas
    conda list hvplot
    conda list jupyter lab
    conda list scikit-learn

Usage

To run this jupyter lab notebook you will need to use GitBash and navigate to where you have exported the files associated with this project and activate your dev environment. Next, this project can be ran by navigating to the crypto_investments.ipynb jupyter notebook file and clicking the double arrow as seen below:

double_arrow

This will run the jupyter notebook and each cell has proper pseudocode directing the viewer as to which analyses are being shown. Additionally, the markdown cells provided also clue the reader in to what is going on.

There are a few interactive graphs along the way.


Highlights:

First the clustering is chosen using the elbow curve method which is used to train a model to cluster the full dataset

elbow

Plot of clusters from full dataset

full_data

Next, a PCA analysis is run to dimensionally reduce the crypto data and is plotted below

pca


Contributors

Created by Silvano Ross while in the UW FinTech Bootcamp

Contact Info: email: [email protected] | GitHub | LinkedIn


License

MIT

clustering_crypto's People

Contributors

silvanoross avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.