Giter Club home page Giter Club logo

Hi there 👋

datatalking's github stats GitHub Streak

Andrew Schell's Projects

code2flow icon code2flow

Turn your Python and Javascript code into DOT flowcharts

computer-science icon computer-science

:mortar_board: Path to a free self-taught education in Computer Science!

constants icon constants

multi-environments application constants python module

coverpage icon coverpage

What is new, what I've been doing, what you can look at.

crunchbasewrapper icon crunchbasewrapper

Scraper to get data from crunchbase.com and read - write the data using SQLite database and JSON file.

cudamat icon cudamat

Python module for performing basic dense linear algebra computations on the GPU using CUDA.

cudf icon cudf

cuDF - GPU DataFrame Library

customerchurn icon customerchurn

Every model did a job well done and best among them all was Logistic Regression that is because data is mostly lineraly seperable, that is why linear kernel of SVM performed better than radial basis funcion, RBF, kernel (or poly kernel, I gave it poly degree four, however, it could be increased to arbitrary number). An other very good predictor have always been Random Forest that is because though it is random, it takes data from many, many forests and finally it finds something that is good enough, it is quite fast as well, compared to svm with poly kernel (with a high degree of polynomial). Similarly KNN has been good at predicting for classificatoin, like this, where classes are seperable linearly, however, it can also perform well where data is clustered this is beauty of KNN but this was simple classification problem and it did a good job. However Naive Bay did worse than others for this problem, that is because it guesses things based on probability rather than finding a pattern in things. Lastly K means Clustering technique was performed. Though there is similarity between classification and clustering however, they are not quite the same thing, for example sometimes data can be mapped on graph through a circle and it may appear in a circular form, though it would still be single class (classification problem) but it would not be possible for a simple clustering technique to perform as good for this task. This is what we see happening in this example. K means clustering was totally robbed of its glory by the problem (dataset) because datapoints were not clustered. The dataset has more than 3 dimensions so it can't be plotted. However, I will try to use data exatraction technique (if you extend my research time and allow me a few more days to submit this assignment) and bring it to two or three features (if they could describe the variation in between predicted and true values. Your sincerely Ashar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.