Giter Club home page Giter Club logo

data-science-projects's Introduction

Data Science Projects

Motivated by 24 Ultimate Data Science Projects To Boost Your Knowledge and Skills (& can be accessed freely)

Built in

Libraries: scikit learn, pandas, seaborn, keras

Classification

  1. Iris Species

Dataset: 3 Iris species with 50 samples each and 4 properties, i.e., Sepal Length, Sepal Width, Petal Lenth, Petal Width (in cm).

Problem: classify 3 Iris species

Classification techniques: Logistic Regression, Naive Bayes, kNN, SVM, Decision Tree, Boosted Tree, Random Forest, MLP

  1. Loan Prediction

Dataset: custumers' details such as Gender, Marital Status, Education, Number of Dependents, Income, Loan Amount, Credit History and others.

Problem: predict loan eligibility (Y/N)

Classification techniques: Linear SVC, SVC, kNN, Random Forest

  1. Wine Quality

Dataset

Problem:

Classification techniques:

Regression

  1. Bigmart Sales

Dataset: 2013 sales data for 1559 products across 10 stores in different cities with certain attributes of each product and store such as weight, maximum retail price, size of store and so on.

Problem: predict sales

Regression techniques: Linear Regression, Neural Network

  1. Boston House Price

Dataset: 506 cases with 14 attributes for each.

Problem: predict NOX (nitrous oxide level) and MEDV (median value of a home price)

Regression techniques: Linear Regression, ElasticNetCV, LassoCV

  1. Time Series

Dataset:

Problem:

  1. Height Weight Prediction

Dataset: Height and weight records of 5000 men and 5000 women.

Problem: Predict Weight/Height

Regression techniques: Linear Regression

Clustering

  1. Turkiye Student Evaluation

Dataset: 5820 evaluation scores provided by students from Gazi University in Ankara (Turkey), total of 28 course specific questions and additional 5 attributes.

Problem: Cluster

Clustering techniques: KMeans, MeanShift, BayesianGaussianMixture, AgglomerativeClustering

Dimensionality reduction

Materials

Every notebook attachs helpful reading-materials. Here are some general ones:

  1. Choosing models

scikit-learn algorithm cheat-sheet

  1. Choosing features

data-science-projects's People

Contributors

tracy2811 avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.