Giter Club home page Giter Club logo

arshadkhurshid / capstone-project-udacity Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 4.94 MB

A company invest lot on employee to train them and make them ready for next generation business. Once you invest in skill enhancement of an employee you need to use it for benefit of business. Employee may be agitated even if they are being paid well as human have aspiration and if aspiration is fulfilled then they perform to their maximum capability. I was looking for dataset which has features that can be reason for employee to quit and Kaggle gave me one. Retaining an employee means retaining knowledge and they are the one who grooms the people working in one level down thus helping to increase and knowledge base for all.

Home Page: https://www.kaggle.com/ludobenistant/hr-analytics

Jupyter Notebook 100.00%
supervised-learning adaboost gradient-boosting roc grid-search kfold-cross-validation

capstone-project-udacity's Introduction

Machine Learning Engineer Nanodegree

Supervised Learning

Project: Capstone Project

Install

This project requires Python 3.6 and the following Python libraries installed:

NumPy
Pandas
matplotlib
scikit-learn

You will also need to have software installed to run and execute an iPython Notebook

We recommend to install Anaconda, a pre-packaged Python distribution that contains all of the necessary libraries and software for this project.

Code

Code is provided in the CapstoneProject.ipynb notebook file. It uses the HR_comma_sep.csv dataset file. Code uses numpy, python, matplotlib and scikit learn libraries. Code uses classification technique of Supervised Learning to train the model. Code has implemented ADABoost Classifier, Gradient Descent Classifier, SVM and Stochastic gradient Descent to calculate performance based on the different methods of training using different techniques.

Run

In a terminal or command window, navigate to the top-level project directory capstone_project/ (that contains this README) and run one of the following commands:

ipython notebook CapstoneProject.ipynb

or

jupyter notebook CapstoneProject.ipynb

This will open the iPython Notebook software and project file in your browser.

Data

This data set contains total of 14999 rows and 10 columns. Target variable (left) is imbalance dataset. It contains 3751 records of employee who have left the company. 11428 records of employee who stayed in the company. Here we are trying to predict employee who can quit. Dataset has been extracted from Kaggle.

Dataset Link: https://www.kaggle.com/ludobenistant/hr-analytics

Features
  • satisfaction_level : Level of Satisfaction
  • last_evaluation :Time since Last performance Evaluation
  • number_project :Number of Project completed while at work
  • average_montly_hours : Average monthly hours at workplace
  • time_spend_company : Number of years spent in the company
  • Work_accident : Whether the employee had a workplace accident
  • promotion_last_5years: Wehter employee was promoted in last 5 years
  • sales : Department they work for
  • Salary : Relative level of Salary(high)
Target Variable
  • left : Whether employee left the workplace or not (0 - stayed and 1 - Left)

capstone-project-udacity's People

Contributors

arshadkhurshid avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.