Giter Club home page Giter Club logo

kaggle-titanic's Introduction

Introduction

My solution to the Kaggle Titanic competition. Achieving accuracy score of 78% (0.77512). Note: running the code may last hours. It took around 2 hours of execution time on an early 2014 MacBook Pro 2.3Ghz 8 core machine.

Instructions

$ python main.py

Dependencies

python 2.7.9

pandas 0.15.2

sklearn 0.15.2

matplotlib 1.4.2

Project Files

README.md This readme file describing the project.

test.csv Testing dataset.

train.csv Training dataset.

main.py Starting execution point of this project.

age.py Script handeling the age feature.

embarked.py Script handeling the embarked feature.

importance.py Script handeling extracting feature importances.

interaction_features.py Script handeling the creation of interaction features.

name.py Script handeling the name feature.

scale.py Script handeling the scaling of features.

sex.py Script handeling the sex feature.

featrue_importances.png Figure demonstrating feature importances.

learning_curves.png Figure demonstrating the training and testing learning accuracy curves.

roc_curve.png Figure demonstrating the ROC curve.

optimize.py Script handeling the optimization of the classifier's hyperparameters.

learning_curves.py Script handeling the creation and ploting of learning curves.

predict.py Script handeling the survival prediction of the testing set.

result.csv The output prediction set.

best_params.csv Intermediate result of the optimal hyperparameters.

load.py Script handeling the loading of training and testing datasets.

preprocess.py Script handeling the preprocessing and cleaning of datasets.

kaggle-titanic's People

Contributors

moeabdol avatar shekharkanodia avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.