Giter Club home page Giter Club logo

data_science's Introduction

Data Science Project Fall 2014

Fall 2014 Data Science Course at Stony Brook with Prof. Steven Skiena. This was a completely project based course. My team and I worked on the the Ghool Pool Project.

Challenge:

Ghoul pool is a game of prediction which involves guessing when someone will die. Our challenge is to predict/model the risks of people dying . We are given 32 celebrities, from different professions and countries, at a high risk of dying, because of age or lifestyle. The goal is to predict the death (probability) for each of the personalities. Models that make such predictions are used heavily in the life insurance industry.

Various Machine learning regression models have been used.

You can find the complete report and all analysis on : http://www3.cs.stonybrook.edu/~skiena/591/final_projects/ghoul_pool/

data_science's People

Contributors

aashray avatar pereddy98 avatar varshapaidi avatar

Watchers

 avatar  avatar  avatar

Forkers

anujverma1710

data_science's Issues

Need to make Freebase scrapper better so that it does not get stuck

@pereddy98 Sravanthi, you can work on this. This is priority. I can help you as well. We need to report how much data we have and that is a section by itself in this report. So I think it is important to professor. Let's work on getting this write.

We both can own this.

Please give this priority. We need a large data set with many parameters also, both will important to show good progress in this section.

Comment here with any ideas or if you face any issues.

Structure new data correctly

Once, we have a decent chuck of the new (freebase) data. We need to decide on a file structure of it. Basically a schema of the the column we need in our fields, some columns will need to be derived from other columns, like the isAlive and age fields.

Life Tables Evaluation Video

Show your initial attempt at evaluation.
Show me reacting badly.
Explain why I am right and make clear with examples (maybe me explain).
Show how to do it right.
Present the results assuming only dead people.
Recalculate using live people too.
Which does better?

Halloween pumpkin video

Prof : Your group needs to make principled predictions on Halloween day, meaning that you have probability distributions which you trust for the death probabilities for all the figures. Film yourself on Halloween tight next some pumpkins or other halloweeen stuff making your predictions, going thrugh the top five highest probability figures. Then do it again making a count down from the top ten, ending on your highest probabilty figure.

Bubble chart video

Profs task description : The chart you made with death probability with circule colors that picked Mugabi is very good, but the discussion. Produce a video where you are panned very tight over a large clear version of the graph (presuambly on a good screen) and explain it from the start and what it means and why you pick Mugabi

Life Tables Explanation Video

Explain life tables. Describe what they are and where they came from.

Do a screen capture of your mouse explaining a web page -- show the life tables -- maybe look up Prof Skiena at 53 -- joke about me surviving the end of the course

Show WHO vs. SS tables and explain why one might be more appropriate than the other.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.