Giter Club home page Giter Club logo

dat_sf_18's Introduction

DAT_SF_18

Course materials for General Assembly's Data Science course in San Francisco (10/27/15 - 1/26/16).

Logistics

  • Dates: 10/27/15 - 1/26/16, Tuesday - Thursday 6:30-9:30
  • Holidays (no class): 11/26 (Thanksgiving), 12/21 - 1/1 (winter break)
  • Location: 225 Bush Street, Classroom 4
  • Instructor: Francesco Mosconi
  • Experts-in-Residence: Dylan Hercher, Otto Stegmaier

Course Description

Foundational course in data science, including machine learning theory, case studies and real-world examples, introduction to various modeling techniques, and other tools to make predictions and decisions about data. Students will gain practical computational experience by running machine learning algorithms and learning how to choose the best and most representative data models to make predictions. Students will be using Python throughout this course.

Required Setup

Completion Requirements

In order to receive a General Assembly Letter of Completion for Data Science, upon completion of the course, students must:

  • Complete and submit 80% of all course assignments (homework, homework reviews, labs, quizzes). Students who miss more than 20% of assignments will not be eligible for the course certificate.
  • Attend at least 80% of classes (miss no more than 4 classes)
  • Complete and subimt the course midterm test.
  • Complete and submit the course final project, completing all functional and technical requirements on the project rubric, including delivering a presentation.

Assignments, milestones and feedback throughout the course are designed to prepare students to deliver a quality course project.

Course Outline

The weekly schedules for lecture content, lab content, and homework assignments are subject to change according to the needs & preferences of the class.

Course Schedule

Week Tuesday Thursday
1 10/27: Introduction to Data Science, Git setup 10/29: Python & Linear Algebra review
2 11/03: Cleaning and imputing Data 11/05: Data Sources
3 11/10: Introduction to Machine Learning, Regression 11/12: Cross Validation and Naïve Bayes
4 11/17: Regression and Regularization 11/19 Logistic Regression
5 11/24: Imbalanced Classes and Evaluation Metrics 11/26: Thanksgiving -- No Class
6 12/01: Support Vector Machines 12/03: Decision Trees
7 12/08: Ensemble Techniques 12/10: Review of Supervised Learning
8 12/15: K-Means Clustering and Unsupervised learning 12/17: Dimensionality Reduction
9 01/05: Recommendation systems 01/07: Natural Language Processing and Text Mining
10 01/12: Database Technologies 01/14: Map Reduce
11 01/19: Data Products 01/21: Final project presentations
12 01/26: Final project presentations

Final Project Milestones

Final Project Milestones

FP milestone Deliverable Due
1 Elevator Pitch & Data Sources 12/10
2 Draft Analysis 01/05
3 Final Project Due 01/21

Office Hours

Instructor Times Available method
Dylan
Otto Tuesday 5:30pm -6:30pm Classroom 4 or slack
Francesco Tuesday & Thursday slack (quickest response) or hangouts by appointment

Slack

You've all been invited to use Slack for chat during class and the day. Please consider this the primary way to contact other students. Dylan will be in Slack during class to handle questions. All instructors will be available on Slack during office hours (listed above).

Resources

Working in the terminal

Statistical Learning Theory

Algorithms

Python

dat_sf_18's People

Contributors

ghego avatar ostegm avatar mrtial avatar dhercher avatar vanessaohta avatar

Watchers

James Cloos avatar

dat_sf_18's Issues

HW2 Review

Hi Charlie,

Your comments made your code very easy to follow. I like your project idea-- it was cool to see all the box plots of the different divisions next to each other in your figure, especially since you had to do some tedious data cleaning.

Derek

@ghego
@dhercher
@ostegm

HW 4 Review

Hi! I liked the way you predicted when the projectile would hit the ground, I had to solve the quadratic equation but it seemed like there would be a cleaner way. I also thought your looping through all of the polynomials for questions 3 was a smart idea. I had not heard of StratifiedShuffleSplit but seems like a really good idea for dealing with imbalanced classes. Your comments were also very helpful and comprehensive.

HW 1 Review

Hey Charlie!

Your readme file is missing your personal/professional interests.

Also, I just learned about using Markdown to better format my readme file. With Markdown, you can make things italicized or bold. If you're interested, you can checkout this link for more info!

Perhaps our instructors @ghego, @ostegm or @dhercher can teach us some cool markdown techniques in class.

HW6 Review

Good idea using dummies to deal with the categorical data in the first 2 columns! That would probably help when using it with the normalized data as well, as the data would be in binary!

Did you normalize with Standard scaler as well, did min max produce better results?

@ghego

Test Issue

I have no idea what is "Issue".
But I am following the instructions of homework assignment.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.