Giter Club home page Giter Club logo

machine-learning-course's Introduction

Machine Learning #1 and #2

This repo contains materials for the introductory/intermediate Machine Learning (ML) courses (3 credits total) taught in the MSc in Business Analytics program at the Central European University (CEU), 2020. The material for 2018 and 2019 can be found on branches '2018' and '2019'. The material for the previous 2-credit course taught in 2016 and 2017 can be found here.

Course Design and Instructors

Zoltán Papp
Jenő Pál (TA)

Initial Course Designer (and Instructor pre-2018):

Szilárd Pafka

Course Description and Objectives

The breakdown below (into ML #1 an #2) is somewhat adhoc and mainly to comply with administrative requirements. Both courses will intertwine general ML concepts, algorithms and software implementations/tools and will aim to strike a balance of theory and practice with the goal of equiping students with both the foundations to understand the ML methodology and also with the skills needed for using ML in practical business applications.

Data Science and Machine Learning 1 (Concepts):

After an overview of the entire data science landscape this course will focus on machine learning. The course will introduce the main fundamental concepts in machine learning (supervised learning, training, scoring, accuracy measures, test set, overfitting, cross validation, model capacity, hyperparameter tuning, grid and random search, regularization, ensembles, model selection etc.) The concepts will be illustrated with R code therefore it requires prior familiarity with R.

Data Science and Machine Learning 2 (Tools):

This course will build on the previous one (which introduced the basic concepts in machine learning) and will discuss state-of-the-art algorithms for supervised learning (linear models, lasso, decision trees, random forests, gradient boosting machines, neural networks, support vector machine, deep learning etc.). A large part of the course will be dedicated to using (hands-on) the software tools for machine learning used by data scientists in practice (various high-performance R packages, xgboost, libraries for deep learning etc.).

Grading

The two courses are graded with the same structure but completely separately from each other.

  • 45% Weekly Assignments (homework exercises). These will be submitted using Moodle.
  • 45% Final Exam
  • 10% Quizzes at the beginning of each lecture, except the first lectures of each course. Missing a lecture or being late will result in 0% for the actual quiz score.

Assignments

Assignment acceptance policy and achievable grades:

  • 100% until due date
  • 50% within 24 hours past due date
  • 0% after that.

Assignment dates and deadlines for ML #1:

  • Feb 12, due on Feb 24 3:00 PM

Assignment dates and deadlines for ML #2:

  • Feb 26, due on Mar 9 3:00 PM
  • (tentative) Mar 18, due on Mar 27 3:00 PM

Final exam

Final exam for ML1: Feb 19 1:30 PM Final exam for ML2: Mar 30 3.30 PM

Announcements and Q&A

Class announcements and student Q&A will be done via Moodle.

Syllabus and Schedule:

ML #1

ML 1.1: Lab: penalized linear models: ridge, LASSO, elastic net. Lecture | Lab.

ML 1.2: Lab: Unsupervised learning. Clustering (k-means, hierarchical). PCA. Lecture | Lab.

ML #2

ML 2.1: Understanding and tuning parameters for trees, random forests and gradient boosting machines. Impact of correlated features. Support vector machines. Tools: R packages, xgboost, lightgbm Lecture | Lab.

ML 2.2: Neural networks and deep learning. Reinforcement Learning. Evolutionary Computing. Tools: R packages, Keras. Lecture | Lab.

ML 2.3: Ensembles, Stacking. Lecture | Lab.

ML 2.4: Recap and summary. Lecture | Lab.

machine-learning-course's People

Contributors

paljenczy avatar pappzoltan avatar szilard avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.