mlcourse.ai – Open Machine Learning Course

License: CC BY-NC-SA 4.0

The final session launched on September 2, 2019. You can join at any point until the session ends (November 22). Fill in this form to participate, and please explore the main page, mlcourse.ai, as well.

Mirrors (🇬🇧-only): mlcourse.ai (main site), Kaggle Dataset (same notebooks as Kernels)

Outline

This is the list of articles published on medium.com 🇬🇧 and habr.com 🇷🇺. Notebooks in Chinese 🇨🇳 and links to Kaggle Kernels (in English) are also given. Icons are clickable.

  1. Exploratory Data Analysis with Pandas 🇬🇧 🇷🇺 🇨🇳, Kaggle Kernel
  2. Visual Data Analysis with Python 🇬🇧 🇷🇺 🇨🇳, Kaggle Kernels: part1, part2
  3. Classification, Decision Trees and k Nearest Neighbors 🇬🇧 🇷🇺 🇨🇳, Kaggle Kernel
  4. Linear Classification and Regression 🇬🇧 🇷🇺 🇨🇳, Kaggle Kernels: part1, part2, part3, part4, part5
  5. Bagging and Random Forest 🇬🇧 🇷🇺 🇨🇳, Kaggle Kernels: part1, part2, part3
  6. Feature Engineering and Feature Selection 🇬🇧 🇷🇺 🇨🇳, Kaggle Kernel
  7. Unsupervised Learning: Principal Component Analysis and Clustering 🇬🇧 🇷🇺 🇨🇳, Kaggle Kernel
  8. Vowpal Wabbit: Learning with Gigabytes of Data 🇬🇧 🇷🇺 🇨🇳, Kaggle Kernel
  9. Time Series Analysis with Python, part 1 🇬🇧 🇷🇺 🇨🇳. Predicting future with Facebook Prophet, part 2 🇬🇧 🇨🇳, Kaggle Kernels: part1, part2
  10. Gradient Boosting 🇬🇧 🇷🇺 🇨🇳, Kaggle Kernel

Lectures

Video lectures are uploaded to this YouTube playlist. Introduction: video, slides

  1. Exploratory data analysis with Pandas, video
  2. Visualization, main plots for EDA, video
  3. Decision trees: theory and practical part
  4. Logistic regression: theoretical foundations, practical part (baselines in the "Alice" competition)
  5. Ensembles and Random Forest – part 1. Classification metrics – part 2. Example of a business task, predicting a customer payment – part 3
  6. Linear regression and regularization - theory, LASSO & Ridge, LTV prediction - practice
  7. Unsupervised learning - Principal Component Analysis and Clustering
  8. Stochastic Gradient Descent for classification and regression - part 1, part 2 TBA
  9. Time series analysis with Python (ARIMA, Prophet) - video
  10. Gradient boosting: basic ideas - part 1, key ideas behind XGBoost, LightGBM, and CatBoost + practice - part 2

Fall 2019 assignments

All deadlines are at 20:59 GMT+1 (London time). See also this Google calendar.

  1. Exploratory data analysis of Olympic games with Pandas, nbviewer. Deadline: September 15
  2. Trees, forests and boosting
    • Quiz 1. Trees and forests, nbviewer. Deadline: September 27
    • Part 1. Classification and regression trees, nbviewer. Deadline: October 6
    • Part 2. Beating a baseline in a Kaggle competition, CatBoost starter. Deadline: October 6
  3. Linear classification and regression models
    • Quiz 2. Math behind linear models, nbviewer. Deadline: October 25
    • Part 1. User Identification with Logistic Regression, nbviewer. Deadline: October 27
    • Part 2. Random Forest and Logistic Regression in credit scoring and movie reviews classification, nbviewer. Deadline: October 27

Demo assignments, just for practice

The following are demo versions. Full versions are announced during course sessions.

  1. Exploratory data analysis with Pandas, nbviewer, Kaggle Kernel, solution
  2. Analyzing cardiovascular disease data, nbviewer, Kaggle Kernel, solution
  3. Decision trees with a toy task and the UCI Adult dataset, nbviewer, Kaggle Kernel, solution
  4. Sarcasm detection, Kaggle Kernel, solution. Linear Regression as an optimization problem, nbviewer, Kaggle Kernel
  5. Logistic Regression and Random Forest in the credit scoring problem, nbviewer, Kaggle Kernel, solution
  6. Exploring OLS, Lasso and Random Forest in a regression task, nbviewer, Kaggle Kernel, solution
  7. Unsupervised learning, nbviewer, Kaggle Kernel, solution
  8. Implementing online regressor, nbviewer, Kaggle Kernel, solution
  9. Time series analysis, nbviewer, Kaggle Kernel, solution
  10. Beating a baseline in a competition, Kaggle Kernel

Kaggle competitions

  1. Catch Me If You Can: Intruder Detection through Webpage Session Tracking. Kaggle Inclass
  2. How good is your Medium article? Kaggle Inclass
  3. DotA 2 winner prediction. Kaggle Inclass

Rating

Throughout the course we maintain a student rating. It takes into account credits scored in assignments and Kaggle competitions. Reportedly, the rating is a strong motivation to finish the course. Top students (according to the final rating) are listed on a special page.

Community

Discussions are held in the #mlcourse_ai channel of the OpenDataScience (ods.ai) Slack team.

The course is free, but you can support the organizers with a monthly pledge on Patreon or a one-time payment on Ko-fi. This helps spread machine learning around the world!
