Giter Club home page Giter Club logo

course-resources-ml-with-experts-budgets's Introduction

course-resources-ml-with-experts-budgets

Further student resources for DrivenData's 'Machine Learning with the Experts: School Budgets' DataCamp course.

To see the model, take a look at the notebook that builds the winning model.

To get the data, sign up for the competition and use the data download link!

To run the notebook, first install the dependencies with:

pip install -r requirements.txt

Then run:

jupyter notebook notebooks/1.0-full-model.ipynb

Project Organization

├── LICENSE
├── README.md   
├── data
│   ├── TestSet.csv
│   └── TrainingSet.csv
├── notebooks
│   └── 1.0-full-model.ipynb
├── requirements.txt
└── src
    ├── __init__.py
    ├── data
    │   └── multilabel.py
    ├── features
    │   └── SparseInteractions.py
    └── models
        └── metrics.py

Project based on the cookiecutter data science project template. #cookiecutterdatascience

course-resources-ml-with-experts-budgets's People

Contributors

pjbull avatar wsurles avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

course-resources-ml-with-experts-budgets's Issues

Data download

Hey, it seems the competition has ended hence I'm unable to download the dataset from the datadriven website. Is it possible for you to share the data somehow? i'd really like to try out the notebook! Any temporary link/dropbox anything works.. :)

Session keeps timing out

Background:
For the last four exercises in "Machine Learning with the Experts: School Budgets", Chapter "Improving your model", I cannot submit my answer because the session just keeps timing out. If I refresh the page, even before I hit submit answer, the console shows the following error:

Traceback (most recent call last):
File "", line 81, in
df = pd.read_csv('https://s3.amazonaws.com/assets.datacamp.com/production/course_2533/datasets/TrainingSetSample.csv', index_col=0)
File "", line 562, in parser_f
return _read(filepath_or_buffer, kwds)
File "", line 301, in _read
compression=kwds.get('compression', None))
File "", line 308, in get_filepath_or_buffer
req = _urlopen(str(filepath_or_buffer))
File "", line 163, in urlopen
return opener.open(url, data, timeout)
File "", line 466, in open
response = self._open(req, data)
File "", line 484, in _open
'_open', req)
File "", line 444, in _call_chain
result = func(*args)
File "", line 1297, in https_open
context=self._context, check_hostname=self._check_hostname)
File "", line 1256, in do_open
raise URLError(err)
urllib.error.URLError: <urlopen error [SSL: UNKNOWN_PROTOCOL] unknown protocol (_ssl.c:645)>

Replicate:

I can confirm when inputting the accurate code to exercise: https://campus.datacamp.com/courses/machine-learning-with-the-experts-school-budgets/improving-your-model?ex=9

The system accepts the code, lets me know I've passed but when I check the outline of the course it resets back to looking as if I never submitted an answer for that exercise:

screen shot 2018-08-07 at 11 21 47 am
screen shot 2018-08-07 at 11 22 33 am

UID: 2341064
Zendesk: https://datacamp.zendesk.com/agent/tickets/109837

Where is the dataset

Hey where can i download the dataset to practice in in offline, in my jupyter lab environment

Regex

I think the regex expression is wrong.

TOKENS_ALPHANUMERIC = '[A-Za-z0-9]+(?=\s+)'

Doesn't this mean that you only consider tokens if they contain only alphanumeric characters and are followed by white space ?

Example:
WORD1,WORD2, WORD3, WORD4 Word5

In the above sentence WORD4 and Word5 would be considered as tokens as the other words have a comma in them and as such are not valid tokens.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.