djgagne / ams-ml-python-course Goto Github PK
View Code? Open in Web Editor NEWMachine Learning in Python for Environmental Science Problems AMS Short Course Material
License: MIT License
Machine Learning in Python for Environmental Science Problems AMS Short Course Material
License: MIT License
Hello David,
I am Chiara Lepore, a Research Scientist at LDEO-Columbia University. We briefly met this summer over lunch at the Hail Workshop held in Boulder.
I have seen your tweet about your repo with the ML classes, and I am excited to give it a try.
I am contacting you, right now, to introduce you to a great open source effort brought forward by a large community, called Pangeo. Since you have shared your course (thank you!) I gather you are keen to share your efforts in an open source community, and I think you will find Pangeo very interesting and helpful.
Pangeo is a NSF EarthCube funded project and it is, as of now (it has changed scope in the past year), very much interested in allowing simple Big Data analysis in the cloud, with a specific focus on Geosciences. You can read more about it here on its website and check out the GitHub repo where we use the issue section to communicate and discuss further developments.
I thought about introducing you to Pangeo when I have looked at the section in which you carefully describe how to set up the Jupyter Hub Containers.
In fact, one of the main goal of Pangeo, is to develop cloud computing platforms that are already set up for people, like me for example, who could find complicated to set it up on their own. More over, Pangeo is right now - although it will soon be dismissed - providing a cloud computing platform open to everyone (after they request access through an issue on the repo) to try out Jupyter on the cloud.
Yesterday a new issue was opened in which folks interested in ML are going to discuss ways Pangeo can help simplify the workflow and provide support to people interested in ML. In fact, one of the goals of Pangeo, being a NSF funded proposal, is also to provide open source tutorials, workflows, and anything that can help people spun up and over the initial hump that sometimes setting up environments, dealing with large data, preprocessing etc, can create.
I am not the best person to describe the technical details of Pangeo infrastructure, but I hope you will join us for a chat and that Pangeo can help you with your ML course.
One specific thing we could try is to turn your course into a binder that runs on the pangeo binder service.
I would like to make the following changes to the short course before the SEA Conference:
Add headings.
Do actual backwards optimization on 32 subplot image.
Potentially switch out the ridge and lasso plot with https://stats.stackexchange.com/questions/341816/why-the-contour-of-lasso-and-ridge-regression-are-drawn-only-at-that-position-an.
Remove pre-processing and training.
Redo ordering:
Will be 6 hours (0900-1600 with one-hour lunch break).
We plan to cover the same topics roughly.
Just Ryan and DJ teaching.
I will put in Grad-CAM before novelty detection and leave novelty detection as a bonus topic.
Replace “Citation” with actual title of my notebook.
Remove the long list of “pip install”.
Put constants in dictionary?
Don’t cover plotting and norm/denorm.
I should just have one block that loads all the data, normalizes, and binarizes.
I also will focus less on CNN setup.
Do motivation for interpretation at the beginning.
I can probably include more details on evaluation.
Olah figure with all the different chunking options might be confusing, since we use only output neuron.
DJ has a flow chart that explains BWO (I will use this).
Put saliency before BWO.
Order: saliency, Grad-CAM, BWO, novelty detection.
I ran into a few quirks installing this on Windows (7 x64). First, an existing installation of python through Visual Studio was causing all kinds of problems, had to completely remove everything python and reinstall miniconda.
I also needed to run the anaconda command prompt as administrator for certain packages to install properly.
Finally, there were a few packages missing after walking through your setup instructions (local install). I had to add the following through conda install:
geos
cartopy
jupyterlab
Now it seems to be working.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.