Giter Club home page Giter Club logo

azure / azureml-examples Goto Github PK

View Code? Open in Web Editor NEW
1.6K 2.6K 1.3K 665.71 MB

Official community-driven Azure Machine Learning examples, tested with GitHub Actions.

Home Page: https://docs.microsoft.com/azure/machine-learning

License: MIT License

Python 12.22% Jupyter Notebook 83.01% Dockerfile 0.14% Shell 3.24% R 0.13% Julia 0.01% Java 0.08% Bicep 0.16% PowerShell 0.02% C# 0.71% PureBasic 0.14% TypeScript 0.11% Jinja 0.01%
azure azureml ml azure-machine-learning data-science

azureml-examples's Introduction

Azure Machine Learning examples

license: MIT

Welcome to the Azure Machine Learning examples repository!

About This Repository

The azureml-examples repository contains examples and tutorials to help you learn how to use Azure Machine Learning (Azure ML) services and features.

Getting Started

If you're getting started with Azure ML, consider working through our tutorials for the v2 Python SDK. You may also want to read through our documentation.

SDKs

The sdk/ folder houses the examples for the Azure ML SDKs across several languages.

We have an extensive collection of examples for the Azure ML Python SDK v2 in sdk/python.

We also offer some examples for our SDKs in other languages:

Azure Machine Learning extension for Azure CLI

The cli/ folder hosts our examples to use the Azure Machine Learning extension for Azure CLI.

Note: If you're looking for examples that submit Azure ML jobs that run non-Python code, see:

Supplementary Documentation

Contributing

We welcome contributions and suggestions! Please see the contributing guidelines for details.

Code of Conduct

This project has adopted the Microsoft Open Source Code of Conduct. Please see the code of conduct for details.

azureml-examples's People

Contributors

anagarajan97 avatar balapv avatar bhimar avatar cassieesvelt avatar diondrapeck avatar elliotzh avatar fredms avatar gaugup avatar harneetvirk avatar iamrk04 avatar imatiach-msft avatar jeff-shepherd avatar kdestin avatar lostmygithubaccount avatar martinezgjuan avatar nancy-mejia avatar nick863 avatar raja7592 avatar rjaincc avatar rsethur avatar rtanase avatar santiagxf avatar sdgilley avatar sharma-riti avatar vadthyavath avatar vlbejan avatar xanwal avatar ynpandey avatar yuanzhuangyuanzhuang avatar zetiaatgithub avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

azureml-examples's Issues

Add dataset example

What notebook?
Add training example using a FileDataset and a SRC

Additional context
Add any other context here.

distributed XGBoost example

What notebook?
What would you like to see in an example notebook?

Additional context
Add any other context here.

Tutorials should have and pass a well-defined content quality bar

Some Tutorial notebooks do not have any explanation but only have codes. We need to set up a bar for a notebook to be a part of Tutorials.

in order to qualify as tutorial, notebook should have:

requirements (what you need to understand / run)
"deliverables" (what you get from running the tutorial)
some kind of written content (not code) talking through what's happening

Some examples:

setup.ipynb -> setup.py

What notebook?
refactor for easier automation - look into RG delete/create?

Additional context
Add any other context here.

notebook -> examples

Describe the issue or request
for:

  • issue templates
  • other?

Additional context
Add any other context about the problem here.

Error messages on "failed to load ..." when running setup script

The script setup.py logs error messages

Here are some examples:

Failure while loading azureml_run_type_providers. Failed to load entrypoint hyperdrive = azureml.train.hyperdrive:HyperDriveRun._from_run_dto with exception (azureml-telemetry 1.17.0 (/home/amin/miniconda/lib/python3.7/site-packages), Requirement.parse('azureml-telemetry~=1.15.0')).

Failure while loading azureml_run_type_providers. Failed to load entrypoint automl = azureml.train.automl.run:AutoMLRun._from_run_dto with exception (azureml-telemetry 1.17.0 (/home/amin/miniconda/lib/python3.7/site-packages), Requirement.parse('azureml-telemetry~=1.15.0')).

Guessing these are caused by our requirements.txt file not "pinning" dependency versions, since this doesn't reproduce in a fresh Conda environment.

Should we "nudge" users towards a clean Conda or virtual environment? In addition, maybe "pin" package versions? This will be especially relevant as SDK VNext lands.

Especially relevant since our notebooks / code don't mention version information.

Could use "dependabot" to grab new Python package versions.

Likely related to: Task 907711: AutoML: Pin version of the portalocker package to 1.7.0.

get `notebook-run` action properly running on PRs

Describe the issue or request
if pull_request is added to the triggers, it fails because forked repos don't have the Azure credentials. Need to investigate how to work around this.

Additional context

Replace CI/AMLNB samples

Describe the issue or request
replace samples AML experience with this and other examples

Additional context
Add any other context about the problem here.

lightgbm gpu examples

What example? Describe it

example: lightgbm gpu
description: single node single gpu, single node multi gpu, multi node multi gpu

Additional context

  • using python api

need more compute targets

Describe the issue or request
we are supporting a wide range of scenarios and, particularly for GPUs, need different compute targets

add to setup.ipynb, configure in testing workspace

Additional context
Add any other context about the problem here.

fastai notebook broken

Which notebook?
fastai/train-pets-resnet34.ipynb

Describe the issue
not working

To Reproduce
run the notebook

Expected behavior
notebook runs

Screenshots
If applicable, add screenshots to help explain your problem.

Environment details

  • any

Additional context
Add any other context about the problem here.

The setup script should be named something besides setup.py

When users come across a repo with setup.py in the root, they expect to be able to "install the repo" via pip install .. However, that is not what this repository's setup.py does.

Recommendation: name it something like setup-aml-infrastructure.py.

Update docs.microsoft.com

Describe the issue or request
With links to new notebooks

Additional context
Add any other context about the problem here.

The setup script should default to the user's location

Currently, setup.py has a hardcoded default location of eastus2 for the workspace. It should have some basic logic to default to the user's location (if possible, not sure this is).

In theory, this could be done calling ARM client SDK - but's that's super slow. Hoping there's a fast way.

Another way to handle this would be to make setup.py interactive (could use cookie cutter, or have some config file for this).

Best - use Hydra for configuring setup.py. Cleanly isolates what setup.py does from how it does it.

Can't setup an AzureML Dask Cluster

Describe the issue

When I follow along the using-dask tutorial to setup a Dask cluster, I fail to create a new cluster with an odd error that comes up:
[Errno 113] No route to host

I tried searching for it, but the results on Google will come with port and routing issues, that I don't know if are directly related to the problem here. Not sure.

Minimal example

from azureml.core import Workspace

ws = Workspace.from_config()

from azureml.core import Environment

env = Environment.get(workspace=ws, name="AzureML-Dask-CPU")

from dask.distributed import Client
from dask_cloudprovider import AzureMLCluster

cluster = AzureMLCluster(
    ws,
    vm_size="STANDARD_DS5_V2",
    environment_definition=env,
    initial_node_count=10,
    scheduler_idle_timeout=1200,
)

c = Client(cluster)
c

Additional context

  • Important to note that I am running all of this inside an Azure ML Compute Instance

hyperdrive runs stuck running forever

Which example?
rapids hyperdrive

Describe the issue
runs get stuck - need to set max experiment timeout

To Reproduce
Steps to reproduce the behavior:

  1. Go to '...'
  2. Click on '....'
  3. Scroll down to '....'
  4. See error

Expected behavior
A clear and concise description of what you expected to happen.

Screenshots
If applicable, add screenshots to help explain your problem.

Environment details

  • OS: [e.g. Windows 10, Ubuntu]
  • Azure ML SDK Version [e.g. 1.10.0]
  • other relevant details

Additional context
Add any other context about the problem here.

add PR template

Describe the issue or request
Add a PR template

Additional context
none

add cleanup workflow

cleanup:

  • workspaces (from concept)
  • compute targets
  • compute instances
  • webservices

distributed LightGBM example

What notebook?
What would you like to see in an example notebook?

Additional context
Add any other context here.

az ml install is bad

Describe the issue or request
the az ml installation command used in this repo (mostly in workflows) is not the publicly documented method

Additional context
this issue needs to be resolved before switching back: Azure/azure-cli#15434

unpin azureml-core version in triton local deploy notebook

Which notebook?
notebooks/triton/deploy densenet local.ipynb

Describe the issue
azureml-core is pinned to 1.15.0 via pip install in the notebook

To Reproduce
Steps to reproduce the behavior:

  1. Go to '...'
  2. Click on '....'
  3. Scroll down to '....'
  4. See error

Expected behavior
uses latest SDK installed from requirements.txt

Screenshots
If applicable, add screenshots to help explain your problem.

Environment details

  • OS: [e.g. Windows 10, Ubuntu]
  • Azure ML SDK Version [e.g. 1.10.0]
  • other relevant details

Additional context
Add any other context about the problem here.

xgboost gpu examples

What example? Describe it

example: xgboost gpu examples
description: single node single gpu, single node multi gpu, multi node multi gpu

Additional context

  • using python api

deployment auth

Which example?
Which example is the issue in?

Describe the issue
A clear and concise description of what the issue is.

To Reproduce
Steps to reproduce the behavior:

  1. Go to '...'
  2. Click on '....'
  3. Scroll down to '....'
  4. See error

Expected behavior
A clear and concise description of what you expected to happen.

Screenshots
If applicable, add screenshots to help explain your problem.

Environment details

  • OS: [e.g. Windows 10, Ubuntu]
  • Azure ML SDK Version [e.g. 1.10.0]
  • other relevant details

Additional context
Add any other context about the problem here.

make `setup.py` optionally interactive

Describe the issue or request
A clear and concise description or request for enhancement.

Additional context
Add any other context about the problem here.

datastore concept notebook is bad

Which notebook?
concepts/datastore

Describe the issue
uses TuringNLR datastore

To Reproduce
read

Expected behavior

Screenshots

Environment details

  • OS: [e.g. Windows 10, Ubuntu]
  • Azure ML SDK Version [e.g. 1.10.0]
  • other relevant details

Additional context

minor issue in introduction tutorial

Which example? Describe the issue

example: an-introduction/1.hello-world.ipynb
description: under understanding the code there are 2 minor issues

Additional context

  • link to Run Python API docs broken
  • experiment -> exp per code cell

The CIFAR 10 deepspeed example requires a V100 cluster

The example train/deepspeed-cifar10.py requires compute target gpu-V100-2

This seems like overkill and makes the examples less accessible. Is it really necessary?

More generally, most subscriptions start out with a V100 quota of 0. The process for increasing this is painful and lengthy. Most all examples in the repo don't really need such heavy compute.

Also: consider provisioning low-pri compute (at least: mention).

Add automl example(s)

What notebook?
AutoML thing client

Additional context
Add any other context here.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.