The official repo for Azure Machine Learning Notebooks for Pipelines is this: aka.ms/aml-pipeline-notebooks. This repro introduces concepts in the alpha stage of development. As soon as the Notebooks become official versions, they will be moved from here to the official repo.
The Azure Machine Learning Pipelines enables data scientists to create and manage multiple simple and complex workflows concurrently. A typical pipeline would have multiple tasks to prepare data, train, deploy and evaluate models. Individual steps in the pipeline can make use of diverse compute options (for example: CPU for data preparation and GPU for training) and languages.
The Python-based Azure Machine Learning Pipeline SDK provides interfaces to work with Azure Machine Learning Pipelines. To get started quickly, the SDK includes imperative constructs for sequencing and parallelization of steps. With the use of declarative data dependencies, optimized execution of the tasks can be achieved. The SDK can be easily used from Jupyter Notebook or any other preferred IDE. The SDK includes a framework of pre-built modules for common tasks such as data transfer and compute provisioning.
Data management and reuse across pipelines and pipeline runs is simplified using named and strictly versioned data sources and named inputs and outputs for processing tasks. Pipelines enable collaboration across teams of data scientists by recording all intermediate tasks and data.
In the official repo at aka.ms/aml-pipeline-notebooks, there are two types of notebooks:
- The first type of notebooks will introduce you to core Azure Machine Learning Pipelines features. These notebooks below belong in this category, and are designed to go in sequence; they're all located in the "intro-to-pipelines" folder:
- aml-pipelines-getting-started.ipynb
- aml-pipelines-with-data-dependency-steps.ipynb
- aml-pipelines-publish-and-run-using-rest-endpoint.ipynb
- aml-pipelines-data-transfer.ipynb
- aml-pipelines-use-databricks-as-compute-target.ipynb
- aml-pipelines-use-adla-as-compute-target.ipynb
- The second type of notebooks illustrate more sophisticated scenarios, and are independent of each other. These notebooks include: