Team Data Science Process from Microsoft

Overview | Lifecycle | Roles & Tasks | Project Template | Project Execution | Data Science Utilities


This repository contains the Team Data Science Process (TDSP) from Microsoft. TDSP is an agile, iterative data science process for executing and delivering advanced analytics solutions. It is designed to improve the collaboration and efficiency of data science teams in enterprise organizations. It is supported through four key components:

  • a data science lifecycle definition
  • a standardized project structure (project documentation and reporting templates)
  • infrastructure for project execution (compute and storage infrastructure, code repositories, etc.)
  • tools for data science project tasks (version control, data exploration and modeling, work planning, etc.)
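
As a rough illustration of the second component, a standardized project structure could be scaffolded with a short script. This is a minimal sketch only: the folder names in `LAYOUT` and the `scaffold` helper are assumptions made for illustration, not the layout defined by the TDSP project template.

```python
from pathlib import Path

# Hypothetical layout: these folder names are illustrative assumptions,
# not the official TDSP project template.
LAYOUT = [
    "code/data_acquisition",
    "code/modeling",
    "code/deployment",
    "docs/project",
    "docs/data_report",
    "docs/model_report",
    "sample_data",
]


def scaffold(root):
    """Create every folder in LAYOUT under root and return the created paths."""
    root = Path(root)
    created = []
    for rel in LAYOUT:
        path = root / rel
        path.mkdir(parents=True, exist_ok=True)
        # A placeholder README keeps otherwise empty folders under version control.
        (path / "README.md").touch()
        created.append(path)
    return created
```

Calling `scaffold("my_project")` would create the folders under `my_project`. Because every project then starts from the same skeleton, team members know where to find documentation, reports, and code without asking.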

For the execution of data science projects, TDSP provides guidelines on how to structure collaborative teams and tasks, and on how to execute projects using Agile planning and version control.

To help teams perform certain stages of a data science project efficiently and in a semi-automated manner, TDSP also provides data exploration and (semi-)automated modeling tools in R and Python. These tools also produce standardized reports or artifacts.

TDSP resources on Azure

We provide documentation, end-to-end data science process walkthroughs, and templates for different platforms and tools on Azure, such as Azure ML, HDInsight, Microsoft R Server, SQL Server, and Azure Data Lake.

In particular, here are instructions on how to execute data science life cycle steps in Azure ML.

Contributing to TDSP

We believe that, with the help of the data science community, we can make TDSP even better and help more enterprises and individual data scientists work more efficiently. We welcome contributions to TDSP, whether to the documentation, to the workflow, or to implementations of TDSP on different tools for versioning or work item management. Feel free to contribute pages at TDSP/wiki.

If you have useful data science tools and utilities to share, we encourage you to contribute to the TDSP-Utilities GitHub repository.

Release Notes

This is version 0.1.2 of TDSP. Version 0.1.1 was released in September 2016. We are continuously improving TDSP based on our accumulated experience and customer feedback.

Questions or suggestions

If you have any questions or suggestions, please create a new discussion thread on the Issues tab.


[Figure: TDSP lifecycle]


Last updated: Aug 15, 2017
