Giter Club home page Giter Club logo

perfectthymetech / cloudscaleanalytics-v2-terraform Goto Github PK

View Code? Open in Web Editor NEW
10.0 0.0 1.0 6.96 MB

Cloud Scale Analytics (v2) to create a scalable data platform on Azure using a Data Management Zone, Data Landing Zones and Data Applications to build Data Products.

License: MIT License

azure cloud-scale-analytics data-platform datamesh enterprise-architecture enterprise-scale enterprise-scale-analytics terraform architecture cloudscaleanalytics

cloudscaleanalytics-v2-terraform's Introduction

CloudScaleAnalytics v2

This project revisits the Cloud Scale Analytics data platform reference architecture for Microsoft Azure. While the core principles of the architecture design have not changed, the next generation of the design will and enhance and introduce many new capabilities that will simplify the overall management, onboarding and significantly reduce the time to market.

Over the last couple of years, numerous data platforms have been built on the basis of Cloud Scale Analytics which resulted in a ton of learnings and insights. In addition to that, new services and features have been introduced, reached a GA status and common requirements have drifted. All these data points have been used to build this next iteration of the reference architecture for scalable data platforms on Azure.

The Cloud Scale Analytics reference architecture consists of the following core building blocks:

  1. The Data Management Zone is the core data governance entity of on organization. In this Azure subscription, an organization places all data management solution including their data catalog, the data lineage solution, the master data management tool and other data governance capabilities. Placing these tools inside a single subscription ensures a resusable data management framework that can be applied to all Data Landing Zones and other data sources across an organization.

  2. The Data Landing Zone is used for data retention and processing. A Data Landing Zone maps to a single Azure Subscription, but organizations are encouraged to have multiple of these for scaling purposes. Within a Data Landing Zone an orgnaization may implement one or multiple data applications.

  3. A Data Application environment is a bounded context within a Data Landing Zone. A Data Application is concerned with consuming, processing and producing data as an output. These outputs should no longer be treated as byproducts but rather be managed as a full product that has a defined service-level-agreement.

Cloud-scale Analytics v2

Terraform Modules

The Terraform Modules can be found by following these links:

Data Management Zone

Data Landing Zone

Project Dashboard & Backlog

We have a public GitHub project, which gives you visibility into the backlog and the status of issues of this project.

License

MIT License

Contributing

This project accepts public contributions. Please use issues, pull requests and the discussins feature in case you have any questions or concerns.

cloudscaleanalytics-v2-terraform's People

Contributors

dependabot[bot] avatar github-actions[bot] avatar marvinbuss avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Forkers

larsbulow

cloudscaleanalytics-v2-terraform's Issues

Feature: Add Databricks configuration to Data Landing Zone

Feature or Idea - What?

Add configuration of Databricks workspaces in Data Landing Zone.

Feature or Idea - Why?

This is required to make sure the workspace is connected to the right Unity catalog and ensure that correct governance policies are applied.

Feature: Add Data Product Functionality

Feature or Idea - What?

Add functionality to easily add Data Products to a Data Landing Zone.
This will enable to more easily onboard smaller project/business teams and give them some space to implement their project.

Feature or Idea - Why?

Today, this is very clunky and complex for customers. We can use configuration files to simplify this tremendously.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.