Data Engineering: Data warehouse tech stack with PostgreSQL, DBT, Airflow

Objective

A city traffic department wants to collect traffic data using swarm UAVs (drones) at a number of locations in the city, and to use that data to improve traffic flow and for a number of other tasks. The objective is to create a scalable data warehouse that hosts the vehicle trajectory data extracted by analyzing footage from the swarm drones and static roadside cameras.

Tech-Stack Flow Diagram

Data

The data can be found here

Requirements

Pip

Apache airflow

Python 3.5 or above

Docker and Docker compose

You can find the full list of requirements in the requirements.txt file

Install

It is highly recommended to create a new virtual environment and install all required modules and libraries in it.
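
A minimal sketch of setting up such an environment, assuming a Unix-like shell (bash or zsh) and Python's built-in venv module. The directory name .venv is an arbitrary choice and not something this project prescribes. On Windows the activation script is .venv\Scripts\activate instead.

```shell
# Create an isolated environment in a directory named .venv (hypothetical name)
python3 -m venv .venv

# Activate it for the current shell session (POSIX shell syntax)
. .venv/bin/activate

# Confirm that pip now resolves to the virtual environment
python -m pip --version
```

With the environment active, the pip install step in the next section installs packages into .venv rather than into the system Python.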

Installing this application

  • Clone the project and install its dependencies with the following commands:
git clone https://github.com/Data-warehouse_DBT_Airflow.git
cd Data-warehouse_DBT_Airflow
pip install -r requirements.txt

Screenshots

The detailed use and implementation of the pipelines using Apache Airflow, DBT, PostgreSQL and Redash are found here.

Notebooks

The notebooks used in this project, including EDA and data cleaning, are found here in the Notebooks folder.

Scripts

All the scripts and modules used for this project are found in the scripts folder.

Tests

All the unit and integration tests are found here in the tests folder.

Tools used in this project

Authors

👤 Akubazgi Gebremariam

Show your support

Give a โญ if you like this project, and also feel free to contact me at any moment.
