Massaki's Projects
100 days of code challenge, but about Kafka (WIP)
just algorithms. nothing else
repo that hosts my blog
Building an ETL pipeline that extracts data from AWS S3, stages it in AWS Redshift, and transforms it into a set of dimensional tables using a star schema.
An LLVM compiler made in Python
These are Python solutions for the book Cracking the Coding Interview, 6th Edition by Gayle Laakmann McDowell.
In this project we will build an ETL pipeline that extracts data from a data lake hosted on S3, processes it using Spark deployed on an AWS EMR cluster, and loads it back into S3 as a set of dimensional tables in Parquet format.
In this project, I’ve applied what I’ve learned about data modeling with Postgres and built an ETL pipeline using Python. I’ve defined fact and dimension tables for a star schema with a particular analytic focus, and written an ETL pipeline that transfers data from files in two local directories into these tables in Postgres using Python and SQL.
In this project, we'll apply the concepts learned in data modeling with Apache Cassandra and complete an ETL pipeline using Python. We will model the data by creating tables in Apache Cassandra to run queries. We are provided with part of the ETL pipeline, which transfers data from a set of CSV files within a directory into a single streamlined CSV file used to model and insert data into Apache Cassandra tables.
Orchestrating data pipelines with Apache Airflow. We will create custom operators to perform tasks such as staging the data, filling the data warehouse, and running checks. The tasks will need to be linked together to achieve a coherent and sensible data flow within the pipeline.
:mushroom: Udacity Data Engineering Nanodegree Project 3
Udacity Data Engineering project: ETL pipeline for a data lake hosted on S3.
A repo for studying distributed systems using Go
A simple Django blog
Docker for Developers, published by Packt
A simple benchmark comparing Dancer (Perl) with FastAPI (Python)
A brief benchmark of using Python with Elasticsearch vs. PyO3 with Elasticsearch
Hangman game made with Python using object-oriented programming
Udacity Data Engineering Nanodegree Project 5
An app for homeschool planning
A JSON parser made in Python
Exercise lists for the Numerical Calculus course of the Bachelor's degree in Mathematics at the Universidade Federal de Santa Maria.
Lock resources against concurrent use
Config files for my GitHub profile.