Topic: data-engineering-pipeline
data-engineering-pipeline,Data Engineering Project with Hadoop HDFS and Kafka
User: ahmetfurkandemir
data-engineering-pipeline,Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow
User: alanchn31
data-engineering-pipeline,A batch data pipeline that retrieves data from a user purchase table and a movie review table and transforms it into a user behaviour metric table.
User: alero-awani
data-engineering-pipeline,A streaming ETL pipeline for Realtime Tweet Collection, Analysis and Reporting
User: alphanaksoyoglu
data-engineering-pipeline,Let your pipe lines flow thru the Python code in xonsh.
User: anki-code
data-engineering-pipeline,Project demonstrating how to automate Prefect 2.0 deployments to AWS ECS Fargate
User: anna-geller
data-engineering-pipeline,Project demonstrating how to automate Prefect 2.0 deployments to AWS EKS
User: anna-geller
Home Page: https://discourse.prefect.io/t/how-to-build-a-prefect-2-0-poc-on-aws/1066
data-engineering-pipeline,Deploy a Prefect flow to serverless AWS Lambda function
User: anna-geller
data-engineering-pipeline,Code examples showing flow deployment to various types of infrastructure
User: anna-geller
data-engineering-pipeline,Get started with Prefect by scheduling your Prefect flows with GitHub Actions
User: anna-geller
Home Page: http://prefect.io/
data-engineering-pipeline,A data engineering pipeline for digital marketers.
Organization: antimoz-om
data-engineering-pipeline,Docker powered starter for geospatial analysis of lightning atmospheric data.
User: bayoadejare
Home Page: https://lightning-containers.streamlit.app/
data-engineering-pipeline,Solution for the Ultimate Student Hunt Challenge (1st place).
User: benedekrozemberczki
data-engineering-pipeline,Challenge to job: Data Scientist
User: brunocampos01
data-engineering-pipeline,Sample data store project to be hosted on a remote server or cluster. CI/CD using GitHub Actions for SSH deploy to a remote server running Docker Compose.
User: charliesergeant
data-engineering-pipeline,ETL pipeline for construction permits data in Los Angeles built on AWS S3, Lambda and RDS PostgreSQL.
User: chrisammon3000
data-engineering-pipeline,Using Great Expectations and Notion's API, this repo aims to provide data quality for our databases in Notion.
Organization: datarootsio
data-engineering-pipeline,End-to-end data engineering processes for the Nigeria Health Facility Registry (HFR). The project leveraged Selenium, Pandas, PySpark, PostgreSQL and Airflow.
User: delelinus
data-engineering-pipeline,Data Engineering Projects including Data Modeling, Data Warehouse, Data Lake Development
User: dvu4
data-engineering-pipeline,Data Engineering pipeline hosted entirely in the AWS ecosystem utilizing DocumentDB as the database
User: dylanzenner
data-engineering-pipeline,Data Engineering pipeline hosted entirely in the AWS ecosystem utilizing DynamoDB as the database
User: dylanzenner
data-engineering-pipeline,Classwork projects and homework completed for the Udacity Data Engineering Nanodegree
User: immu0001
data-engineering-pipeline,F1 Data Pipeline
User: inosrahul
Home Page: https://lookerstudio.google.com/reporting/9fd225dd-a9b8-45d9-87dc-7d7dbae0c841
data-engineering-pipeline,End-to-end data engineering project
User: kaoutaar
data-engineering-pipeline,Marshmallow serializer integration with pyspark
User: ketgo
data-engineering-pipeline,An end-to-end real-time stock market data pipeline with Python, AWS EC2, Apache Kafka, and Cassandra. Data is processed on AWS EC2 with Apache Kafka and stored in a local Cassandra database.
User: kishlayjeet
data-engineering-pipeline,An end-to-end Twitter Data Pipeline that extracts data from Twitter and loads it into AWS S3.
User: kishlayjeet
data-engineering-pipeline,The NHANES Data 'API' is a Python tool that simplifies access to the National Health and Nutrition Examination Survey (NHANES) dataset. This project provides an easy-to-use API to retrieve NHANES data, helping researchers, data scientists, health professionals, and other stakeholders access these valuable datasets.
User: kkrusere
Home Page: https://pypi.org/project/nhanes-pytool-api/
data-engineering-pipeline,Social media analysis: a scalable solution with flexible deployment that analyses social media content
User: koksang
data-engineering-pipeline,💜🌈📊 A Data Engineering Project that implements an ETL data pipeline using Dagster, Apache Spark, Streamlit, MinIO, Metabase, dbt, Polars, Docker. Data from Kaggle and the YouTube API 🌺
User: longnguyen010203
data-engineering-pipeline,An environment for analyzing Twitter
User: markditsworth
data-engineering-pipeline,Apache Spark Guide
User: mikeroyal
data-engineering-pipeline,An end-to-end data pipeline for building Data Lake and supporting report using Apache Spark.
User: minhky2185
data-engineering-pipeline,End-to-end data engineering pipeline with various technologies to ingest real time data.
User: nitindatta8
data-engineering-pipeline,Tiny Blocks to build large and complex data pipelines!
User: pyprogrammerblog
Home Page: https://tiny-blocks.readthedocs.io/en/latest/
data-engineering-pipeline,Spotify API, Airflow, Docker, AWS S3, Snowflake, dbt, localstack, Looker Studio
User: salimt
data-engineering-pipeline,An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
User: san089
data-engineering-pipeline,A few projects related to Data Engineering, including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
User: san089
data-engineering-pipeline,ETL pipeline combined with supervised learning and grid search to classify text messages sent during a disaster event
User: sanjeevai
data-engineering-pipeline,A data pipeline from source to data warehouse using Taipei Metro Hourly Traffic data
User: shihwen
data-engineering-pipeline,This project repo 📺 offers a robust solution meticulously crafted to efficiently manage, process, and analyze YouTube video data leveraging the power of AWS services. Whether you're diving into structured statistics or exploring the nuances of trending key metrics, this pipeline is engineered to handle it all with finesse.
User: shiv-rna
data-engineering-pipeline,The goal of this project is to analyse the impact of Covid-19 on the aviation industry through data engineering processes using technologies such as Apache Airflow, Apache Spark, Tableau and a couple of AWS services
User: siddharth271101
data-engineering-pipeline,Data Engineering ZoomCamp Course Project
User: tmaferreira
data-engineering-pipeline,Uber Data Analysis Project, an End-to-End Data Engineering Project from creating data pipelines to finally creating the dashboard.
User: umairthakur
Home Page: https://lookerstudio.google.com/s/nQI06ax2wMY
data-engineering-pipeline,Analysis of 311 Service Requests for the City of NYC (from 2010 to 2023) Tech: Prefect cloud, dbt core, BigQuery, Compute Engine, CloudRun, Artifact Registry, Terraform, Docker
User: verazab
Home Page: https://lookerstudio.google.com/reporting/65ee32b0-4626-4a39-8065-5d8c27380a1a
data-engineering-pipeline,One framework to develop, deploy and operate data workflows with Python and SQL.
Organization: vmware
data-engineering-pipeline,Formula 1 race data engineering project that uses Azure services and Databricks to ingest and analyse the data.
User: waqarg2001
data-engineering-pipeline,This is an ETL project: extracting data from an e-commerce transactional database on RDS, transforming it with an AWS Glue job, loading it into a Redshift data warehouse, and connecting it to Tableau for BI
User: yan-luo-au
data-engineering-pipeline,This repo contains the Data Engineering exercises I completed on DataCamp.
User: zarexalvindaria