mouhamed-jinja Goto Github PK
Name: Mohamed Younes
Type: User
Company: LigaData
Bio: Data Plumber
Location: Cairo
Name: Mohamed Younes
Type: User
Company: LigaData
Bio: Data Plumber
Location: Cairo
The simplest way to set up a cluster environment that includes Spark, Airflow, and Postgres.
BlogAPI - A RESTful API built with FastAPI and PostgreSQL for managing blog post data, including authentication, authorization, validation, and error handling functionality.
This project leverages Hadoop, Spark, SQL, and Hive for efficient data integration, transformation, warehousing, and analytics. It provides a comprehensive solution for managing and analyzing large datasets.
kafka cluster with KRaft, and simple python producer and consumer
Integrate Kafka with Zookeeper with spark, to build simple streaming application, and the producer is simple python application
Discover the magic of database replication! This GitHub project showcases PostgreSQL and Docker Compose to optimize high-load reads. Explore the setup with one master and three followers for efficient, high-performance databases
Airflow Data Migration Project: A comprehensive Airflow project demonstrating data migration from PostgreSQL to AWS S3. Leverage the power of Airflow's operators, connections, and hooks to build robust and scalable data pipelines. Ideal for data engineering enthusiasts looking to learn and implement Airflow in real-world scenarios
"PostgresBlend Data Pipeline" is a comprehensive data integration solution designed to seamlessly merge diverse data sources into a unified PostgreSQL Data Warehouse. This project streamlines the process of integrating data from CSVs, JSON, Parquet, and MySQL databases, utilizing Apache Spark for efficient transformation and organization.
This repository contains Apache Airflow Directed Acyclic Graphs (DAGs) and associated scripts for orchestrating an Extract, Transform, Load (ETL) workflow. The workflow is designed to extract data from a source, perform transformations, and load it into a data warehouse.
Implemented a NN project for Sign Language Recognition using CNN. Dataset preprocessing, data augmentation, and model training with Adam optimizer. Evaluated performance using confusion matrix. Expertise in CNN-based image classification.
an end-to-end data streaming pipeline, seamlessly integrating Python, Kafka, Spark Streaming, Docker, and Airflow. Effortlessly process, transmit, and analyze real-time data with this comprehensive project, designed for efficient and scalable streaming applications
This project demonstrates my NLP and web development skills using FastAPI to create a text summarization application. Users can summarize text and save results to SQLite. The application is containerized with Docker for easy deployment. I gained experience with NLP, RESTful APIs with FastAPI, and containerizing apps.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.