Giter Club home page Giter Club logo

cs523's Introduction

Final Project of CS523 - Big Data Technology

Professor: Mrudula Mukadam

Team members of Group 1

  • Tam Van Vo - 610746
  • Minh Tuan Bui - 610582
  • Bao Nguyen Nguyen - 610116

Slide Presentation at here or OneDrive

Getting Started

Install Spark 2.4.4 on Hadoop 2.6

sh setup_spark2.sh

Setting enviroment $HADOOP_CONF_DIR

export HADOOP_CONF_DIR=$HADOOP_HOME/etc/hadoop

AWS Lambda function triggered by SNS topic

deploy lambda.py to AWS and add SNS trigger arn:aws:sns:us-east-1:470049585876:OPENAQ_NEW_MEASUREMENT

โš ๏ธ it will be triggered every minute and create many S3 files in bucket air-quality-live.

Publish data to Kafka

java -cp ./target/final-1.0-jar-with-dependencies.jar cs523.agent.App US pm25 2020-01-01T00:00:00 2020-02-01T00:00:00

Spark streaming from S3 bucket s3://air-quality-live

spark-submit --class cs523.App --master yarn ./target/final-1.0-jar-with-dependencies.jar s3

Spark streaming from Kafka server

spark-submit --class cs523.App --master yarn ./target/final-1.0-jar-with-dependencies.jar kafka

Spark SQL

spark-submit --class cs523.App --master yarn ./target/final-1.0-jar-with-dependencies.jar sql

Live visualization of air quality

python3 live_air_quality.py

cs523's People

Contributors

votamvan avatar dependabot[bot] avatar ennguyennguyen avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.