This project is the implementation in Airflow of a data pipeline workflow (Airflow DAG) that will automate the ETL of loading JSON files from S3 into a Redshift cluster.
The DAG is implemented to load data from S3 into staging tables in Redshift.
The data resides in S3 objects as JSON files, and so the SQL scripts use the COPY command in Redshift to import all data.