Project files for the post, Installing Apache Superset on Amazon EMR: Add data exploration and visualization to your analytics cluster. Please see the post for complete instructions on using the project's files.
Create the CloudFormation stack.
python3 ./create_cfn_stack.py \
--ec2-key-name <your_key_pair_name> \
--ec2-subnet-id <your_subnet_id> \
--environment dev
Install Superset on the EMR cluster's master node.
python3 ./install_superset.py \
--ec2-key-path </path/to/my-key-pair.pem> \
--superset-port 8280
Open an SSH tunnel to the master node using dynamic port forwarding.
ssh -i </path/to/my-key-pair.pem> -ND 8157 hadoop@<public_master_dns>
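Once the tunnel is open, local port 8157 acts as a SOCKS proxy, so a request can be routed through it to confirm that Superset answers. The following is a minimal sketch, not part of the project's files: the helper names `superset_url` and `check_superset` are hypothetical, and it assumes Superset is reachable over plain HTTP on the port passed to `install_superset.py` (8280 above).

```shell
#!/usr/bin/env bash
# Hypothetical helpers for illustration; they are not part of the project's files.

# Build the Superset URL for a master node hostname and an optional port.
# Assumption: Superset is served over plain HTTP on port 8280 by default.
superset_url() {
  local host="$1"
  local port="${2:-8280}"
  printf 'http://%s:%s/\n' "${host}" "${port}"
}

# Fetch only the response headers through the SOCKS proxy on local port 8157.
# --socks5-hostname lets the far end of the tunnel resolve the hostname.
check_superset() {
  curl --silent --show-error --head \
    --socks5-hostname localhost:8157 \
    "$(superset_url "$1" "${2:-8280}")"
}

# Example call, once the tunnel is open:
# check_superset "<public_master_dns>"
```

If `check_superset` prints HTTP response headers, the tunnel and the Superset web server are both reachable. A connection error suggests either the tunnel is not open or nothing is listening on that port.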
Troubleshoot the Superset process running on the EMR master node.
lsof -i :8280
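The same check can be wrapped in a small function that reports a plain status, which is handy in a script. This is a sketch under stated assumptions: the function name `port_status` is hypothetical, and `lsof` may only show sockets owned by the current user unless it is run with elevated privileges.

```shell
#!/usr/bin/env bash
# Hypothetical helper for illustration; it is not part of the project's files.

# Report whether any process has a listening TCP socket on the given port.
# -n and -P skip hostname and port-name lookups; -sTCP:LISTEN keeps only listeners.
# If lsof is missing or cannot see the socket, this reports "not listening".
port_status() {
  local port="$1"
  if lsof -nP -iTCP:"${port}" -sTCP:LISTEN >/dev/null 2>&1; then
    echo "listening"
  else
    echo "not listening"
  fi
}

# 8280 is the port passed to install_superset.py above.
port_status 8280
```

A result of "not listening" on the master node suggests that the Superset process did not start or has exited.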