These experimental docker containers allow to develop and run data science applications using Apache Spark.
For this purpose, the following languages are supported:
- Java / Scala
- R using SparkR
- Python using PySpark
Also, a number of libraries for R and Python have been installed:
- Pandas
- SciPy
- NumPy
- scikit-learn
- Various R packages
Check out https://github.com/bwv988/datascience-playground for usage examples.
- Install libs & frameworks for using Deep Learning and Artifical Neural Networks.
- Apache Spark 2.1.1
- Python 3.x
- R 3.x