Distributed real time analytics using JavaScript. JavaScript re imagine of fabulous Apache Spark and Storm projects.
If you know underscore.js
or lodash.js
you may think of JS-Spark
as distributed version of them.
If you know Distributed-RPC systems like storm you will feel like home.
Hadoop is quite slow and requires maintaining cluster - we want to do better. Imagine that theres no need to setup expansive cluster/clud solutions. Use webrowsers! Easily scale to multiple clients. Clients do not need to install anything like Java or other plugins.
Setup in mater of minutes and you are good to go.
No need to setup expensive cluster. The setup takes 5 min and you are good to go. You can do it on one machine. Even on Raspberry Pi
-
Use as CDN ... Today most websites runs slower with more clients use them. But using JSpark you can totally reverse this trend. Build websites that run FASTER the more people use them
-
Use as ML tool may process in real time huge streams of data... while all clients still browse their favorite websites
-
Use as Big data analytics. Connect to Hadoop HDFS and process even terabytes of data.
-
Use to safely transfer huge ammount of data to remote computers.
-
Synchronize data between multiple smartphones.. even in Africa
-
No expensive cluster setup required!
-
Free to use.
Prerequisites, install any! server (ex: Node.js), then:
run `npm install`
Proof Of Concept(POC):
nodemon server.js
nodemon client.js
Start on your machine and see how the clients do all calculation.
For the Web Application:
run `node serverExpress.js`
wait for clients to do all heavy lifting
This project is about to reimplemented some nice things from the world of big data, so there are of course some nice resources you can use to dive into the topic: