ParSoDA (Parallel Social Data Analytics) is a Java programming library for simplifying the development of parallel social media mining application executed on High Performance Computing systems. ParSoDA defines a general framework for a social media analysis application that includes a number of steps (data acquisition, filtering, mapping, partitioning, reduction, analysis, and visualization), and provides a predefined (but extensible) set of functions for each data processing step. Thus, an application developed with ParSoDA is expressed by a concise code that specifies the functions invoked at each step. User applications based on the ParSoDA library can be run on both Apache Hadoop and Spark clusters. The current version of the library (v. 1.3.0 dated October 25, 2018) contains more than forty predefined functions organized in seven packages, corresponding to the seven ParSoDA steps.
xhendyagsx / parsoda Goto Github PK
View Code? Open in Web Editor NEWThis project forked from scalabunical/parsoda
ParSoDA (Parallel Social Data Analytics) is a Java library for social media analytics
License: GNU General Public License v3.0