Running Github Analysis with Spark

Project ID
FG-539
Project Categories
Computer Science
Project Keywords
Abstract
The project aims at analyzing Github data using Apache Spark. Spark will be used to achieve high performance and to manage the large volume of data
Use of FutureSystems
FutureSystems will be used to manage and run a Apache Spark cluster that will be used to run analysis on the data
Scale of Use
We would need 13 nodes for this analysis. to set-up the Apache Spark cluster