This project is for students enrolled in CSCI 4830 & 7000 in Fall 2014 at the University of Colorado. Students learn about "data center scale" computing, meaning large-scale distributed systems suited to data-centric (rather than purely computational) workloads. Using VM resources, students will work with Hadoop, Spark, and related tools and will also build a distributed application.
Use of FutureSystems
Students will use FutureGrid to develop a distributed application as part of the coursework and to develop their own course projects.
Scale of Use
Students will need a few VMs, primarily during the middle of the semester.