Course: P434 MapReduce Class Project

Project ID
Project Categories
This project is for an undergraduate distributed systems course (P434) taught by Scott Jensen at Indiana University for the Fall 2012 semester. In this project the students will use MapReduce to perform the ray tracing of an image based on a specification we provide them. Prior to this project they will have implemented a web service using Axis2 that provides a ray tracing service, but in that case the multi-threading is done on the client side to partition the image specification and submit multiple requests to the server. The client is then responsible for stitching the partitioned image together. In this second stage of the project, students will use MapReduce running on FutureGrid to perform the parallel computation for the ray tracing of the image using MapReduce to map the image specification as parallel computations and then reconstruct the final image in the reduce phase. The goal of the project is to introduce students to the MapReduce paradigm and through the use of FutureGrid, provide them the opportunity to learn about virtualized computing resources and cloud computing.
Use of FutureSystems
Students will be logging in to allocate a small number of VMs, configure the Hadoop software stack, and run their class project which performs parallel ray tracing.
Scale of Use
There are 15 students in the class, an instructor, and an associate instructor. The students will be running their parallel ray tracing jobs using a small set of VMs. In prior offerings of this class, a restriction of 4 VMs worked fine. Each run is expected to take less than 1/2 hour, and students will be told to release the resources after they have completed their runs.