Course: CIIC 8995: Mining Massive Datasets

Project Details

Project Lead
Edgar Acuna 
Project Manager
Edgar Acuna 
Institution
University of Puerto Rico Mayaguez, Mathematical Sciences  
Discipline
Computer Science (401) 
Subdiscipline
11.01 Computer and Information Sciences, General 

Abstract

This course provides a introduction to the analysis of massive datasets using Hadoop and MapReduce. The emphasis in the use of distributed computation paradigms to carry out algorithms for massive datasets.

Intellectual Merit

In this course, students will be exposed to the state-of-the-art computation models for analyzing massive datasets. Massive datasets are appearing now very often in the scientific community.

Broader Impacts

This project will provide research and educational opportunities to graduate students. Software, curriculum materials and publications attained under this project will be distributed to the scientific community as free, open source materials.

Scale of Use

The class size will be about 10 students