Investigate provenance collection for MapReduce

Project Details

Project Lead
Jiaan Zeng 
Project Manager
Jiaan Zeng 
Project Members
Felix Terkhorn, Abhirup Chakraborty  
Institution
Computer Science, Indiana University Bloomington  
Discipline
Computer Science (401) 

Abstract

Demonstrate provenance collection on FutureGrid by collecting provenance from Twister and visualizing them

Intellectual Merit

The first time to investigate data provenance in Twister/Hadoop. This work is research in D2I center supervised by Professor Beth Plale

Broader Impacts

collect and investigate data provenance from MapReduce framework

Scale of Use

20 or more VMs