Big Data for Science Summer School July 26-30 2010

Project Details

Project Lead
Judy Qiu 
Project Manager
Judy Qiu 
Institution
Indiana University, School of Informatics and Computing  
Discipline
Computer Science (401) 

Abstract

Support Big Data workshop with over 200 attendees from 10 sites across the USA. This summer school has selected hands on session on MapReduce, Eucalyptus and Appliances plus many presentations from leading researchers in data intensive computing

Intellectual Merit

Data Intensive computing is rapidly growing in importance and this summer school will advance broad knowledge of this field. We will develop tutorials for Hadoop and Twister on FutureGrid that can be used by other activities. We will also use FutureGrid for introductory tutorial using Eucalyptus and demonstrations of Virtual Appliance technology. In the “Big Data for Science” workshop, 200 students across 10 institutions (Arkansas High Performance Computing Center, University of Arkansas, Fayetteville; Electronic Visualization Laboratory, University of Illinois at Chicago; Indiana University, Bloomington; Institute for Digital Research and Education, University of California, Los Angeles; Michigan State University, East Lansing; Pennsylvania State University, University Park; University of Iowa, Iowa City; University of Minnesota Supercomputing Institute, Minneapolis; University of Notre Dame, Notre Dame, Indiana; and University of Texas at El Paso). Students in the workshop used FutureGrid in hands-on activities that covered, among others, Hadoop/MapReduce, Twister, Grid Appliance, and GroupVPN.

Broader Impacts

This project involves many students who will be able to get an understanding of modern data intensive computing

Scale of Use

Essentially all nodes on Sierra and India needed

Results

The workshop was successfully delivered with good reviews. In this “Big Data for Science” workshop, over 200 students across 10 institutions (Arkansas High Performance Computing Center, University of Arkansas, Fayetteville; Electronic Visualization Laboratory, University of Illinois at Chicago; Indiana University, Bloomington; Institute for Digital Research and Education, University of California, Los Angeles;  Michigan State University, East Lansing; Pennsylvania State University, University Park; University of Iowa, Iowa City; University of Minnesota Supercomputing Institute, Minneapolis; University of Notre Dame, Notre Dame, Indiana; and University of Texas at El Paso). Additionally 100 additional students attended via streaming video. Students in the workshop used FutureGrid in hands-on activities that covered, among others, Hadoop/MapReduce, Twister, Grid Appliance, and GroupVPN.
See http://salsahpc.indiana.edu/tutorial/index.html