Course: Large Scale Computing Infrastructure 2012 Master class

Project Details

Project Lead
Sergio Maffioletti 
Project Manager
Sergio Maffioletti 
Project Members
Riccardo Murri, Lukasz Miroslaw, Tyanko Aleksiev, Antonio Messina, Christian Panse, Daniel Strebel, Silvan Troxler, Tobias Klauser, Alexander Filitz, Louis-Marie LOE, Henrique Mendonca, Marc Tobler, Fabian Christoffel, Robert Sharp, Nicolas Baer, Carol Alexandru, Cristian Mezzanotte  
Institution
University of Zurich, GC3: Grid Computing Competence Center
Discipline
Computer Science (401) 

Abstract

This is class [MINF4526] in the fall semester 2012 lectures at UZH (http://www.vorlesungen.uzh.ch/HS12/suche/sm-50544810.modveranst.html). This class will provide an introductory overview of large scale computing infrastructures. Taking both an engineering and an end-user perspective, it will focus on how scientific use cases can be enabled on such systems, which main technical challenges have to be addressed, and which solutions are currently available. The class will survey the current approaches to large scale computing. Grids and clouds will be treated as different approaches to the same category of problems: integrating large scale, high throughput scientific use cases. The first part will study in depth the technologies used in modern distributed infrastructures, focusing on security, access protocols, data handling and usability. The second part will provide the basic tools needed to understand how to enable large scale data analysis; it will be complemented by a basic introduction to the Python programming language and by examples taken from actual use cases. For the practical exercises and assignments, access to grid and cloud infrastructure will be provided. At the end of this course, students will understand the challenges that have to be addressed when supporting large scale computing, with a focus on how scientific applications can be scaled on large scale infrastructures. Basic knowledge of distributed systems and of the object-oriented programming paradigm is required for a full understanding of all modules of the class.

Intellectual Merit

This class will provide an overview of large scale computing and of how grid and cloud technologies can be leveraged. The goal is to have students construct scalable applications using AWS-like cloud systems (e.g. OpenStack, Eucalyptus).

Broader Impacts

At the end of this course, undergraduate and master's level students will understand the challenges that have to be addressed when supporting large scale computing, with a focus on how scientific applications can be scaled on large scale infrastructures.

Scale of Use

It would be preferable to be granted access to a small set of resources (enough to spawn 15-30 small VMs) every Thursday (during lab sessions), plus the ability to spawn a few dozen VMs on a sporadic basis.