E599/B649 Topics on High Performance Big Data Systems, Spring 2020

Project ID
FG-575
Project Lead
Project Categories
Computer Science
Project Keywords
Abstract

High-Performance Big Data Systems involve Large-Scale Data Analytics on High-Performance Computing (HPC) clusters optimized for data analysis. This course introduces research and development in hardware, algorithms and software for big data systems of transformational capability on computer architectures ranging from commodity clouds, hybrid HPC-clouds, and supercomputers. High-Performance systems are designed to scale and fully exploit the specialized features (communication, memory, energy, I/O, accelerator) of each different architecture. Students will study the literature of new hardware architectures (many-core and other emerging architectures) and benchmark on existing systems. Applications will range from pleasingly parallel, MapReduce, to Machine Learning/AI and Large Graph Analytics. A major student project aimed at a demonstration of the capabilities of High-Performance Big Data systems.

Use of FutureSystems
Classroom
Scale of Use
Up to 30 students running virtual machines, possibly at limited times.