E599/B649 Topics on High Performance Big Data Systems, Fall 2020
High-Performance Big Data Systems involve Large-Scale Data Analytics on High-Performance Computing (HPC) clusters optimized for data analysis. This course introduces research and development in hardware, algorithms and software for big data systems of transformational capability on computer architectures ranging from commodity clouds, hybrid HPC-clouds, and supercomputers. High-Performance systems are designed to scale and fully exploit the specialized features (communication, memory, energy, I/O, accelerator) of each different architecture. Students will study the literature of new hardware architectures (many-core and other emerging architectures) and benchmark on existing systems. Applications will range from pleasingly parallel, MapReduce, to Machine Learning/AI and Large Graph Analytics. A major student project aimed at a demonstration of the capabilities of High-Performance Big Data systems.