Course: Fall 2015 I590/I400 Big Data: Data Management

Project Details

Project Lead
Beth Plale 
Project Manager
Zong Peng 
Project Members
Roderick Head, Zong Peng, Raghava Reddy  
Institution
indiana university, school of informatics and computing  
Discipline
Computer Science (401) 

Abstract

Fall 2015 Data Science course in management, access, and use of big and complex data.   Projects in data cleaning, analysis and modeling.

Intellectual Merit

Fall 2015 Data Science course in management, access, and use of big and complex data. Projects in data cleaning, analysis and modeling

Broader Impacts

Data is abundant and its abundance offers potential for new discovery, and economic and social gain. But data can be difficult to use. It can be noisy and inadequately contextualized. There can be too big a gap from data to knowledge, or due to limits in technology or policy not easily combined with other data. This course will examine the underlying principles and technologies needed to capture data, clean it, contextualize it, store it, access it, and trust it for a repurposed use. Specifically the course will cover the 1) distributed systems and database concepts underlying noSQL and graph databases, 2) best practices in data pipelines, 3) foundational concepts in metadata and provenance plus examples, and 4) the developing theory in data trust and its role in reuse.

Scale of Use

twe weeks