Catalog

MSAN 694 - Distributed Computing (1)

Big data does not fit on a single machine, so analysts must rely on clusters of machines that cooperate to compute results. This course introduces students to map-reduce systems such as Hadoop and domain-specific languages such as Pig. Students learn to re-express programs as map-reduce jobs and submit them to environments such as Amazon's Elastic MapReduce.