I&C SCI X425.18

Hadoop: In Theory and Practice

Today, organizations in every industry are being showered with imposing quantities of new information. Along with traditional sources, many more data channels and categories now exist. Collectively, these vastly larger information volumes and new assets are known as Big Data. Enterprises are using technologies such as MapReduce and Hadoop to extract value from Big Data. This course provides an in-depth overview of Hadoop and MapReduce, the cornerstones of big data processing. To crystalize the concepts behind Hadoop and MapReduce, you will work through a series of short/ focused exercises; you will configure and install a Hadoop cluster, write basic MapReduce programs, gain familiarity with advanced MapReduce programming practices, and utilize interfaces such as Pig and Hive to interact with Hadoop. You will also learn about real-world situations were MapReduce techniques can be used.

Pre-requisite: "This course is appropriate for developers who want to get acquainted with Hadoop concepts and who will be writing, maintaining and/or optimizing Hadoop jobs. Data Architects and IT Managers/Directors who want an in-depth look into Big Data technologies such as Hadoop and MapReduce also stand to benefit from this course.

Participants should have some programming experience; a background in Java is preferred, but experience with other programming languages such as PHP, Python, or C# is sufficient. Understanding of common computer science concepts is a plus. Prior knowledge of Hadoop is not required.
Access to a Windows PC. For example: WIndows XP, Windows 7, Windows 10 PC
Hadoop is written in Java, so you will need to have Java installed on your machine, version 6 or later. Sun’s JDK is the most widely used with Hadoop. In case you do not have Java installed, step-by-step instructions will be provided to install Java."

Icon
Course
Approximate Cost TBD
Format Online
Duration TBD
Total Credits 3