Big Data Analysis
A required course in the Data Science Certificate Program.
This course will help students navigate through the complex layers of Big Data while providing insight on ways to effectively use technologies and architectures to create and manage big data workflows. Concepts covered include an introduction to Big Data and related technologies, discussion of Big Data Processing Architectures, explanation of major concepts behind Big Data Management, and how all of those topics are applied in Big Data Analysis. Students will gain an understanding of the characteristics of big data and techniques for working on big data platforms through hands-on exercises in the tools and systems used by data scientists and data engineers including Hadoop (HiveQL & PIG), Apache Spark, and SparkSQL.
Required prerequisites: I&C SCI X427.05 Fundamentals of Data Science.
Click on "See Details" below and refer to "Special Notes" for additional section specific information.