Designing ETL Processes for Data Warehousing
The Extract, Transform, and Load (ETL) process is typically the most time-consuming, misunderstood, and underestimated task in building a data warehouse or other data integration application. ETL addresses the challenges of extracting data from disparate operational source systems, storing it in a data staging area, profiling it for errors, cleaning and transforming it, and bulk loading it into the target enterprise data warehouse, data marts, or operational systems. Source systems may include mainframe, relational, ERP/CRM, real-time, web, and desktop systems. Microsoft SQL Server Integration Services (SSIS) will be used as the ETL tool for workshops and homework assignments. Prior knowledge of SSIS is helpful but not required.
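
For readers who have not yet seen the stages named above, the following is a minimal sketch of extract, stage, profile, transform, and load in plain Python with an in-memory SQLite database. It is an illustration only and not how SSIS implements these steps. The sample data, the table names stg_customer and dim_customer, and all function names are hypothetical.

```python
import csv
import io
import sqlite3

# Hypothetical source extract; a real one would come from an operational system.
SOURCE_CSV = """customer_id,name,signup_date
1, Alice ,2023-01-15
2,Bob,
,Carol,2023-03-02
3,Dave,2023-04-10
"""


def extract(text):
    """Extract: read raw source rows as dictionaries, without altering them."""
    return list(csv.DictReader(io.StringIO(text)))


def stage(conn, rows):
    """Stage: land the raw rows unchanged in a staging table (all text columns)."""
    conn.execute(
        "CREATE TABLE stg_customer (customer_id TEXT, name TEXT, signup_date TEXT)"
    )
    conn.executemany(
        "INSERT INTO stg_customer VALUES (:customer_id, :name, :signup_date)", rows
    )


def profile(conn):
    """Profile: count staged rows with a missing key or a missing date."""
    missing_id, missing_date = conn.execute(
        "SELECT SUM(customer_id = ''), SUM(signup_date = '') FROM stg_customer"
    ).fetchone()
    return {"missing_id": missing_id, "missing_date": missing_date}


def transform_and_load(conn):
    """Transform and load: clean staged rows and insert them into the target table."""
    conn.execute(
        "CREATE TABLE dim_customer ("
        "customer_id INTEGER PRIMARY KEY, name TEXT NOT NULL, signup_date TEXT)"
    )
    # Reject rows without a key, trim names, and store empty dates as NULL.
    conn.execute(
        """
        INSERT INTO dim_customer (customer_id, name, signup_date)
        SELECT CAST(customer_id AS INTEGER), TRIM(name), NULLIF(signup_date, '')
        FROM stg_customer
        WHERE customer_id <> ''
        """
    )
    return conn.execute("SELECT COUNT(*) FROM dim_customer").fetchone()[0]


def run_pipeline(text=SOURCE_CSV):
    """Run all stages in order and return the profile report, row count, and rows."""
    conn = sqlite3.connect(":memory:")
    stage(conn, extract(text))
    report = profile(conn)
    loaded = transform_and_load(conn)
    rows = conn.execute(
        "SELECT customer_id, name, signup_date FROM dim_customer ORDER BY customer_id"
    ).fetchall()
    conn.close()
    return report, loaded, rows


if __name__ == "__main__":
    report, loaded, rows = run_pipeline()
    print(report, loaded, rows)
```

Landing the data in a staging table before cleaning keeps the raw extract available for profiling and reprocessing, which is the role the staging area plays in the description above.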