Study Plan
Data for analysis often comes from multiple sources in different formats, which makes it less useful in its raw state. For this reason, data must be prepared before integration through extract, transform and load (ETL) processes, and students learn to do this with the Talend Open Studio suite. We also teach how to visualize data with Python so that students can answer questions and, ultimately, make decisions.
Data Store: The Data Warehouse and Extract, Transform and Load (ETL) Tools and Processes
The course focuses on data storage in a data warehouse and on the extract, transform and load (ETL) process, including how that process has evolved. Students also learn to create ETL processes for effectively managing large quantities of data in a data warehouse.
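To make the three ETL stages concrete, here is a minimal sketch in Python. It is not how Talend Open Studio works internally and is not taken from the course material. The sample data, the `orders` table, and the function names `extract`, `transform`, and `load` are all assumptions made for illustration, and an in-memory SQLite database stands in for a real data warehouse.

```python
import csv
import io
import sqlite3

# Hypothetical raw export: inconsistent casing, stray whitespace, a missing amount.
RAW_CSV = """order_id,customer,amount
1, alice ,10.50
2,BOB,
3,Carol,7.25
"""


def extract(text):
    """Extract: read CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: normalize customer names and drop rows with no amount."""
    cleaned = []
    for row in rows:
        amount = row["amount"].strip()
        if not amount:
            continue  # skip incomplete records
        cleaned.append(
            (int(row["order_id"]), row["customer"].strip().title(), float(amount))
        )
    return cleaned


def load(rows, conn):
    """Load: write the cleaned rows into the target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS orders "
        "(order_id INTEGER PRIMARY KEY, customer TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", rows)
    conn.commit()


# SQLite in memory stands in for the warehouse in this sketch.
conn = sqlite3.connect(":memory:")
load(transform(extract(RAW_CSV)), conn)
total = conn.execute("SELECT SUM(amount) FROM orders").fetchone()[0]
print(f"Loaded total: {total:.2f}")
```

Keeping each stage in its own function means the cleaning rules can change without touching how data is read or stored, which is the same separation that graphical ETL tools expose as distinct components.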
Effective Visualization of Information
This course addresses data visualization and its theoretical principles. Students examine the visualization process and study the visualization context, the use of color, Gestalt principles, and the relationships between data and attributes. They also look at real visualization cases built with graphs and data visualization libraries, and learn how to generate graphs in Python using Matplotlib and Seaborn. Finally, the course works through specific examples of data visualization.
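As a rough illustration of generating a graph in Python, the sketch below uses Matplotlib only (Seaborn is built on top of Matplotlib and is left out to keep the example small). The regions, sales figures, and output file name are hypothetical. The example applies one idea from the topics above: using color to draw attention to a single value while muting the rest.

```python
import matplotlib

matplotlib.use("Agg")  # non-interactive backend, so no display is needed
import matplotlib.pyplot as plt

# Hypothetical data for illustration only.
categories = ["North", "South", "East", "West"]
sales = [120, 95, 143, 80]

# Highlight the largest value in a saturated color and mute the others.
highlight = sales.index(max(sales))
colors = ["tab:orange" if i == highlight else "lightgray" for i in range(len(sales))]

fig, ax = plt.subplots(figsize=(6, 4))
bars = ax.bar(categories, sales, color=colors)
ax.set_title("Sales by region (illustrative data)")
ax.set_xlabel("Region")
ax.set_ylabel("Units sold")

# Remove the top and right borders to reduce visual clutter.
for side in ("top", "right"):
    ax.spines[side].set_visible(False)

fig.savefig("sales_by_region.png", dpi=150)
```

Muting every bar except one is a design choice, not a rule: it suits a chart meant to make a single point and would hide information in a chart meant for open-ended comparison.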