Study Plan
0. Prework
This Prework stage introduces concepts that the program will take to greater depth, so students are well grounded from the first day and the entire group has an even level, which allows for further and better progress, as well as improving cooperation between all participants.
- Computer basics: Concepts including hardware and software, CPU, memory, storage devices, operating systems and networks.
- Introduction to programming languages: Explanation of what a programming language is, what it is used for, and the types of languages that exist (compiled and interpreted). An overview of the most commonly used languages today, and why they are used.
- Fundamental programming concepts: Discussion of variables, data types, operations, control flow structures (if/else, loops), and functions. How to break down a complex problem into smaller, more manageable subproblems. Issues are explained in a basic way to avoid overlapping with the Programming Fundamentals module (with Python).
- Development tools and good practices. Introduction to the use of an IDE, such as PyCharm or VSCode, as well as notebooks. Discussion about version control with Git. Underlining good programming practices, such as the importance of commenting code and following style conventions (Pythonic Code).
- Introduction to data structures: Overview of concepts such as arrays, lists, sets, dictionaries/maps and trees. The focus is not confined to a specific programming language, but also at pseudocode level. The aim is for students to understand what they are, what they are used for, and when it might be appropriate to use one data structure over another.
- Fundamental database concepts: Explanation of what a database is, what it is used for, and the various types that exist (e.g. relational and non-relational databases). Introduction to key concepts such as table, record, field, primary key, and relationships between tables.
1. Data Scientists’ Tools
50h.
Review of the key programming concepts needed to process and use data by means of code. Introduction to R programming language and an extensive overview of the capabilities that Python offers.
- Fundamental concepts of Python and libraries for data science: Numpy, Pandas.
- Python - intermediate and advanced.
- Data processing and visualization with Python.
2. Business Intelligence and Visualization
80h.
This module examines what databases are and discusses the main types. Students explore the world of relational database modeling and learn to program in SQL. In addition, the module looks at ETL processes and how they are designed and implemented.
- Database design.
- SQL Standard I
- SQL Standard II
- The data warehouse and ‘extract, transform and load’ (ETL) tools and processes.
3. Data Science. Data Analysis, Mining, and Visualization Technique
80h.
Review of the life cycle of data and how it affects the data analysis process. Students also explore the world of data visualization and learn to design dashboards in PowerBI.
- Data quality and life cycle.
- Data preparation and pre-processing.
- Visualization tools and techniques I
- Visualization tools and techniques II
4. Impact and Value of Big Data
80h.
This module examines big data analysis as a tool to address key research questions and issues. Students will understand what is meant by big data, how it has developed over time, the causes that have led to the emergence of big data technologies and how it compares with traditional business intelligence.
- Introduction to the world of big data.
- Business intelligence vs big data.
- Big data technologies.
- The value of data and applications across sectors
5. Big Data Tools and Technology
80h.
Discover and use the tools that comprise the ecosystem in order to process vast quantities of data. These include Spark, Hadoop, and NoSQL databases.
- HADOOP and its ecosystem.
- SPARK
- NoSQL databases
- Cloud platforms
6. Statistics for the Data Scientist
80h.
In this module students learn the fundamental concepts of programming in R, a programming language for statistical computing. The main statistical concepts essential for data analysis are also introduced here.
- Introduction to statistics.
- Probability and sampling.
- Inference and lineal regression.
- Experiment design.
7. Machine Learning
64h.
Students learn the fundamental concepts and algorithms of machine learning, one of the cornerstones of data science and artificial intelligence.
- Machine learning tools and the techniques and applications of supervised learning.
- Techniques and applications of unsupervised learning.
- Deep learning models and techniques.
- Cloud solutions for machine learning.
8. Capstone Project
50h.
This final project of the Master’s puts the knowledge and skills acquired by students into practice by applying it to a real-world case and with data connected to the business world. It is a mandatory component within the scope of the final part of the study plan and is carried out under the supervision of an assigned tutor. Students must also undergo a viva voce exam of their capstone project.