Nowadays, it's rare to find someone who hasn't heard of Machine Learning. Perhaps they don't know it, but they have used some application or virtual assistant at some point. With the aim of providing a brief introduction to Machine Learning and demystifying some phrases that are often repeated around ML, we're connecting at IMMUNE to talk about this topic, which is so fashionable today.
És la capacitat de les màquines per aprendre i millorar a partir de dades, sense estar programades explícitament.
Depending on the level of depth you want to reach, you can find different variations of the same definition. If you are looking for an informal definition:
It is making predictions from data.
Instead, if you dig a little deeper, you can find a slightly more formal definition:
The construction of a statistical model that is an “underlying” distribution from which the data has been drawn.
But wait! There's more! You can even take it to a more formal definition using mathematics.
A training dataset
A hypothesis class H:
An objective function and an optimisation method
The overview is a mapping:
Division of ML problems
Normally we always come across the typical division of Supervised o Unsupervised Learning. However, there are more ways to divide Machine Learning problems and based on your problem, we will talk about one or the other.
Supervised Learning | Unsupervised Learning
Parametric Models | Non-parametric Models
Modeling Approach | Optimization Techniques
When working with ML problems, a question might arise before we begin: Which is more important, drawing conclusions from data or making very good predictions?
That question is entirely valid and, in fact, a logical one to ask, it's called interpretabilité – prédictions. When we talk about inference, we usually talk about drawing clear conclusions from the data, such as how variable Y is affected by X, etc. But on the other hand, when we talk about predictions, we are talking about obtaining a clear and precise output from our model. They are two opposing viewpoints, but in practice, a mixture of both is usually worked with.
This makes us realise that there are models that are more easily interpretable than others. For example, linear regressions are very easy to interpreting but instead, they are few flexible since they only generate linear functions. Polynomial functions, on the other hand, are more flexible, as it can generate a larger number of “shapes”, but they are more complicated to interpret.
But... why does Machine Learning work?
Basically, Machine Learning works because we have an enormous amount of data (Big Data) alongside the mathematics that lie behind each model. The Law of Large Numbers He speaks to us about this very matter, in summary he says that the more data we have, the closer we will get to the original data distribution, meaning our model will be better.
Machine Learning in Industry
When a company tries to implement Machine Learning models in its projects, it may encounter several issues, here are some of the most common ones:
1. Run very powerful models
Sometimes there is a lack of resources to run them, and the cost of having a very powerful model running 24 hours a day is very high, which not all companies can afford. Sometimes, it's simply a problem of how to adapt models (BERT, GPT-2, ...) to your use case.
2.- Model deployments
Deploying Machine Learning models is not a trivial task; it's part of the end-to-end process of any ML project and can sometimes be challenging. This can be due to a lack of resources or because of project requirements you need to meet (latency, availability, etc.).
3.- Data
Data is a fundamental part of ML, however, there are sometimes restrictions on its use. Restrictions that are totally necessary because it is what allows us to protect the user, and as data scientists, we must promote that philosophy. Other times, there is simply not enough data governance, meaning it is not being valued within the company, and it is complicated to make use of it.
In summary, in this session we were with Alejandro Diaz talking about a brief introduction to Machine Learning and how to demystify some of the comments surrounding it. If you'd like more webinars like this, let us know, as well. here You will be able to find more information about our programmes.



