
Let's see what you are going to learn in this course, exactly.
A definition of machine learning and a description of the machine learning process.
Maybe the most important model categorization of machine learning models: models that have a target variable (supervised) vs. models that do not have one (unsupervised).
There are two purposes we can use machine learning models: prediction and inference. Here we'll discuss the difference between them.
Should we prefer a simple restrictive ML model or a more complex one? Here we discuss the pros and cons for each.
Here we present a couple of indicators that measure the prediction performance for the regression models.
The best indicators that help us estimate the predictive performance of the classification models.
This is a crucial issue in machine learning. Some models may have low bias, but high variance, other models may present high bias, but lower variance. Which is best? How to select an "optimal" model?
Machine learning models should work well on independent data sets. This is why it is absolutely necessary to validate them before actually using them.
The advantages and disadvantages of the validation set approach.
What is leave-one-out cross validation and what are its strengths and weaknesses.
What is k-fold cross-validation, how and when we should use it.
How to run an ordinary least squares regression in R, in order to predict the values of a numeric (quantitative) variable.
How to validate an OLS regression model using the validation set approach.
How to validate an OLS regression model using the k-fold cross-validation method.
When we have a lot of predictors in our regression model, we are often interested to identify the most relevant of them. Here we explain how we can do that using the best subset regression.
How to perform a forward selection regression in order to find the predictors that most influence the response variable.
How to perform a backward forward selection regression, with the same goal: find the predictors that have the strongest effect on the response variable.
How to validate the subset selection regression on an independent data set.
How to run a ridge regression in R and interpret the output.
How to validate the ridge regression using the k-fold cross-validation approach.
How to perform a lasso regression in R and interpret the results.
How to validate the lasso regression model with the k-fold cross-validation method.
How to run a PLS regression in R and interpret the output.
How to compute the prediction accuracy of a PLS regression model in the validation set.
How to run the logistic regression in R and interpret the coefficients.
How to compute the percentage of correctly classified cases for a logistic regression (in the whole sample).
How to draw and interpret the ROC curve - one of the most important output of a logistic regression.
How to test a logistic regression model in an independent data set.
If our logistic regression model has many predictors, we can use the lasso logistic regression to select the most effective of them. In this lecture we learn how to do that.
How to validate a lasso logistic regression model - and select the model with the highest prediction accuracy.
How to run a linear discriminant analysis and interpret the output.
How to test (validate) our linear discriminant analysis.
How to execute a quadratic discriminant analysis in R.
How to validate our quadratic discriminant model (check if it does a good prediction job in independent data sets).
Here we explain in detail how the naive Bayes procedure works, and give a simple illustration in a hypothetical situation.
How to apply the naive Bayes classification in R using the e1071 package.
How to validate a naive Bayes estimation model.
How to perform a naive Bayes classification in R using the naivebayes package (an how to test our model).
This course offers you practical training in machine learning, using the R program. At the end of the course you will know how to use the most widespread machine learning techniques to make accurate predictions and get valuable insights from your data.
All the machine learning procedures are explained live, in detail, on real life data sets. So you will advance fast and be able to apply your knowledge immediately – no need for painful trial-and-error to figure out how to implement this or that technique in R. Within a short time you can have a solid expertise in machine learning.
Machine learning skills are very valuable if you intent to secure a job like data analyst, data scientist, researcher or even software engineer. So it may be the right time for you to enroll in this course and start building your machine learning competences today!
Let’s see what you are going to learn here.
First of all, we are going to discuss some essential concepts that you must absolutely know before performing machine learning. So we’ll talk about supervised and unsupervised machine learning techniques, about the distinctions between prediction and inference, about the regression and classification models and, above all, about the bias-variance trade-off, a crucial issue in machine learning.
Next we’ll learn about cross-validation. This is an all-important topic, because in machine learning we must be able to test and validate our model on independent data sets (also called first seen data). So we are going to present the advantages and disadvantages of three cross-validations approaches.
After the first two introductory sections, we will get to study the supervised machine learning techniques. We’ll start with the regression techniques, where the response variable is quantitative. And no, we are not going to stick to the classical OLS regression that you probably know already. We will study sophisticated regression techniques like stepwise regression (forward and backward), penalized regression (ridge and lasso) and partial least squares regression. And of course, we’ll demonstrate all of them in R, using actual data sets.
Afterwards we’ll go to the classification techniques, very useful when we have to predict a categorical variable. Here we’ll study the logistic regression (classical and lasso), discriminant analysis (linear and quadratic), naïve Bayes technique, K nearest neighbor, support vector machine, decision trees and neural networks.
For each technique above, the presentation is structured as follows:
* a short, easy to understand theoretical introduction (without complex mathematics)
* how to train the predictive model in R
* how to test the model to make sure that it does a good prediction job on independent data sets.
In the last sections we’ll study two unsupervised machine learning techniques: principal component analysis and cluster analysis. They are powerful data mining techniques that allow you to detect patterns in your data or variables.
For each technique, a number of practical exercises are proposed. By doing these exercises you’ll actually apply in practice what you have learned.
This course is your opportunity to become a machine learning expert in a few weeks only! With my video lectures, you will find it very easy to master the major machine learning techniques. Everything is shown live, step by step, so you can replicate any procedure at any time you need it.
So click the “Enroll” button to get instant access to your machine learning course. It will surely provide you with new priceless skills. And, who knows, it could give you a tremendous career boost in the near future.
See you inside!