# Regression Machine Learning with R

**11 hours**left at this price!

- 5.5 hours on-demand video
- 10 articles
- 10 downloadable resources
- Full lifetime access
- Access on mobile and TV

- Certificate of Completion

Get your team access to 4,000+ top Udemy courses anytime, anywhere.

Try Udemy for Business- Read S&P 500® Index ETF prices data and perform regression machine learning operations by installing related packages and running code on RStudio IDE.
- Create target and predictor algorithm features for supervised regression learning task.
- Select relevant predictor features subset through Student t-test and ANOVA F-test univariate filter methods.
- Choose relevant predictor features subset through recursive feature elimination deterministic wrapper method.
- Designate relevant predictor features subset through least absolute shrinkage and selection operator embedded method.
- Extract predictor features transformations through principal component analysis.
- Train algorithm for mapping optimal relationship between target and predictor features.
- Test algorithm for evaluating previously optimized relationship forecasting accuracy through scale-dependent and scale-independent error metrics.
- Calculate generalized linear models such as linear regression or elastic net regression and select optimal linear regression coefficients regularization parameter through time series cross-validation.
- Compute similarity methods such as k nearest neighbors and select optimal number of nearest neighbors parameter through time series cross-validation.
- Estimate frequency methods such as decision tree and select optimal maximum tree depth parameter through time series cross-validation.
- Calculate ensemble methods such as random forest or extreme gradient boosting machine and select optimal number of randomly selected predictors or maximum trees depth parameters through time series cross-validation.
- Compute maximum margin methods such as linear or non-linear support vector machines and select optimal error term penalization parameter through time series cross-validation.
- Estimate multi-layer perceptron methods such as artificial neural network and select optimal node connection weight decay regularization parameter through time series cross-validation.
- Compare regression machine learning algorithms training and testing.

In this lecture you will view course disclaimer and learn which are its objectives, how you will benefit from it, its previous requirements and my profile as instructor.

In this lecture you will learn that it is recommended to view course in an ascendant manner as each section builds on last one and also does its complexity. You will also study course structure and main sections (algorithm learning, generalized linear models, similarity methods, frequency methods, ensemble methods, maximum margin methods and multi-layer perceptron methods).

In this lecture you will learn regression machine learning .TXT data file in .CSV format downloading, .TXT R script code file downloading, advanced forecasting models packages installation (caret, corrplot, e1071, elasticnet, forecast, kernlab, lars, nnet, party, penalized, plyr, quantmod, randomForest, rpart, xgboost) and RStudio Integrated Development Environment (IDE) project creation.

In this lecture you will learn section lectures’ details and main themes to be covered related to algorithm learning (algorithm features, features selection, features extraction, algorithm training and algorithm testing).

In this lecture you will learn non-linear or radial basis function support vector machine testing definition and main calculations (predict.train(), cbind(), index(), as.data.frame(), xts(), as.Date(), plot(), lines(), ts(), coredata(), accuracy() functions).

- R statistical software is required. Downloading instructions included.
- RStudio Integrated Development Environment (IDE) is recommended. Downloading instructions included.
- Practical example data and R script code files provided with the course.
- Prior basic R statistical software knowledge is useful but not required.
- Mathematical formulae kept at minimum essential level for main concepts understanding.

**Full Course Content Last Update 07/2018**

Learn regression machine learning through a practical course with R statistical software using S&P 500® Index ETF prices historical data for algorithm learning. It explores main concepts from basic to expert level which can help you achieve better grades, develop your academic career, apply your knowledge at work or do your business forecasting research. All of this while exploring the wisdom of best academics and practitioners in the field.

**Become a Regression Machine Learning Expert in this Practical Course with R**

Read S&P 500® Index ETF prices data and perform regression machine learning operations by installing related packages and running code on RStudio IDE.

Create target and predictor algorithm features for supervised regression learning task.

Select relevant predictor features subset through Student t-test and ANOVA F-test univariate filter methods.

Choose relevant predictor features subset through recursive feature elimination deterministic wrapper method.

Designate relevant predictor features subset through least absolute shrinkage and selection operator embedded method.

Extract predictor features transformations through principal component analysis.

Train algorithm for mapping optimal relationship between target and predictor features.

Test algorithm for evaluating previously optimized relationship forecasting accuracy through scale-dependent and scale-independent error metrics.

Calculate generalized linear models such as linear regression or elastic net regression and select optimal linear regression coefficients regularization parameter through time series cross-validation.

Compute similarity methods such as k nearest neighbors and select optimal number of nearest neighbors parameter through time series cross-validation.

Estimate frequency methods such as decision tree and select optimal maximum tree depth parameter through time series cross-validation.

Calculate ensemble methods such as random forest or extreme gradient boosting machine and select optimal number of randomly selected predictors or maximum trees depth parameter through time series cross-validation.

Compute maximum margin methods such as linear or non-linear support vector machines and select optimal error term penalization parameter through time series cross-validation.

Estimate multi-layer perceptron methods such as artificial neural network and select optimal node connection weight decay regularization parameter through time series cross-validation.

Compare regression machine learning algorithms training and testing.

**Become a Regression Machine Learning Expert and Put Your Knowledge in Practice**

Learning regression machine learning is indispensable for data mining applications in areas such as consumer analytics, finance, banking, health care, science, e-commerce and social media. It is also essential for academic careers in data mining, applied statistical learning or artificial intelligence. And it is necessary for business forecasting research.

But as learning curve can become steep as complexity grows, this course helps by leading you step by step using S&P 500® Index ETF prices historical data for algorithm learning to achieve greater effectiveness.

**Content and Overview**

This practical course contains 56 lectures and 5.5 hours of content. It’s designed for all regression machine learning knowledge levels and a basic understanding of R statistical software is useful but not required.

At first, you’ll learn how to read S&P 500® Index ETF prices historical data to perform regression machine learning operations by installing related packages and running script code on RStudio IDE.

Then, you’ll define algorithm features by creating target and predictor variables for supervised regression learning task. Next, you’ll only include relevant predictor features subset or transformations in algorithm learning through features selection and features extraction procedures. For features selection, you’ll define univariate filter methods, deterministic wrapper methods and embedded methods. For univariate filter methods, you’ll implement Student t-test and ANOVA F-test. For deterministic wrapper methods, you’ll implement recursive feature elimination. For embedded methods, you’ll implement least absolute shrinkage and selection operator or lasso. For features extraction, you’ll implement principal component analysis. After that, you’ll define algorithm training through mapping optimal relationship between target and predictor features within training range. For algorithm training, optimal parameters selection or fine tuning, bias-variance trade-off, optimal model complexity and time series cross-validation are defined. Later, you’ll define algorithm testing through evaluating previously optimized relationship forecasting accuracy through scale-dependent and scale-independent metrics within testing range. For scale-dependent metrics, you’ll define mean absolute error and root mean squared error. For scale-independent metrics, you’ll define mean absolute percentage error.

After that, you’ll define generalized linear models such as linear regression and elastic net regression. Next, you’ll implement algorithm training for mapping optimal relationship between target and predictor features within training range. For algorithm training, you’ll use only relevant predictor features subset or transformations through principal component analysis procedure and linear regression coefficients regularization optimal parameter estimation or fine tuning through time series cross-validation. Later, you’ll implement algorithm testing for evaluating previously optimized relationship forecasting accuracy through scale-dependent and scale-independent error metrics within testing range.

Then, you’ll define similarity methods such as k nearest neighbors. Next, you’ll implement algorithm training for mapping optimal relationship between target and predictor features within training range. For algorithm training, you’ll use only relevant predictor features subset or transformations through principal component analysis procedure and number of nearest neighbors optimal parameter estimation or fine tuning through time series cross-validation. Later, you’ll implement algorithm testing for evaluating previously optimized relationship forecasting accuracy through scale-dependent and scale-independent error metrics within testing range.

After that, you’ll define frequency methods such as decision tree. Next, you’ll implement algorithm training for mapping optimal relationship between target and predictor features within training range. For algorithm training, you’ll use only relevant predictor features subset or transformations through principal component analysis procedure and maximum tree depth optimal parameter estimation or fine tuning through time series cross-validation. Later, you’ll implement algorithm testing for evaluating previously optimized relationship forecasting accuracy through scale-dependent and scale-independent error metrics within testing range.

Then, you’ll define ensemble methods such as random forest and extreme gradient boosting machine. Next, you’ll implement algorithm training for mapping optimal relationship between target and predictor features within training range. For algorithm training, you’ll use only relevant predictor features subset or transformations through principal component analysis procedure and number of randomly selected predictors or maximum tree depth optimal parameter estimation or fine tuning through time series cross-validation. Later, you’ll implement algorithm testing for evaluating previously optimized relationship forecasting accuracy through scale-dependent and scale-independent error metrics within testing range.

After that, you’ll define maximum margin methods such as linear and non-linear or radial basis function support vector machines. Next, you’ll implement algorithm training for mapping optimal relationship between target and predictor features within training range. For algorithm training, you’ll use only relevant predictor features subset or transformations through principal component analysis procedure and error term penalization optimal parameter estimation or fine tuning through time series cross-validation. Later, you’ll implement algorithm testing for evaluating previously optimized relationship forecasting accuracy through scale-dependent and scale-independent error metrics within testing range.

Then, you’ll define multi-layer perceptron methods such as artificial neural network. Next, you’ll implement algorithm training for mapping optimal relationship between target and predictor features within training range. For algorithm training, you’ll use only relevant predictor features subset or transformations through principal component analysis procedure and node connection weight decay regularization optimal parameter estimation or fine tuning through time series cross-validation. Later, you’ll implement algorithm testing for evaluating previously optimized relationship forecasting accuracy through scale-dependent and scale-independent error metrics within testing range. Finally, you’ll compare regression machine learning algorithms training and testing.

- Undergraduates or postgraduates at any knowledge level who want to learn about regression machine learning using R statistical software.
- Academic researchers who wish to deepen their knowledge in data mining, applied statistical learning or artificial intelligence.
- Business data scientists who desire to apply this knowledge in areas such as consumer analytics, finance, banking, health care, e-commerce or social media.