Learning Path: R: Complete Machine Learning & Deep Learning

Unleash the true potential of R to unlock the hidden layers of data

Created byPackt Publishing

Last updated 6/2017

English

What you'll learn

Develop R packages and extend the functionality of your model
Perform pre-model building steps
Understand the working behind core machine learning algorithms
Build recommendation engines using multiple algorithms
Incorporate R and Hadoop to solve machine learning problems on Big Data
Understand advanced strategies that help speed up your R code
Learn the basics of deep learning and artificial neural networks
Learn the intermediate and advanced concepts of artificial and recurrent neural networks

Course content

3 sections • 213 lectures • 17h 36m total length

The Course Overview7:44
This video gives an overview of the entire course.
Performing Univariate Analysis5:22
In this video, we will take a look at how to perform univariate analysis.
Bivariate Analysis – Correlation, Chi-Sq Test, and ANOVA5:42
The goal of this video is to perform bivariate analysis in R using three cases.
Detecting and Treating Outlier3:20
In this video, we will see how to detect and treat outliers.
Treating Missing Values with `mice`3:59
The goal of this video is to see how to treat missing values in R.
Building Linear Regressors7:35
In this video we'll see what is linear regression, its purpose, when to use it, and how to implement in R.
Interpreting Regression Results and Interactions Terms5:19
We'll see how to interpret regression results and Interaction effects in this video
Performing Residual Analysis & Extracting Extreme Observations Cook's Distance3:25
In this video we will discuss what is residual analysis and detect multivariate outliers using Cook's Distance
Extracting Better Models with Best Subsets, Stepwise Regression, and ANOVA4:39
The goal of this video is to understand how to do model selection and comparison using best subsets, stepwise regression and ANOVA.
Validating Model Performance on New Data with k-Fold Cross Validation2:29
In this video we will see how to do k-fold cross validation in R.
Building Non-Linear Regressors with Splines and GAMs5:19
The goal of this video is check out how to build non-linear regression models using Splines and GAMs.
Building Logistic Regressors, Evaluation Metrics, and ROC Curve12:38
Our goal in this video would be to understand logistic regression, evaluation metrics of binary classification problems, and interpretation of the ROC curve.
Understanding the Concept and Building Naive Bayes Classifier9:23
In this video, we will understand the concept and working of naïve Bayes classifier and how to implement the R code.
Building k-Nearest Neighbors Classifier7:01
In this video, we will look at what k-nearest neighbors algorithms, how does it works and how to implement it in T.
Building Tree Based Models Using RPart, cTree, and C5.06:32
The goal of this video is to understand how decision trees work, what they are used for, and how to implement then.
Building Predictive Models with the caret Package8:11
The goal of this video is know what the various features of the caret package are and how to build predictive models.
Selecting Important Features with RFE, varImp, and Boruta5:19
The goal of this video is to know how to do feature selection before building predictive models.
Building Classifiers with Support Vector Machines8:03
In this video, we will look at how support vector machines work.
Understanding Bagging and Building Random Forest Classifier5:06
In this video, we will look at the concept behind bagging and random forests and how to implement it to solve problems.
Implementing Stochastic Gradient Boosting with GBM5:18
Let's understand what boosting is and how stochastic gradient boosting works with GBM.
Regularization with Ridge, Lasso, and Elasticnet8:52
In this video, we will look at what regularization is, ridge and lasso regression, and how to implement it.
Building Classifiers and Regressors with XGBoost10:10
Let's look at how XG Boost works and how to implement it in this video.
Dimensionality Reduction with Principal Component Analysis5:04
Our goal in this video would be to reduce the dimensionality of data with principal components, and understand the concept and how to implement it in R.
Clustering with k-means and Principal Components3:16
In this video, we will understand the k-means clustering algorithm and implement it using the principal components.
Determining Optimum Number of Clusters5:24
In this video, we will analyze the clustering tendency of a dataset and identify the ideal number of clusters or groups.
Understanding and Implementing Hierarchical Clustering2:36
The goal of this video is to understand the logic of hierarchical clustering, types, and how to implement it in R.
Clustering with Affinity Propagation5:24
How to use affinity propagation to cluster data points? How is it different from conventional algorithms?
Building Recommendation Engines9:00
How to build recommendation engines to recommend products/movies to new and existing users?
Understanding the Components of a Time Series, and the xts Package5:41
The goal of this video is to understand what a time series is, how to create time series of various frequencies, and the enhanced facilities available in the xts package.
Stationarity, De-Trend, and De-Seasonalize4:07
The goal of this video is to understand the characteristics of a time series: stationarity and how to de-trend and de-seasonalize a time series.
Understanding the Significance of Lags, ACF, PACF, and CCF3:49
In this video, we will introduce the characteristics of time series such as ACF, PACF, and CCF; why they matter; and how to interpret them.
Forecasting with Moving Average and Exponential Smoothing2:25
Our goal in this video would be to understand moving average and exponential smoothing and use it to forecast.
Forecasting with Double Exponential and Holt Winters3:22
In this video, we will understand how double exponential smoothing and holt winter forecasting works, when to use them, and how to implement them in R.
Forecasting with ARIMA Modelling5:26
Let's look at what ARIMA forecasting is, understand the concepts, and learn how ARIMA modelling works in this video.
Scraping Web Pages and Processing Texts9:24
In this video, we'll take a look at how to scrape data from web pages and how to clean and process raw web and other textual data.
Corpus, TDM, TF-IDF, and Word Cloud9:06
Our goal in this video is to know how to process texts using tm package and understand the significance of TF-IDF and its implementation. Finally, we see how to draw a word cloud in R.
Cosine Similarity and Latent Semantic Analysis7:20
Let's see how to use cosine similarity and latent semantic analysis to find and map similar documents.
Extracting Topics with Latent Dirichlet Allocation5:07
In this video, we will see how to extract the underlying topics in a document, the keywords related to each topic and the proportion of topics in each document.
Sentiment Scoring with tidytext and Syuzhet4:23
Let's check out how to perform sentiment analysis and scoring in R.
Classifying Texts with RTextTools3:57
How to classify texts with machine learning algorithms using the RTextTools package?
Building a Basic ggplot2 and Customizing the Aesthetics and Themes7:18
The goal of this videos is to understand what is the basic structure of to make charts with ggplot, how to customize the aesthetics, and manipulate the theme elements.
Manipulating Legend, AddingText, and Annotation3:31
In this video, we will see how to manipulate the legend the way we want and how to add texts and annotation in ggplot.
Drawing Multiple Plots with Faceting and Changing Layouts3:18
The goal of this video is to understand how to plot multiple plots in the same chart and how to change the layouts of ggplot.
Creating Bar Charts, Boxplots, Time Series, and Ribbon Plots5:25
How to make various types of plots in ggplot such as bar chart, time series, boxplot, ribbon chart,and so on.
ggplot2 Extensions and ggplotly3:11
In this video, we will understand what the popular ggplot extensions are, and where to find them, and their applications.
Implementing Best Practices to Speed Up R Code5:46
We will discuss the best practices that should be followed to minimize code runtime in this video.
Implementing Parallel Computing with doParallel and foreach4:22
Let's tackle the implementation of parallel computing in R.
Writing Readable and Fast R Code with Pipes and DPlyR5:39
The goal of this video is understand how to work with DplyR and pipes.
Writing Super Fast R Code with Minimal Keystrokes Using Data.Table6:38
In this video, we will discuss how to manipulate data with the data.table package, how to achieve maximum speed, and what the various features of data.table are.
Interface C++ in R with RCpp11:09
Our main focus in this video is to understand how to write C++ code and make it work in R. Also leverage the speed of C++ in R, interface Rcpp with R, and write Rcpp code.
Understanding the Structure of an R Package5:02
We'll take a look at the components of an R package in this video.
Build, Document, and Host an R Package on GitHub7:09
In this video, we will look at how to create an R Package so that it can be submitted to CRAN.
Performing Important Checks Before Submitting to CRAN4:05
We will understand the mandatory checks and common problems faced by developers when creating R packages in this video.
Submitting an R Package to CRAN3:10
The goal of this video is to show how to submit an R package to CRAN.

The Course Overview4:38
This is give you brief information about the course.
Downloading and Installing R6:10
R must be first installed on your system to work on it.
Downloading and Installing RStudio3:10
RStudio makes the process of development with R easier.
Installing and Loading Packages5:46
R packages are an essential part of R as they are required in all our programs. Let's learn to do that.
Reading and Writing Data5:54
You must know how to give data to R to work with data. You will learn that here.
Using R to Manipulate Data5:46
Data manipulation is time consuming and hence needs to be done with the help of built-in R functions.
Applying Basic Statistics4:47
R is widely used for statistical applications. Hence it is necessary to learn about the built in functions of R.
Visualizing Data3:33
To communicate information effectively and make data easier to comprehend we need graphical representation. You will learn to plot figures in this section.
Getting a Dataset for Machine Learning2:38
Because of some limitations, it is a good practice to get data from external repositories. You will be able to do just that after this video.
Reading a Titanic Dataset from a CSV File8:36
Reading a dataset is the first and foremost step in data exploration. We need to learn to how to do that.
Converting Types on Character Variables3:05
In R, since nominal, ordinal, interval, and ratio variable are treated differently in statistical modeling, we have to convert a nominal variable from a character into a factor.
Detecting Missing Values3:18
Missing values affect the inference of a dataset. Thus it is important to detect them.
Imputing Missing Values4:30
After detecting missing values, we need to impute them as their absence may affect the conclusion.
Exploring and Visualizing Data4:24
After imputing the missing values, you should perform an exploratory analysis to summarize the data characteristics.
Predicting Passenger Survival with a Decision Tree3:58
The exploratory analysis helps users gain insights into how single or multiple variables may affect the survival rate. However, it does not determine what combinations may generate a prediction model. We need to use a decision tree for that.
Validating the Power of Prediction with a Confusion Matrix2:08
After constructing the prediction model, it is important to validate how the model performs while predicting the labels.
Assessing performance with the ROC curve2:32
Another way of measuring performance is the ROC curve.
Understanding Data Sampling in R3:30
When there are huge datasets, we can find the characteristics of the entire dataset with a part or sample of the data. Hence data sampling is essential.
Operating a Probability Distribution in R5:41
Probability distribution and statistics are interdependent. To provide a justification to the statistical information, we need probability.
Working with Univariate Descriptive Statistics in R5:09
Univariate statistics deals with a single variable and hence is very simple.
Performing Correlations and Multivariate Analysis3:01
To analyze the relation among more than two variables, multivariate analysis is done.
Operating Linear Regression and Multivariate Analysis3:25
Assessing the relation between dependent and independent variables is carried out through linear regression.
Conducting an Exact Binomial Test3:48
To validate that the experiment results are significant, hypothesis testing is done.
Performing Student's t-test3:13
To compare means of two different groups, one- and two-sample t-tests are conducted.
Performing the Kolmogorov-Smirnov Test4:43
Comparing a sample with a reference probability or comparing cumulative distributions of two data sets calls for a Kolmogorov- Smirnov test.
Understanding the Wilcoxon Rank Sum and Signed Rank Test2:04
The Wilcoxon Test is a non-parametric test for null hypothesis.
Working with Pearson's Chi-Squared Test5:08
To check the distribution of categorical variables of two groups, Pearson's chi-squared test is used.
Conducting a One-Way ANOVA4:15
To examine the relation between categorical independent variables and continuous dependent variables, Anova is used. When there is a single variable, one-way ANOVA is used.
Performing a Two-Way ANOVA4:02
When there are two categorical values to be compared, two-way ANOVA is used.
Fitting a Linear Regression Model with lm4:53
Linear regression is the simplest model in regression and can be used when there is one predictor value.
Summarizing Linear Model Fits5:20
To obtain summarized information of a fitted model, we need to learn how to summarize linear model fits.
Using Linear Regression to Predict Unknown Values2:51
It would be really convenient for us if we could predict unknown values. You can do that using linear regression.
Generating a Diagnostic Plot of a Fitted Model3:57
To check if the fitted model adequately represents the data, we perform diagnostics.
Fitting a Polynomial Regression Model with lm2:16
In the case of a non-linear relationship between predictor and response variables, a polynomial regression model is formed. We need to fit the model. This video will enable you to do that.
Fitting a Robust Linear Regression Model with rlm2:15
An outlier will cause diversion from the slope of the regression line. In order to avoid that, we need to fit a robust linear regression model.
Studying a case of linear regression on SLID data6:38
We will perform linear regression on a real-life example, the SLID dataset.
Reducing Dimensions with SVD2:11
GLM generalizes linear regression by allowing the linear model to be related to the response variable via a link function and by allowing the magnitude of the variance of each measurement to be a function of its predicted value.
Applying the Poisson model for Generalized Linear Regression1:33
GLM allows response variables with error distribution other than a normal distribution. We apply the Poisson model to see how that is done.
Applying the Binomial Model for Generalized Linear Regression2:02
When a variable is binary, we apply the binomial model.
Fitting a Generalized Additive Model to Data3:13
GAM has the ability to deal with non-linear relationships between dependent and independent variables. We learn to fit a regression using GAM.
Visualizing a Generalized Additive Model1:26
Visualizing a GAM helps it to understand better.
Diagnosing a Generalized Additive Model3:38
You can also diagnose a GAM model to analyze it.
Preparing the Training and Testing Datasets3:44
Training and testing datasets are both essential for building a classification model.
Building a Classification Model with Recursive Partitioning Trees6:10
A partitioning tree works on the basis of split condition starting from the base node to the terminal node.
Visualizing a Recursive Partitioning Tree3:03
Plotting the classification tree will make analyzing the data easier. You will learn to do this now.
Measuring the Prediction Performance of a Recursive Partitioning Tree2:48
Before making a prediction, it is essential to compute the prediction performance of the model.
Pruning a Recursive Partitioning Tree2:37
There can be parts in a dataset which are not essential for classification. In order to remove these parts, we have to prune the dataset.
Building a Classification Model with a Conditional Inference Tree1:56
Conditional inference trees are better than traditional classification trees because they adapt the test procedures for selecting the output.
Visualizing a Conditional Inference Tree2:38
Visualizing a conditional inference tree will make it easier to extract and analyze data from the dataset.
Measuring the Prediction Performance of a Conditional Inference Tree2:10
Like the prediction performance of a traditional classification tree, we can also evaluate the performance of a conditional inference tree.
Classifying Data with the K-Nearest Neighbor Classifier5:31
k-nearest neighbor classifier is a non parametric lazy learning method. Thus it has the advantages of both the types of methods.
Classifying Data with Logistic Regression4:37
Classification in logistic regression is done based one or more features. It is more robust and doesn't have as many conditions as the traditional classification model.
Classifying data with the Naïve Bayes Classifier6:16
The Naïve Bayes classifier is based on applying Bayes' theorem with a strong independent assumption.
Classifying Data with a Support Vector Machine5:57
Support vector machines are better at classification because they can capture complex relations between data points and provide both linear and non-linear classifications
Choosing the Cost of an SVM2:56
To control our training errors and margins, we use the cost function. The SVM classifier is affected by the cost.
Visualizing an SVM Fit3:33
To visualize the SVM fit, we can use the plot function.
Predicting Labels Based on a Model Trained by an SVM3:48
We can use the trained SVM to predict labels on a model.
Tuning an SVM2:47
According to the desired output, you may need to generate different combinations of gamma and cost to train different SVMs. This is called tuning.
Training a Neural Network with neuralnet4:07
A neural network is used in classification, clustering and prediction. Its efficiency depends on how well you train it. Let's learn to do that.
Visualizing a Neural Network Trained by neuralnet2:21
We can use the trained SVM to predict labels on a model.
Predicting Labels based on a Model Trained by neuralnet3:07
Similar to other classification models, we can predict labels using neural networks and also validate performance using confusion matrix.
Training a Neural Network with nnet2:45
Nnet provides the functionality to train feed-forward neural networks with backpropagation.
Predicting labels based on a model trained by nnet2:49
As we have already trained the neural network using nnet, we can use the model to predict labels.
Estimating Model Performance with k-fold Cross Validation3:42
The k-fold cross-validation technique is a common technique used to estimate the performance of a classifier as it overcomes the problem of over-fitting. In this video we will illustrate how to perform a k-fold cross-validation.
Performing Cross Validation with the e1071 Package3:22
In this video, we will illustrate how to use tune.svm to perform 10-fold cross-validation and obtain the optimum classification model.
Performing Cross Validation with the caret Package2:59
In this video we will demonstrate how to perform k-fold cross validation using the caret package.
Ranking the Variable Importance with the caret Package2:21
This video will show you how to rank the variable importance with the caret package.
Ranking the Variable Importance with the rminer Package2:30
In this video, we will illustrate how to use rminer to obtain the variable importance of a fitted model.
Finding Highly Correlated Features with the caret Package2:13
In this video we will show how to find highly correlated features using the caret package.
Selecting Features Using the Caret Package4:58
In this video, we will demonstrate how to use the caret package to perform feature selection.
Measuring the Performance of the Regression Model3:57
To measure the performance of a regression model, we can calculate the distance from the predicted output and the actual output as a quantifier of the performance of the model. In this video we will illustrate how to compute these measurements from a built regression model.
Measuring Prediction Performance with a Confusion Matrix2:07
In this video we will demonstrate how to retrieve a confusion matrix using the caret package.
Measuring Prediction Performance Using ROCR2:46
In this video, we will demonstrate how to illustrate an ROC curve and calculate the AUC to measure the performance of a classification model.
Comparing an ROC Curve Using the Caret Package3:43
In this video we will use the function provided by the caret package to compare different algorithm-trained models on the same dataset.
Measuring Performance Differences between Models with the caret Package3:41
In this video we will see how to measure performance differences between fitted models with the caret package.
Classifying Data with the Bagging Method7:53
The adabag package implements both boosting and bagging methods. For the bagging method, the package first generates multiple versions of classifiers, and then obtains an aggregated classifier. Let's learn the bagging method from adabag to generate a classification model.
Performing Cross Validation with the Bagging Method1:56
To assess the prediction power of a classifier, you can run a cross validation method to test the robustness of the classification model. This video will show how to use bagging.cv to perform cross validation with the bagging method.
Classifying Data with the Boosting Method6:04
Boosting starts with a simple or weak classifier and gradually improves it by reweighting the misclassified samples. Thus, the new classifier can learn from previous classifiers. One can use the boosting method to perform ensemble learning. Let's see how to use the boosting method to classify the telecom churn dataset.
Performing Cross Validation with the Boosting Method2:06
Similar to the bagging function, adabag provides a cross validation function for the boosting method, named boosting.cv. In this video, we will learn how to perform cross-validation using boosting.cv.
Classifying Data with Gradient Boosting7:09
Gradient boosting creates a new base learner that maximally correlates with the negative gradient of the loss function. One may apply this method on either regression or classification problems. But first, we need to learn how to use gbm.
Calculating the Margins of a Classifier5:30
A margin is a measure of certainty of a classification. It calculates the difference between the support of a correct class and the maximum support of an incorrect class. This video will show us how to calculate the margins of the generated classifiers.
Calculating the Error Evolution of the Ensemble Method2:18
The adabag package provides the errorevol function for a user to estimate the ensemble method errors in accordance with the number of iterations. Let's explore how to use errorevol to show the evolution of errors of each ensemble classifier.
Classifying Data with Random Forest7:01
Random forest grows multiple decision trees which will output their own prediction results. The forest will use the voting mechanism to select the most voted class as the prediction result. In this video, we illustrate how to classify data using the randomForest package.
Estimating the Prediction Errors of Different Classifiers4:35
At the beginning of this section, we discussed why we use ensemble learning and how it can improve the prediction performance. Let's now validate whether the ensemble model performs better than a single decision tree by comparing the performance of each method.
Clustering Data with Hierarchical Clustering8:40
Hierarchical clustering adopts either an agglomerative or a divisive method to build a hierarchy of clusters. This video shows us how to cluster data with the help of hierarchical clustering.
Cutting Trees into Clusters3:30
In this video we demonstrate how to use the cutree function to separate the data into a given number of clusters.
Clustering Data with the k-Means Method4:10
In this video, we will demonstrate how to perform k-means clustering on the customer dataset.
Drawing a Bivariate Cluster Plot3:32
We will now illustrate how to create a bivariate cluster plot.
Comparing Clustering Methods4:15
In this video we will see how to compare different clustering methods using cluster.stat from the fpc package.
Extracting Silhouette Information from Clustering2:40
In this video we will see how to compute silhouette information.
Obtaining the Optimum Number of Clusters for k-Means2:48
In this video we will discuss how to find the optimum number of clusters for the k-means clustering method.
Clustering Data with the Density-Based Method6:42
In this video, we will demonstrate how to use DBSCAN to perform density-based clustering.
Clustering Data with the Model-Based Method4:37
In this video, we will demonstrate how to use the model-based method to determine the most likely number of clusters.
Visualizing a Dissimilarity Matrix3:23
A dissimilarity matrix can be used as a measurement for the quality of a cluster. In this video, we will discuss some techniques that are useful to visualize a dissimilarity matrix.
Validating Clusters Externally4:12
In this video, we will demonstrate how clustering methods differ with regard to data with known clusters.
Transforming Data into Transactions3:35
Before starting with a mining association rule, you need to transform the data into transactions. This video will show how to transform any of a list, matrix, or data frame into transactions.
Displaying Transactions and Associations2:14
The arule package uses its own transactions class to store transaction data. As such, we must use the generic function provided by arule to display transactions and association rules. Let's see how to display transactions and association rules via various functions in the arule package.
Mining Associations with the Apriori Rule7:24
Association mining is a technique that can discover interesting relationships hidden in transaction datasets. This approach first finds all frequent itemsets and then generates strong association rules from frequent itemsets. In this video, we see how to perform association analysis using the apriori rule.
Pruning Redundant Rules2:26
This video will show you how to determine the number of principal components using the Kaiser method.
Visualizing Association Rules5:06
Besides listing rules as text, you can visualize association rules, making it easier to find the relationship between itemsets. In this video, we will learn how to use the aruleViz package to visualize the association rules.
Mining Frequent Itemsets with Eclat3:36
An apriori algorithm performs a breadth-first search to scan the database. So, support counting becomes time consuming. Alternatively, if the database fits into the memory, you can use the Eclat algorithm, which performs a depth-first search to count the supports. Let's see how to use the Eclat algorithm.
Creating Transactions with Temporal Information2:41
In addition to mining interesting associations within the transaction database, we can mine interesting sequential patterns using transactions with temporal information. This video demonstrates how to create transactions with temporal information.
Mining Frequent Sequential Patterns with cSPADE4:15
In contrast to association mining, we should explore patterns shared among transactions where a set of itemsets occurs sequentially. One of the most famous frequent sequential pattern mining algorithms is the Sequential Pattern Discovery using Equivalence classes (SPADE) algorithm. Let's see how to use SPADE to mine frequent sequential patterns.
Performing Feature Selection with FSelector7:37
This video will give you an introduction on how to perform feature selection with the FSelector package.
Performing Dimension Reduction with PCA7:18
Principal component analysis (PCA) is the most widely used linear method in dealing with dimension reduction problems. This video will show you how to use it.
Determining the Number of Principal Components Using the Scree Test3:34
This video demonstrates how to determine the number of principal components using a scree plot. Let's have a look at it.
Determining the Number of Principal Components Using the Kaiser Method2:05
This video will show you how to determine the number of principal components using the Kaiser method.
Visualizing Multivariate Data Using biplot3:16
Let's see how to use biplot to plot both variables and data
Performing Dimension Reduction with MDS5:37
In MDS, you can either use a metric or a nonmetric solution. This video illustrates how to perform MDS on the swiss dataset.
Reducing Dimensions with SVD3:18
You may require several times, reducing the dimension of matrices while working on datasets. Let us see how we could do this with SVD.
Compressing Images with SVD3:05
Let's see how to perform SVD on the classic image processing material, Lenna.
Performing Nonlinear Dimension Reduction with ISOMAP4:34
This video will show you how to perform a nonlinear dimension reduction with ISOMAP. This is one of the approaches to manifold learning and generalizes linear frameworks to nonlinear data structures.
Performing Nonlinear Dimension Reduction with Local Linear Embedding4:54
This video will give you a short introduction of how to use LLE on an s-curve data
Preparing the RHadoop Environment5:35
In order to prepare the RHadoop environment we need to download the Cloudera and QuickStart VM.
Installing rmr23:52
Installation of R and the rmr2 package is essential to perform MapReduce, which is used in performing data processing and analysis.
Installing rhdfs4:15
In order to access HDFS resources you need to install rhdfs on every task node.
Operating HDFS with rhdfs5:47
You can easily operate HDFS from the R console with the help of rhdfs.
Implementing a Word Count Problem with RHadoop5:26
You will understand how rmr2 is used for word count in this video.
Comparing the Performance between an R MapReduce Program & a Standard R Program5:03
Comparing Hadoop and a standard R program can help us decide which language is best suited for our needs.
Testing and Debugging the rmr2 Program3:48
Since running a MapReduce program will require a considerable amount of time, testing and debugging become very important.
Installing plyrmr3:12
Plyrmr makes data manipulation operations easy.
Manipulating Data with plyrmr3:52
Writing a MapReduce program can be difficult for non-developers. Hence plyr-like operations can be used to manipulate data.
Conducting Machine Learning with RHadoop4:38
You can perform machine learning operations with RHadoop.
Configuring RHadoop Clusters on Amazon EMR5:28
Multinode clusters can be deployed with the help of Amazon EMR on RHadoop.

The Course Overview5:22
This video provides an overview of the entire course.
Fundamental Concepts in Deep Learning7:42
The main objective is to understand the fundamental concepts and key features that make it so special and different from the classical Machine Learning approach.
Introduction to Artificial Neural Networks7:57
The goal of this video is to learn more about Artificial Neural Networks and their vast world of variations, explore the basic architectures of ANNs in detail and talk about their possible implementations in R.
Classification with Two-Layers Artificial Neural Networks8:02
Applying what you have learned about the Multilayer Perceptron algorithm to a real-world application, which classifies handwritten digits in images.
Probabilistic Predictions with Two-Layer ANNs6:33
To get probabilistic predictions using Artificial Neural Networks and specifically in the context of a multi-class classification problem.
Introduction to Multi-hidden-layer Architectures4:31
To add multiple hidden layers to the basic Multilayers Perceptron algorithm in order to build more complex models of the world and increase the accuracy of our predictions.
Tuning ANNs Hyper-Parameters and Best Practices6:12
The goal of this video is to learn the best practices for tuning the hyper-parameters of an ANN and being able to generalize well on the data we have never seen before. This would be the latest essential skill to acquire in order to get the best out of our ANN solution.
Neural Network Architectures4:57
The goal of this video is to learn more about Multi-hidden-layer Neural Networks and how to use them in order to solve the practical problem of classifying handwritten digits within the R language.
Neural Network Architectures Continued8:02
The goal of this video is to apply what we have learned about Multi-hidden-layer ANNs to a new real-world problem and get more confidence in the use of the H2O package.
The LearningProcess5:35
The main objective is to understand the optimization process behind and common to every Deep Learning model with a more formal definition with respect to what was previously introduced.
Optimization Algorithms and Stochastic Gradient Descent8:11
To explore with more details, the most common algorithm to minimize the loss function called Stochastic Gradient Descent.
Backpropagation6:44
The goal of this video is to understand how to actually learn the weights of our Deep Learning model using Stochastic Gradient Descent through Backpropagation, the standard way of computing the gradient for Artificial Neural Networks.
Hyper-Parameters Optimization7:17
To get to tune the hyper-parameters automatically in order to minimize the error on the validation set.
Introduction to Convolutional Neural Networks9:57
This first video, will be an introduction to the fundamental concepts behind Convolutional Neural Networks. The main objective of this video is to motivate their use highlighting the differences from classical feed-forward neural networks.
Introduction to Convolutional Neural Networks Continued10:35
The goal of this video is to learn more about Convolutional Neural Networks, concluding our dissertation on the layer-wise structure of a CNN and understand how to design architecture suitable for your specific problem.
CNNs in R10:41
The aim of this video is to understand how to actually implement CNNs in R, and use it to solve real-world problems.
Classifying Real-World Images with Pre-Trained Models8:29
The goal of this video is to learn about the concept of transfer Learning, and how we can use and exchange DL pre-trained models to solve even new tasks with a very tiny computational overhead.
Introduction to Recurrent Neural Networks11:57
The aim of this video is to introduce the fundamental concepts behind Recurrent Neural Networks. The main objective is to underline their main differences from classical feed-forward neural networks and CNNs.
Introduction to Long Short-Term Memory8:07
The aim of this video is to learn more about a specific type of Recurrent Neural Networks, called Long Short-Term Memories, a natural extension of classical RNNs for dealing with long-term dependencies.
RNNs in R8:55
The aim of this video is to understand how to actually implement RNNs in R, and use it to solve real-world problems.
Use-Case – Learning How to Spell English Words from Scratch6:35
The aim of this video is to learn how to train and use an LSTM to solve a complex problem like predicting the next character in a sentence given the occurrences of the previous characters.
Introduction to Unsupervised and Reinforcement Learning6:44
The aim of this video is to understand the main differences from classical supervised learning and how they can be combined together.
Autoencoders4:56
In this video, you will learn more about a specific unsupervised learning algorithm called Autoencoders. This type of Artificial Neural Networks are simple and effective solutions for learning efficient representation of data without any supervision.
Restricted Boltzmann Machines and Deep Belief Networks7:44
The aim of this video is to learn about two very important unsupervised Deep Learning algorithms for features hierarchies: Restricted Boltzmann Machines and Deep Belief Networks.
Reinforcement Learning with ANNs7:22
The aim of this video is to get a quick picture on the main approaches for solving reinforcement learning tasks with Deep Learning.
Use-Case – Anomaly Detection through Denoising Autoencoders6:52
The aim of this video is to learn how to train and use an Autoencoders in R with the H2O package, for solving a real-world anomaly detection task.
Deep Learning for Computer Vision7:19
The aim of this video is to spark new inspiration for creatively applying Deep Learning techniques to real-world problems in Computer Vision.
Deep Learning for Natural Language Processing6:04
The aim of this video is to creatively apply Deep Learning techniques to real-world problems in Natural Language Processing NLP.
Deep Learning for Audio Signal Processing5:01
In this video, we'll creatively apply Deep Learning techniques to real-world problems in ASP.
Deep Learning for Complex Multimodal Tasks4:32
The aim of this video is to introduce some of the most successful applications of Deep Learning for complex multimodal tasks.
Other Important Applications of Deep Learning5:24
In this video, let's take a look at some of the most successful applications in other fields we didn't mention before.
Debugging Deep Learning Systems5:56
The aim of this first video is to learn how to deal with models which do not behave as they should.
GPU and MGPU Computing for Deep Learning4:57
In this video, you will learn how to speed-up the training and deploy complex DL models.
A Complete Comparison of Every DL Packages in R4:40
The aim of this video is to present a complete overview on every available R package for Deep Learning and Neural Networks.
Research Directions and Open Questions4:37
In this video, you will learn about the most interesting research directions and open question for the long-term developments of Deep Learning toward truly intelligent agents.

Requirements

Basic knowledge of R would be beneficial
Knowledge of linear algebra and statistics is required

Description

Are you looking to gain in-depth knowledge of machine learning and deep learning? If yes, then this Learning Path just right for you.

Packt’s Video Learning Paths are a series of individual video products put together in a logical and stepwise manner such that each video builds on the skills learned in the video before it.

R is one of the leading technologies in the field of data science. Starting out at a basic level, this Learning Path will teach you how to develop and implement machine learning and deep learning algorithms using R in real-world scenarios.

The Learning Path begins with covering some basic concepts of R to refresh your knowledge of R before we deep-dive into the advanced techniques. You will start with setting up the environment and then perform data ETL in R. You will then learn important machine learning topics, including data classification, regression, clustering, association rule mining, and dimensionality reduction. Next, you will understand the basics of deep learning and artificial neural networks and then move on to exploring topics such as ANNs, RNNs, and CNNs. Finally, you will learn about the applications of deep learning in various fields and understand the practical implementations of scalability, HPC, and feature engineering.

By the end of the Learning Path, you will have a solid knowledge of all these algorithms and techniques and be able to implement them efficiently in your data science projects.

Do not worry if this seems too far-fetched right now; we have combined the best works of the following esteemed authors to ensure that your learning journey is smooth:

About the Authors

Selva Prabhakaran is a data scientist with a large e-commerce organization. In his 7 years of experience in data science, he has tackled complex real-world data science problems and delivered production-grade solutions for top multinational companies.

Yu-Wei, Chiu (David Chiu) is the founder of LargitData, a startup company that mainly focuses on providing Big Data and machine learning products. He has previously worked for Trend Micro as a software engineer, where he was responsible for building Big Data platforms for business intelligence and customer relationship management systems. In addition to being a startup entrepreneur and data scientist, he specializes in using Spark and Hadoop to process Big Data and apply data mining techniques for data analysis.

Vincenzo Lomonaco is a deep learning PhD student at the University of Bologna and founder of ContinuousAI, an open source project aiming to connect people and reorganize resources in the context of continuous learning and AI. He is also the PhD students' representative at the Department of Computer Science of Engineering (DISI) and teaching assistant of the courses machine learning and computer architectures in the same department.

Who this course is for:

The Learning Path is for machine learning engineers, statisticians, and data scientists who want to create cutting-edge machine learning and deep learning models using R

Learning Path: R: Complete Machine Learning & Deep Learning

What you'll learn

Explore related topics

Course content

Mastering R Programming54 lectures • 5hr 12min

R Machine Learning solutions124 lectures • 8hr 20min

Deep Learning with R35 lectures • 4hr 4min

Requirements

Description

Who this course is for: