Teach on Udemy

Turn what you know into an opportunity and reach millions around the world.

Learn More

Your cart is empty.

Keep shopping

R Ultimate 2024: R for Data Science and Machine Learning

Name: R Ultimate 2024: R for Data Science and Machine Learning
Rating: 4.5 (403 reviews)

R Basics, Data Science, Statistical Machine Learning models, Deep Learning, Shiny and much more (All R code included)

Created byBert Gollnick

Last updated 5/2024

English

English [Auto],

What you'll learn

learn all aspects of R from Basics, over Data Science, to Machine Learning and Deep Learning
learn R basics (data types, structures, variables, and more)
learn R programming (writing loops, functions, and more)
data im- and export
basic data manipulation (piping, filtering, aggregation of results, data reshaping, set operations, joining datasets)
data visualisation (different packages are learned, e.g. ggplot, plotly, leaflet, dygraphs)
advanced data manipulation (outlier detection, missing data handling, regular expressions)
regression models (create and apply regression models)
model evaluation (What is underfitting and overfitting? Why is data splitted into training and testing? What are resampling techniques?)
regularization (What is regularization? How can you apply it?)
classification models (understand different algorithms and learn how to apply logistic regression, decision trees, random forests, support vector machines)
association rules (learn the apriori model)
clustering (kmeans, hierarchical clustering, DBscan)
dimensionality reduction (factor analysis, principal component analysis)
Reinforcement Learning (upper confidence bound)
Deep Learning (deep learning for multi-target regression, binary and multi-label classification)
Deep Learning (learn image classification with convolutional neural networks)
Deep Learning (learn about Semantic Segmentation)
Deep Learning (Recurrent Neural Networks, LSTMs)
More on Deep Learning, e.g. Autoencoders, pretrained models, ...
R/Shiny for web application development and deployment

Course content

31 sections • 204 lectures • 22h 42m total length

Course Overview5:17
Explore the course structure from basics to machine learning, covering data types, import/export, data manipulation, visualization, and deep learning across supervised, unsupervised, and reinforcement learning.
R and RStudio (Overview and Installation)9:31
Install and configure R and RStudio for a local data science setup. Use open source, cross-platform support, graphics, and over ten thousand packages; enable a dedicated library and dark theme.
How to get the code2:16
Visit the home page and open the material section to access course files. Download the static zip or clone the GitHub repository, then extract or work with the local files.
How to get the code (alternative)0:25
RStudio Introduction / Project Setup9:57
Set up an RStudio project, load and organize files with clear naming, and use the four windows—coding, console, environment, and history—to run code and manage packages.
File Formats8:58
Explore file formats for R workflows, from scripts and notebooks to markdown, html, and pdf, with interactive graphs, latish documents, and speedups using c++.
Rmarkdown Lab9:26
Learn to build reproducible reports with rmarkdown by combining code chunks and text in chapters with a table of contents; render interactive html documents with plots and interactive tables.
Package Handling1:03

Basic Data Types 1016:57
Explore the six basic data types in R: numeric, integer, complex, logical, character, and raw, and how coercion governs flexible representations like vectors, matrices, data frames, and arrays.
Basic Data Types Lab15:02
Explore basic data types in R, build and inspect vectors of numeric, integer, boolean, and character values, and practice type coercion and casting with as.numeric and as.integer.
Matrices and Arrays Lab7:22
Explore matrices and arrays in R by creating vectors, converting to matrices with specified rows and columns, and building multi-dimensional arrays with adjustable dimensions, including byrow options.
Lists8:11
Learn to create and explore lists in R by combining vectors of different types, naming elements, and accessing items with brackets and the dot operator.
Factors13:44
Create and manipulate factors in r by defining fixed levels and enforcing valid values, and learn to handle data import issues when comma decimals affect factor and numeric conversions.
Dataframes8:37
Learn to create and manipulate data frames in R, comparing base data.frame and Tybalt options, access columns, and delete columns to manage tabular data.
Strings Lab24:05
Learn string handling in R using a 1984 text template to clean, tokenize, remove numbers and hyphens, and analyze word lengths and character frequencies, including Winston and O'Brien.
Datetime17:02
Explore time and date handling in R by converting strings to date and time formats, creating and parsing timestamps with lubridate, and visualizing incoming versus outgoing flow over 15 weeks.

Operators7:54
Explore arithmetic, logical, and special operators in R, using vectors A and B to perform element-wise calculations, comparisons, integer division, modulus, and the %in% operator.
Loops 1015:16
Explore loops in R for data science, focusing on for loops for known iterations, while loops for unknown iterations, and do-while loops; learn vectorization to avoid explicit loops.
Loops Lab9:16
Explore the three loop types in r, including the for and repeat loops, by defining sequences, building nested loops, and printing letters from a to t with break-based exits.
Functions 1014:46
Explore how functions encapsulate reusable code with inputs, logic, and outputs. Learn to bundle functions into packages and use default values, named parameters, and parameter order.
Functions Lab (Intro)1:27
Learn to code functions in R by constructing the Fibonacci sequence, where each term equals the sum of the two preceding ones, and return the first n elements on request.
Functions Lab (Coding)18:57
Build and test a fibonacci function in R by creating and expanding a fibonacci vector, using length and indexing, and handling edge cases like 1, 2, 0, and negative inputs.

Data Import Lab9:25
Import data into R from CSV, Excel, JSON, and SPSS formats using dedicated packages, handling separators and options. Explore exporting workflows to save results.
Data Export Lab4:36
Learn how to export data in R using write and safeguard, specifying the object and file, creating CSV exports without headers, and exporting SPSS formats and multiple objects.
Web Scraping Intro1:06
Learn web scraping in R by extracting data from web resources like Wikipedia and converting tables into a data frame, avoiding manual copy-paste.
Web Scraping Lab7:01
Learn to perform web scraping in R by downloading a Wikipedia wind power table, extracting it with XPath, cleaning numeric values, and displaying the first five columns.

Piping 1012:35
Piping 101 shows how piping passes the left-hand input to the next function, turning vector, log, differences, exponential, and rounding to one decimal place into a readable workflow.
Filtering 1015:52
Filter data in R by indexing vectors and data frames. Learn one-based indexing, slicing, selecting elements with c(), and using $ or brackets to access columns.
Filtering Lab10:18
Master filtering in R with the diamonds dataset using the deployer package, applying filter, sample, slice, top, and select operations, including whitelist and blacklist options to shape columns and rows.
Filtering Exercise0:03
Filtering Solution0:02
Data Aggregation 1014:46
Explore data aggregation using group by and summarize to compute group-level statistics, such as means, across multiple groupings in R.
Data Aggregation Lab4:53
Explore data aggregation in R using a population dataset from the World Health Organization. Group by country, calculate min, max, absolute and relative increases from 1995 to 2015.
Data Aggregation Exercise0:03
Data Aggregation Solution0:03
Data Reshaping 1013:20
Learn how to reshape data into tidy format, where each variable has its own column, each observation its own row, and each value its own cell, enabling easier analysis.
Data Reshaping Lab11:43
Explore data reshaping in R by converting between wide and tidy formats, then regroup and plot results using tidy data principles, with practical examples of group by and summarize.
Data Reshaping Exercise0:03
Data Reshaping Solution0:02
Set Operations 1011:30
Explore set operations in R using intersect, union, and setdiff to compare two sets, identify overlaps, include all observations, and obtain left-hand side results based on argument order.
Set Operations Lab2:21
Explore set operations in R with two vectors a and b, using intersect to find the overlap, setdiff to identify unique values, and union to combine all values.
Joining Datasets 1017:32
Explore how to join datasets in R using left join, inner join, and other variants by aligning on indices, keeping left data intact, and handling missing values.
Joining Datasets Lab5:34
Learn to prepare and join data frames A and B with the deployer package, performing left, right, and full joins on a common index and renaming value columns.

Visualisation Overview2:54
Explore data visualization in R, from base plotting to plotty, with hover and zoom, and leaflet for geospatial plots and digraphs for time series. Also cover Sankei diagrams and TriMet.
ggplot 10111:04
Discover ggplot 101 by mapping data with aesthetics, building scatter, histograms, and box plots from the diamonds dataset, and using facets, scales, and jitter to reveal patterns.
ggplot Lab17:48
Explore and visualize the diamonds data set using ggplot to create one- and two-variable plots, including discrete and continuous variables, density, violin, dot plots, and faceted color and size mappings.
plotly Lab (Intro)2:18
Explore Plotly lab intro to create interactive visualizations with World Happiness Report data, linking GDP and life expectancy to happiness scores, featuring hover details, zoom, area highlight, and easy sharing.
plotly Lab11:21
Build interactive scatter plots in R using plotly, combining 2015–2016 happiness data with GDP, life expectancy, and freedom to reveal patterns through subplots and hover text.
leaflet Lab (Intro)2:24
Explore leaflet for interactive geospatial visualizations by analyzing the 1854 London cholera data, mapping deaths with red circles and pumps with green circles to reveal waterborne transmission.
leaflet Lab9:11
Load the dataset, compute the median longitude and latitude, and create an interactive geospatial visualization with Leaflet, mapping deaths and pumps as red and green circles.
dygraphs Lab (Intro)1:19
Explore dynamic time series with the digraphs package. Interactively view metal prices, hover for current values, and adjust periods to study time windows.
dygraphs Lab10:22
Load metal price data from a web API into xts time series, then create dynamic dygraphs plots with rebasing to 100, a range selector, and linked plots.

Outlier Detection 10111:16
Explore univariate and multivariate outlier detection in R using z-score, extreme value analyses, and Deep Skin. Learn to handle outliers with imputation, trimming, or top, bottom or zero coating.
Outlier Detection Lab (Intro)1:20
Explore outlier detection using box plots of the iris data set, identify outliers with z-score analysis, and compare the scan technique to reveal observations that stand out.
Outlier Detection Lab20:04
Explore outlier detection in iris data using box plots and z-score thresholds. Implement per-species limits with Q1, Q3, IQR, and compare with a clustering approach and PCA visualization.
Outlier Detection Exercise0:03
Outlier Detection Solution0:02
Missing Data Handling 1016:08
Master missing data handling in R via data imputation, using visual patterns to guide simple deletion, univariate imputation, and models like mice, miss forest, and miss ranger.
Missing Data Handling Lab (Intro)1:02
Explore missing data handling in a credit approval dataset by analyzing patterns and applying techniques such as univariate and multivariate imputation, and removing observations as needed.
Missing Data Handling Lab (1/1)16:47
Advance your missing data handling skills by analyzing patterns in credit approval data, then apply univariate and multivariate imputations with Miss Ranger and Miss Forest to compare results.
Regular Expressions 1014:25
Master string handling with Stringer and regular expressions by detecting patterns, locating their positions, replacing matches, and extracting values using anchors.
Regular Expressions Lab16:19
Learn to use the Stringer package in R to detect, locate, replace, split, and test patterns with regular expressions, including anchors, quantifiers, and lookaround.

AI 1015:06
Explore artificial intelligence, its relation to machine learning and deploying, with examples like Google Maps and Google Search, and study deep learning, neural networks, image classification, and ape species distinction.
Machine Learning 1017:09
Explore how machine learning differs from classical programming, and learn supervised, unsupervised, and reinforcement learning, including classification and regression, with accuracy and r-squared as evaluation metrics.
Models5:33
Explore building and evaluating a machine learning model with supervised learning, using training data, validation data, and hold-out testing data to predict the target variable from independent predictors.

Regression Types 1013:40
Explore regression as a supervised learning technique for continuous targets, including univariate and multivariate regression. See how linear, quadratic, and higher-order relationships shape models, from univariate regression to multivariate surfaces.
Univariate Regression 1015:48
Explore univariate linear regression with one independent variable to predict a dependent variable using a linear model, and see how size and price relate via slope, intercept, and squared error.
Univariate Regression Interactive4:01
Explore univariate linear regression through an interactive dashboard, adjust the estimated slope and offset, and observe how noise affects mean squared error and model fit.
Univariate Regression Lab12:10
Explore univariate linear regression by building a model to predict mass from height, handling an outlier, and evaluating predictions with r-squared in a Star Wars data set.
Univariate Regression Exercise2:20
Explore univariate regression with a linear model linking speed to distance using Hubble telescope data, calculate adjusted r-squared, assess correlation, make predictions, and estimate the Habu constant.
Univariate Regression Solution7:51
Load univariate regression data, fit a linear model of velocity versus distance (adjusted R-squared 0.978), generate predictions across speeds, and compute the Hubble constant as velocity over distance (about 72.1).
Polynomial Regression 1012:12
Explore polynomial regression to capture non-linear patterns using higher-order terms within a linear regression framework. Learn how to balance model complexity and data fit to avoid overfitting and misleading R-squared.
Polynomial Regression Lab13:59
Create and analyze synthetic data to explore polynomial regression in r, plotting observations and residuals. Compare linear and higher-order models with poly and as.is, using adjusted r-squared to avoid overfitting.
Multivariate Regression 1014:41
Explore multivariate linear regression, using multiple predictors to predict the target y, and verify linearity, patternless residuals, low multicollinearity, and normal residuals via scatter and correlation plots.
Multivariate Regression Lab14:09
Explore multivariate regression to predict wine quality using 11 chemical properties, build a linear model, visualize correlations with a correlation matrix, and assess residuals and r-squared for evaluation.
Multivariate Regression Exercise2:15
Practice multivariate regression on a nighttime air force noise data set to predict airfoil sound pressure level from frequency and angle of attack, then compare all-variable and three-independent-variable models.
Multivariate Regression Solution13:12
Analyze multivariate regression on airfoil noise data, identify key predictors via correlation, compare full and three-variable models, and evaluate using r-squared and post-resample metrics.

Underfitting / Overfitting 10111:19
Explore underfitting and overfitting in regression and classification, and learn how bias and variance balance training and validation data to minimize overall error.
Train / Validation / Test Split 1012:56
Learn how to split data into training, validation, and test sets—the gold standard for model evaluation—train models, validate performance, and obtain an unbiased final evaluation with matching distributions.
Train / Validation / Test Split Interactive7:45
Explore training, validation, and testing splits using random sampling to avoid bias, compare linear versus random selection, and learn how data size influences the ideal split.
Train / Validation / Test Split Lab12:51
Build an R function to split a data frame into training, validation, and test sets using 0.6, 0.2, and 0.2 ratios, with floor-based sampling and index-based assignment.
Resampling Techniques 1014:52
Explore resampling techniques, including five-fold, tenfold, and leave-one-out cross-validation, to balance training and validation data, compare algorithms, and manage computational cost for stable model performance.
Resampling Techniques Lab18:06
Join the resampling techniques lab to apply 10-fold cross-validation and leave-one-out cross validation to wine quality data, comparing training and test performance of a linear model using post resample metrics.

Requirements

no prior knowledge required - just be passionate to gain new skills

Description

You want to be able to perform your own data analyses with R? You want to learn how to get business-critical insights out of your data? Or you want to get a job in this amazing field? In all of these cases, you found the right course!

We will start with the very Basics of R, like data types and -structures, programming of loops and functions, data im- and export.

Then we will dive deeper into data analysis: we will learn how to manipulate data by filtering, aggregating results, reshaping data, set operations, and joining datasets. We will discover different visualisation techniques for presenting complex data. Furthermore find out to present interactive timeseries data, or interactive geospatial data.

Advanced data manipulation techniques are covered, e.g. outlier detection, missing data handling, and regular expressions.

We will cover all fields of Machine Learning: Regression and Classification techniques, Clustering, Association Rules, Reinforcement Learning, and, possibly most importantly, Deep Learning for Regression, Classification, Convolutional Neural Networks, Autoencoders, Recurrent Neural Networks, ...

You will also learn to develop web applications and how to deploy them with R/Shiny.

For each field, different algorithms are shown in detail: their core concepts are presented in 101 sessions. Here, you will understand how the algorithm works. Then we implement it together in lab sessions. We develop code, before I encourage you to work on exercise on your own, before you watch my solution examples. With this knowledge you can clearly identify a problem at hand and develop a plan of attack to solve it.

You will understand the advantages and disadvantages of different models and when to use which one. Furthermore, you will know how to take your knowledge into the real world.

You will get access to an interactive learning platform that will help you to understand the concepts much better.

In this course code will never come out of thin air via copy/paste. We will develop every important line of code together and I will tell you why and how we implement it.

Take a look at some sample lectures. Or visit some of my interactive learning boards. Furthermore, there is a 30 day money back warranty, so there is no risk for you taking the course right now. Don’t wait. See you in the course.

Who this course is for:

R beginners interested in learning R
data science practitioners who want to deepen their knowledge
developers who want to learn different aspects of Machine Learning

R Ultimate 2024: R for Data Science and Machine Learning

What you'll learn

Explore related topics

Course content

Course Introduction8 lectures • 47min

Data Types and -structures8 lectures • 1hr 41min

R Programming6 lectures • 48min

Data Im- and Export4 lectures • 22min

Basic Data Manipulation17 lectures • 1hr 1min

Data Visualisation9 lectures • 1hr 9min

Advanced Data Manipulation10 lectures • 1hr 17min

Machine Learning: Introduction3 lectures • 18min

Machine Learning: Regression12 lectures • 1hr 26min

Machine Learning: Model Preparation and Evaluation6 lectures • 58min

Requirements

Description

Who this course is for: