Categories

2017-08-15 00:40:49

Learning Path: R: Master Data Mining Techniques with R

Please confirm that you want to add **Learning Path: R: Master Data Mining Techniques with R** to your Wishlist.

Unlock the power of R for data mining on real-world datasets

2 students enrolled

Current price: $10
Original price: $200
Discount:
95% off

30-Day Money-Back Guarantee

- 7 hours on-demand video
- Full lifetime access
- Access on mobile and TV

- Certificate of Completion

What Will I Learn?

- Make use of statistics and programming to understand data mining concepts and their application
- Explore various libraries available in R for data mining
- Apply data management steps to handle large datasets
- Get to know various data visualization libraries available in R to represent data
- Create predictive models to build a recommendation engine
- Implement various dimension reduction techniques to handle large datasets
- Acquire knowledge about the neural network concept drawn from computer science and its applications in data mining

Requirements

- Basic programming knowledge of R
- Basic knowledge of Math and Statistics

Description

The world is emitting data at a very high pace and everyone wants to gain insights from the huge number of data coming their way. Data mining provides a way of finding these insights and R has become the go-to-tool for it among the data analysts and data scientists. *If you're looking forward to working on complex data mining projects and gaining deeper insights of data, then go for this Learning Path.*

Packt’s Video Learning Paths are a series of individual video products put together in a logical and stepwise manner such that each video builds on the skills learned in the video before it.

The highlights of this Learning Path are:

*Practical projects on***real-world data mining use cases**presented in a very easy-to-understand manner*One-stop solution to perform***spatial data mining**,**text mining**,**social media mining**, and**web mining**

Let’s get on this data mining journey together! This Learning Path starts with a brief introduction to R and setting up the development environment. ** Get a firm hold on the fundamentals of R and gradually build your skill level for data science**. This Learning Path will then teach you various data mining techniques, showing you how to

*After completing this Learning Path, you will have a solid understanding of all data mining techniques and how to implement them using R, in any real-world scenario.
*

**About the Author:**

We have combined the best works of the following esteemed authors to ensure that your learning journey is smooth:

**Dr. Samik Sen** is a theoretical physicist and loves thinking about hard problems. After his PH.D. in developing computational methods to solve problems for which no solutions existed, he began thinking about how to tackle math problems while lecturing. He developed algorithms to generate problem sets and solutions and learned how to create video lessons. He has since developed a large Facebook community teaching school math around Ireland, with associated e-learning products and a YouTube channel. He has a YouTube channel associated with data science, which also provides a valuable engagement with people round the world who look at problems from a different perspective.

**Pradeepta Mishra** is a data scientist, predictive modeling expert, deep learning and machine learning practitioner, and an econometrician. He is currently leading the data science and machine learning practice for Ma Foi Analytics, Bangalore, India. He holds a patent for enhancing planogram design for the retail industry. Pradeepta has published and presented research papers at IIM Ahmedabad, India. He is a visiting faculty at various leading B-schools and regularly gives talks on data science and machine learning. Pradeepta has spent more than 10 years in his domain and has solved various projects relating to classification, regression, pattern recognition, time series forecasting, and unstructured data analysis using text mining procedures, spanning across domains such as healthcare, insurance, retail and e-commerce, manufacturing, and so on.

Who is the target audience?

- This Learning Path is aimed at aspiring or professional data analysts or data scientists who want to gain deeper insights of data.

Students Who Viewed This Course Also Viewed

Curriculum For This Course

67 Lectures

07:03:03
+
–

Speaking ‘R’ - The Language of Data Science
19 Lectures
02:18:40

This video gives an overview of the entire course.

Preview
03:57

The aim of the video is to introduce the section and overview of the language R.

What Is R?

02:12

We need to have the core programs before we can begin and in this video,we show where to get them.

Getting and Setting Up R/Rstudio

02:01

In this video, we look at where to begin, so that we can get started.

Using RStudio

03:52

In this video, you will learn how RStudio has packages which avoid the problems and how we'll work on them.

Packages

08:01

In this video, you will learn how similar R is to other languages.

A Lot Is the Same

07:57

In this video, we see more familiar things in R.

Familiar Building Programming Blocks

07:49

In this video, we are now ready to write programs.

Putting It All Together

11:27

In this video, we will look at R data types which are new.

Core R Types

10:18

In this video, we will introduce some key commands to study data.

Some Useful Operations

05:12

In this video, we will introduce various commands to help us pick out elements in which we are interested in.

More Useful Operations

03:18

In this video, we will investigate the Titanic dataset to see what it says.

Titanic

11:11

In this video, we willadd a value by processing our data.

Tennis

12:42

In this video,we will download football results from a web page.

It's Mostly Cleaning Up

12:21

In this video, we will use R to do some statistics.

The Most Widely Used Statistical Package

10:15

In this video, we will work with distributions using R.

Distributions

09:27

In this video, we will see some of R's graphical power.

Time to Get Graphical

06:41

In this video, we will use the plotting package, ggplot2.

Plotting to Another Dimension

03:37

In this video, we will see another plotting technique known as Facets.

Facets

06:22

Test Your Knowledge

6 questions

+
–

R Data Mining Projects
31 Lectures
03:19:39

This video provides an overview of the entire course.

Preview
03:52

The process of deciphering meaningful insights from existing databases and analyzing results for consumption by business users.

What Is Data Mining?

04:58

We are going to start with basic programming using R for data management and data manipulation.

Introduction to the R Programming Language

14:44

Changing one data type to another if the formatting is not done properly is not difficult at all using R.

Data Type Conversion

02:11

While working on a client dataset with a large number of observations, it is required to subset the data based on some selection criteria and with or without replacement-based sampling.

Sorting, Merging, Indexing, and Subsetting Dataframes

09:45

The date functions return a Date class that represents the number of days since January 1, 1970.

Date and Time Formatting

03:01

There are two different types of functions in R, user-defined functions and built-in Functions.

Types of Functions

02:24

Using a loop, a similar task can be performed many times.

Loop Concepts

02:30

The apply function uses an array, a matrix, or a dataframe as an input and returns the result in an array format.

Applying Concepts

03:17

In typical data management, it is important to standardize the text columns or variables in a dataset because R is case sensitive and it reads any discrepancy as a new data point.

String Manipulation

02:14

The R programming language, missing values are represented as NA. NAs are not string or numeric values; they are considered as an indicator for missing values.

NA and Missing Value Management and Imputation Techniques

02:52

To generate univariate statistics about a dataset, we have to follow two approaches, one for continuous variables and the other for discrete or categorical variables.

Univariate Data Analysis

09:18

The relationship or association between two variables is known as bivariate analysis. There are three possible ways of looking at the relationship.

Bivariate Analysis

01:48

The multivariate relationship is a statistical way of looking at multiple dependent and independent variables and their relationships.

Multivariate Analysis

00:57

Understanding probability distributions is important in order to have a clear idea about the assumptions of any statistical hypothesis test.

Understanding Distributions and Transformation

04:53

Interpretation of the calculated distribution helps in forming a hypothesis.

Interpreting Distributions and Variable Binning

05:14

Contingency tables are frequency tables represented by two or more categorical variables Frequency table is used to represent one categorical variable; however, contingency table is used to represent two categorical variables.

Contingency Tables, Bivariate Statistics, and Checking for Data Normality

06:17

The null hypothesis states that nothing has happened; the means are constant, and so on. However, the alternative hypothesis states that something different has happened and the means are different about a population.

Hypothesis Testing

11:58

When a training dataset does not conform to any specific probability distribution because of non-adherence to the assumptions of that specific probability distribution, the only option left to analyze the data is via non-parametric methods.

Non-Parametric Methods

02:37

This video will walk you through the basics of data visualization along with how to create advanced data visualization using existing libraries in R programming language.

Introduction to Data Visualization

16:06

This video will let you explore different kinds of charts and plots and their creation. You'll also be able to use geo mapping.

Visualizing Charts, and Geo Mapping

03:39

By the end of this video, you will be able to use some amazing data visualization techniques which are widely used for smart Data representation.

Visualizing Scatterplot, Word Cloud and More

10:51

This video will teach you how to take the plotting to a new level. Here, you will learn to use the plotly library, which is designed as an interactive browser-based charting library built on the JavaScript library.

Using plotly

04:49

This video will let you explore the Geo mapping which is a type of chart, used by data mining experts when the dataset contains location information.

Creating Geo Mapping

02:20

How could you predict the future outcomes of a target variable? Regression is the answer to this. Let's have a brief introduction and understand regression.

Introduction about Regression

04:08

This video will let you explore about Linear regression model which can be used for explaining the relationship between a single dependent variable and independent variable.

Linear Regression

14:04

This video will let you understand the use of stepwise regression method to solve complex regression problems.

Stepwise Regression Method for Variable Selection

02:19

What could we do in those scenarios where the variable of interest is categorical in nature, such as buying a product or not, approving a credit card or not, tumor is cancerous or not, and so on? Logistic regression is the best solution to these.

Logistic Regression

09:39

Let's dive into another form of regression where the parameters in a linear regression model are increased up to one or two levels of polynomial calculation.

Cubic Regression

08:46

Market Basket Analysis is the study of relationships between various products and products that are purchased together or in a series of transactions.

Introduction to Market Basket Analysis

12:29

Implementing market basket analysis.

Practical project

15:39

Test Your Knowledge

5 questions

+
–

Advanced Data Mining projects with R
17 Lectures
01:24:44

This video provides an overview of the entire course.

Preview
03:53

It is important to classify objects according to their similarities or dissimilarities so that their study becomes easier. We use clustering techniques for that purpose.

Understanding Customer Segmentation

03:50

There are many clustering methods available. Out of them, we will learn about two methods, K-means and hierarchical, in this video.

Clustering Methods – K means and Hierarchical

15:36

In this video, we will go a step further and learn about model-based and other clustering algorithms. We will also compare the algorithms.

Clustering Methods – Model Based, Other and Comparison

05:32

Recommendation is a technique by which the algorithm detects what the user is buying. You would always like to be recommended things similar to your interest or things you have bought before. Recommendation engine helps in doing that.

What Is Recommendation?

07:29

There are different types of methods for building recommendation engine. You need to know which method to use depending on what type of product shopping you do. Also, there are certain limitations to these methods.

Application of Methods and Limitations of Collaborative Filtering

02:30

As we are armed with the theory of recommendation, we will now build a recommendation engine.

Practical Project

04:41

When there are a lot of variables, it becomes difficult to extract data. We need to devise something that will let us gather data in less number of variables. Dimensionality reduction provides you with that solution.

Why Dimensionality Reduction?

09:14

In order to understand dimensionality reduction, we need to work with it. Here, we will apply dimensionality reduction procedure, both the model-based and principal component-based approaches.

Practical Project around Dimensionality Reduction

12:41

We can also try some other approaches to perform dimensionality reduction according to the need of the dataset. Let's look at that in this video.

Parametric Approach to Dimension Reduction

02:42

Before working on neural networks, we need to understand the theory behind neural networks.

4.1 Introduction to Neural Networks

04:07

To understand and implement the neural networks, we need to understand the maths behind it. This video will do just that!

Understanding the Math Behind the Neural Network

01:58

After knowing about neural networks, we need to see how to implement neural networks in R.

Neural Network Implementation in R

01:58

Prediction is an important aspect of data mining. In this video, we will create a prediction model using neural network to predict the auction average price.

Neural Networks for Prediction

03:25

We need to form clusters or groups of data so that performing actions on them becomes easier. Here we are going to classify customers based on marketing.

Neural Networks for Classification

01:31

We will also perform forecasting using neural networks. In this video, we will forecast a time series.

Neural Networks for Forecasting

01:16

After working with neural networks, we should also know the merits and demerits of the famous technology.

Merits and Demerits of Neural Networks

02:21

Test Your Knowledge

5 questions

About the Instructor

Tech Knowledge in Motion

Packt has been committed to developer learning since 2004. A lot has changed in software since then - but Packt has remained responsive to these changes, continuing to look forward at the trends and tools defining the way we work and live. And how to put them to work.

With an extensive library of content - more than 4000 books and video courses -Packt's mission is to help developers stay relevant in a rapidly changing world. From new web frameworks and programming languages, to cutting edge data analytics, and DevOps, Packt takes software professionals in every field to what's important to them now.

From skills that will help you to develop and future proof your career to immediate solutions to every day tech challenges, Packt is a go-to resource to make you a better, smarter developer.

Packt Udemy courses continue this tradition, bringing you comprehensive yet concise video courses straight from the experts.

- About Us
- Udemy for Business
- Become an Instructor
- Affiliate
- Blog
- Topics
- Mobile Apps
- Support
- Careers
- Resources

- Copyright © 2017 Udemy, Inc.
- Terms
- Privacy Policy and Cookie Policy
- Intellectual Property