Udemy
    •  
    •  
    •  
    •  
    •  
    •  
    •  
    •  
Turn what you know into an opportunity and reach millions around the world.
Learn More
Your cart is empty.
Keep shopping
Data Mining - Clustering/Segmentation Using R, Tableau
Rating: 4.1 out of 5(94 ratings)
1,662 students

Data Mining - Clustering/Segmentation Using R, Tableau

Learn Data Mining - Clustering Segmentation Using R,Tableau
Created byExcelR EdTech
Last updated 8/2018
English

What you'll learn

  • Understand the various types of Data Mining Techniques
  • Learn about the K-Means clustering algorithm & how to use R to accomplish this

Course content

6 sections45 lectures6h 58m total length
  • Introduction to Clustering Using R and Tableau4:17

    Introduction of the trainer & the agenda of the various concepts that you will learn as part of Data Science Learn  Data Mining-Clustering using R and Tableau also Tableau will be discussed.

  • Introduction To Clustering12:23

    Why do we need to do clustering and how does it help in making a business decision.What is the primary objective of Clustering; learn about it with an example.

  • Types Of Data Mining Techniques11:32

    Data Mining in a Nutshell!What are the two main branches of Data mining and their distributaries? The Two approaches to clustering, and introduction to principle of Hierarchical clustering.

  • Hierarchical Clustering Introduction9:44

    Visualisation of Hierarchical clustering, Grouping of records & division of cluster based on distance measure. Rules for the measure of distance

  • Hierarchical Clistering Case Study9:41

    Hierarchical clustering using a case study, Computation of distance amongst two records having multiple inputs using the eucleadian method. Understanding the need for standardization of data; Z-score, Other types of distance measures

  • Distance For Categorical Data9:36

    Learn How to measure of distance between categorical data. Various measurements using binary matrix for twin category data. Distance measurement rule for for more than 2 categories.

  • Distance Between Clusters With Case Study13:01

    Learn about Various distance measure criteria for between clusters. “Hands On” exercise on hierarchical clustering and summarizing using Dendrogram

  • Distance For Mixed Data Case Study Part 115:13

    Learn about Measurement of distance between records having both numerical and categorical variables. Method of creation of Dummy variable data from categorical data. The need and the method of standardization the numerical data to the same scale as categorical dummy variable data.

  • Distance For Mixed Data Case Study Part 25:49

    Learn about Measure of distance using Gower’s General Similarity coefficient for mixed data using weighted means.

  • Hierarchical Clustering Synopsis6:24

    Learn How Clustering helps - the consumer perspective and the supplier perspective; Other insights from clusters and their labeling

  • Hierarchical Clustering Using R Part 110:45

    Learn about Reading an XLSX file into R; Understanding the Database (MBA); Scaling the data to make it unitless using Z Scores

  • Hierarchical Clustering Using R Part 211:02

    Learn about The ‘hclust’ function with complete linkage; Visualization of the clustering as as dendrogram using the ‘plot’ function; splitting the dendrogram into ‘k’ clusters using the ‘cuttree’ function; assigning the records to the respective clusters

  • K Means Clustering Introduction4:44

    Learn about The main difference between Hierarchical and non-hierarchical clustering; similarity within, dissimilarity amongst clusters; Algorithm (or iterative steps) for K means clustering

  • K Means Clustering Using R - Part 113:41

    Learn about The main difference between Hierarchical and non-hierarchical clustering; similarity within, dissimilarity amongst clusters; Algorithm (or iterative steps) for K means clustering

  • K Means Clustering Using R - Part 218:12

    Learn R code for K means clustering using only two inputs; for ease in visualization; computing the ideal number of clusters; viewing the iterative process of clustering (giving and receiving clusters) as an animation

  • K Means Clustering Using R - Part 315:35

    Learn about random generation of centroid at the first iteration; The explanation of ‘receiving and giving clusters’ using the euclidean distance; understanding the math of centroid and the clustering based on distance; understanding the various attributes of the k-means output.

  • kselection, CLARA and PAM Clustering Using R - Part 411:54

    Learn The best value for ‘k’ using scree plot; determining the value of ‘k’ based on information gain; the point of the elbow; the ‘aggregate’ function for viewing each cluster as one record and analysis of the same; Labeling of clusters

  • Difference Between K Means And Hierarchical3:32

    Learn about The Pros and Cons of K means clustering and Hierarchical Clustering

  • Summary Of K Means Clustering3:31

    Learn about Selection of ‘k’ based on the simplicity or adequacy; risks with the ‘k’ value - Local Minima problem; Cross checking of clusters for consistency;

  • K Means Clustering Case Study8:49

    Learn about accelerometer data set for analysing the steps taken by different user as tracked by their smart phones; Importance of Domain knowledge

  • Recap Data Mining Clustering12:30

    Learn about Data Mining unsupervised; Hierarchical Clustering; Non Hierarchical Clustering; Distance measure for continuous and discrete; Types of Linkage; Dendrogram;  Sum of Sqares distances between & within Clusters

  • Quiz-1

Requirements

  • Knowledge of High school level Mathematics and Statistics
  • Rudimentary knowledge of any programming knowledge though not essential will be an added plus.
  • Download R & RStudio before starting this tutorial
  • Download all the datasets provided as part of this program to be able to practice & replicate the visualisations
  • Download Tableau Desktop Professional software from Tableau website

Description

Learn Data Mining - Clustering Segmentation Using R,Tableau is designed to cover majority of the capabilities of R from Analytics & Data Science perspective, which includes the following:

  • Learn about the usage of R for building Various models
  • Learn about the K-Means clustering algorithm & how to use R to accomplish the same
  • Learn about the science behind Clustering & accomplish the same using R and Tableau

Course is structured to start with introduction to the tool & the principles behind Clustering Using R tool. From there R is explained thoroughly including analytical concepts behind applicable Data Mining Techniques. Finally course ends with explanation of clustering using Tableau and statistics behind clustering along with interview questions for job seekers



Who this course is for:

  • All working professionals whose organization uses/creates data
  • Personnel who want to make their foray into Data Science