
Introduction of the trainer & the agenda of the various concepts that you will learn as part of Data Science Learn Data Mining-Clustering using R and Tableau also Tableau will be discussed.
Why do we need to do clustering and how does it help in making a business decision.What is the primary objective of Clustering; learn about it with an example.
Data Mining in a Nutshell!What are the two main branches of Data mining and their distributaries? The Two approaches to clustering, and introduction to principle of Hierarchical clustering.
Visualisation of Hierarchical clustering, Grouping of records & division of cluster based on distance measure. Rules for the measure of distance
Hierarchical clustering using a case study, Computation of distance amongst two records having multiple inputs using the eucleadian method. Understanding the need for standardization of data; Z-score, Other types of distance measures
Learn How to measure of distance between categorical data. Various measurements using binary matrix for twin category data. Distance measurement rule for for more than 2 categories.
Learn about Various distance measure criteria for between clusters. “Hands On” exercise on hierarchical clustering and summarizing using Dendrogram
Learn about Measurement of distance between records having both numerical and categorical variables. Method of creation of Dummy variable data from categorical data. The need and the method of standardization the numerical data to the same scale as categorical dummy variable data.
Learn about Measure of distance using Gower’s General Similarity coefficient for mixed data using weighted means.
Learn How Clustering helps - the consumer perspective and the supplier perspective; Other insights from clusters and their labeling
Learn about Reading an XLSX file into R; Understanding the Database (MBA); Scaling the data to make it unitless using Z Scores
Learn about The ‘hclust’ function with complete linkage; Visualization of the clustering as as dendrogram using the ‘plot’ function; splitting the dendrogram into ‘k’ clusters using the ‘cuttree’ function; assigning the records to the respective clusters
Learn about The main difference between Hierarchical and non-hierarchical clustering; similarity within, dissimilarity amongst clusters; Algorithm (or iterative steps) for K means clustering
Learn about The main difference between Hierarchical and non-hierarchical clustering; similarity within, dissimilarity amongst clusters; Algorithm (or iterative steps) for K means clustering
Learn R code for K means clustering using only two inputs; for ease in visualization; computing the ideal number of clusters; viewing the iterative process of clustering (giving and receiving clusters) as an animation
Learn about random generation of centroid at the first iteration; The explanation of ‘receiving and giving clusters’ using the euclidean distance; understanding the math of centroid and the clustering based on distance; understanding the various attributes of the k-means output.
Learn The best value for ‘k’ using scree plot; determining the value of ‘k’ based on information gain; the point of the elbow; the ‘aggregate’ function for viewing each cluster as one record and analysis of the same; Labeling of clusters
Learn about The Pros and Cons of K means clustering and Hierarchical Clustering
Learn about Selection of ‘k’ based on the simplicity or adequacy; risks with the ‘k’ value - Local Minima problem; Cross checking of clusters for consistency;
Learn about accelerometer data set for analysing the steps taken by different user as tracked by their smart phones; Importance of Domain knowledge
Learn about Data Mining unsupervised; Hierarchical Clustering; Non Hierarchical Clustering; Distance measure for continuous and discrete; Types of Linkage; Dendrogram; Sum of Sqares distances between & within Clusters
Learn about the concepts that would be discussed as part of this section. Learn on why Tableau is extremely valid to be in the current world alongside learning Tableau's dominance in the space of Business Intelligence & Analytics. You will notice that Tableau exists in the leaders quadrant of Gartner's magic quadrant since 4 years in a stretch.
Learn about Tableau and its importance in the space of Analytics. Learn on why data visualization is extremely imperative in the world of big data. Get a glimpse into the amount of data getting generated via various sources. Finally you will learn the importance of data visualization in identifying interesting insights from the data.
Learn about the various products which Tableau provides us to cater to our business needs. This includes Tableau Desktop, Server, Online, Mobile, Public, Reader. Learn about the Tableau architecture and various components including various data source formats, data connectors and APIs.
Learn about the various components of Tableau server including Data Server, VizQL Server, Application Server. Learn about the load balancing and failover support with Tableau. Also learn on what are the various browsers, mobile devices on which end users can view the visualization using Tableau
Learn on how to extract the data from various datasources using Tableau Desktop or Tableau Server. Also discussed about is the way to establish live connection with data sources using Tableau Desktop & Server. Learn on how we can extract inly relevant fields of the data source by writing queries. One can also automate the reports & establish security settings in Tableau.
Learn about the operating systems on which Tableau Desktop can be installed. Pros and Cons of using it on the two operating systems. Also learn about VSA of Tableau, the word coined by ExcelR Solutions.
Learn about Tableau start page & how to connect to wide variety of various datasources. Also learn about thumbnails & discover pane, which will help you gain access to amazing real-world projects which are solved in current world.
Learn about an interesting case study solved using data visualisation using Tableau, which is called as Location Based Analytics.
Learn about the worksheet interface of Tableau including dimensions & measures. Learn on how to covert dimensions to measures & vice-versa. Learn about the terminologies of various components of a worksheet & about show me panel.
Learn on how to accomplish a simple visualisation on Bar plot, function of Filters & Marks shelf. Also learn about the usage of double clicking on measures & dimensions.
Explore the ANOVA framework by dissecting weight loss data across three diet programs, defining grand mean, treatment sum of squares, and mean square error, and deriving the F statistic.
Compute anova for clustering by calculating centroids and sums of squares to obtain mean squares and a p-value from the f distribution, and use Tableau's elbow plot to pick clusters.
Create and edit clustering in Tableau desktop using a universities dataset, explore the default two clusters, adjust the number of clusters from 2 to 50, and group the results.
this lecture demonstrates min-max normalization for clustering, showing how to compute (x minus min) over (max minus min) to scale features like SAT scores and balance distance calculations.
Master k-means clustering in Tableau on a 25-university dataset, using centroids, euclidean distance, and min-max normalization, and interpreting within-cluster sum of squares across sat score, acceptance ratio, and top 10.
Turn categorical columns into dummy variables, normalize to a 0-1 range, and standardize data to enable hierarchical clustering and k-means clustering on mixed data in R and Tableau.
Learn to compute a distance matrix using Euclidean and other distance measures, apply hierarchical clustering with various linkages, and interpret a dendrogram to form and visualize clusters.
Explore how distances from cluster centers and the sum of squared distances guide choosing the number of clusters, illustrated through the elbow curve and its impact on cluster quality.
Learn how to apply mixed data k-means clustering with five clusters, including iterative updates, distance metrics, and turning nonnumeric features into dummy variables for normalization.
Learn Data Mining - Clustering Segmentation Using R,Tableau is designed to cover majority of the capabilities of R from Analytics & Data Science perspective, which includes the following:
Course is structured to start with introduction to the tool & the principles behind Clustering Using R tool. From there R is explained thoroughly including analytical concepts behind applicable Data Mining Techniques. Finally course ends with explanation of clustering using Tableau and statistics behind clustering along with interview questions for job seekers