Cluster Analysis and Unsupervised Machine Learning in Python

Name: Cluster Analysis and Unsupervised Machine Learning in Python
Rating: 4.7 (5236 reviews)

Data science techniques for pattern recognition, data mining, k-means clustering, hierarchical clustering, and KDE.

Created by Lazy Programmer Team, Lazy Programmer Inc.

Last updated 2/2026

English

English [Auto]

What you'll learn

Understand the regular K-Means algorithm
Understand and enumerate the disadvantages of K-Means Clustering
Understand the soft or fuzzy K-Means Clustering algorithm
Implement Soft K-Means Clustering in Code
Understand Hierarchical Clustering
Explain algorithmically how Hierarchical Agglomerative Clustering works
Apply Scipy's Hierarchical Clustering library to data
Understand how to read a dendrogram
Understand the different distance metrics used in clustering
Understand the difference between single linkage, complete linkage, Ward linkage, and UPGMA
Understand the Gaussian mixture model and how to use it for density estimation
Write a GMM in Python code
Explain when GMM is equivalent to K-Means Clustering
Explain the expectation-maximization algorithm
Understand how GMM overcomes some disadvantages of K-Means
Understand the Singular Covariance problem and how to fix it

Course content

9 sections • 57 lectures • 7h 57m total length

Introduction • 5:03
Course Outline • 4:34
What is unsupervised learning used for? • 5:31
This lecture describes what unsupervised machine learning (not just clustering) is used for in general.
There are 2 major categories:

1) density estimation
If we can figure out the probability distribution of the data, not only is this a model of the data, but we can then sample from the distribution to generate new data.
For example, we can train a model to read lots of Shakespeare and then generate writing in the style of Shakespeare.

2) latent variables
This allows us to find the underlying cause of the data we've observed by reducing it to a small set of factors.
For example, if we measure the heights of all the people in our class and plot them on a histogram, we may notice 2 "bumps".
These "bumps" correspond to male heights and female heights.
Thus, being male or female is the hidden cause of higher / lower height values.
Clustering does exactly this - it tells us how the data can be split up into distinct groups / segments / categories.
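The height example can be sketched with synthetic data. The group means, spreads, and sample size below are made-up illustrative values, not figures from the course; the point is only that a hidden group id generates the observed heights.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hidden (latent) variable: which group each person belongs to.
# Group ids 0 and 1 are arbitrary; all parameters are illustrative assumptions.
z = rng.integers(0, 2, size=1000)
means = np.array([163.0, 177.0])   # assumed group means in cm
stds = np.array([6.0, 7.0])        # assumed group standard deviations in cm

# Observed variable: height, drawn from the Gaussian of each person's group
heights = rng.normal(means[z], stds[z])

# A histogram of the pooled heights mixes both groups together
counts, bin_edges = np.histogram(heights, bins=30)

# Conditioning on the hidden variable recovers the two group averages
group_means = np.array([heights[z == k].mean() for k in (0, 1)])
```

A clustering algorithm sees only `heights` and has to infer something like `z` on its own.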

Unsupervised machine learning can also be used for:
dimensionality reduction - modern datasets can have millions of features, but many of them may be correlated
visualization - you can't see a million-dimensional dataset, but if you reduce the dimensionality to 2, then it can be visualized
Why Use Clustering? • 9:20
Explore why clustering matters in unsupervised learning, including automatic labeling, faster information retrieval, density estimation, and applications in generative models.
Where to get the code • 4:36
How to Succeed in this Course • 3:04

An Easy Introduction to K-Means Clustering • 7:06
Hard K-Means: Exercise Prompt 1 • 9:13
Generate a two-dimensional data set with n samples and d features, assign cluster labels for k clusters, compute the k centroids, and plot data points with color-coded clusters and centroids.
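One possible take on this exercise, as a rough sketch rather than the course's solution. The sizes `n`, `d`, `k` and the random labels are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d, k = 300, 2, 3                    # illustrative sizes

X = rng.normal(size=(n, d))            # n samples, d features
labels = rng.integers(0, k, size=n)    # an arbitrary cluster label per sample

# Centroid of cluster j = mean over axis 0 of the points labeled j
centroids = np.array([X[labels == j].mean(axis=0) for j in range(k)])
```

With matplotlib installed, `plt.scatter(X[:, 0], X[:, 1], c=labels)` followed by a second `plt.scatter` call on `centroids` would give the color-coded plot.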
Hard K-Means: Exercise 1 Solution • 11:09
Learn how to generate three Gaussian data clouds and solve the hard k-means exercise by computing centroids as means over axis zero and marking the centers with red stars.
Hard K-Means: Exercise Prompt 2 • 5:04
Assign each data point to the closest mean using Euclidean distance, given a set of means and data points, and output the cluster identity vector for visualization.
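The assignment step might look like the following sketch. The function name `assign_clusters` and the toy arrays are hypothetical, chosen only for illustration.

```python
import numpy as np

def assign_clusters(X, means):
    """Return, for each row of X, the index of the closest row of means."""
    # diffs[i, j, :] = X[i] - means[j], computed via broadcasting
    diffs = X[:, None, :] - means[None, :, :]
    sq_dists = (diffs ** 2).sum(axis=2)   # shape (n, k)
    # The squared distance has the same argmin as the Euclidean distance
    return sq_dists.argmin(axis=1)

X = np.array([[0.0, 0.0], [0.2, -0.1], [5.0, 5.0], [4.8, 5.3]])
means = np.array([[0.0, 0.0], [5.0, 5.0]])
labels = assign_clusters(X, means)
```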
Hard K-Means: Exercise 2 Solution • 7:08
Hard K-Means: Exercise Prompt 3 • 6:55
Explore hard k-means: initialize centers at random points, iteratively assign points to nearest centers, recompute means, and run until convergence, noting local optima.
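Putting the steps together, a minimal sketch of the full loop could look like this. The function name, the stopping rule (assignments stop changing), and the empty-cluster handling are assumptions, not necessarily what the course uses.

```python
import numpy as np

def hard_kmeans(X, k, max_iter=100, seed=0):
    rng = np.random.default_rng(seed)
    # Initialize centers at k distinct, randomly chosen data points
    centers = X[rng.choice(len(X), size=k, replace=False)].copy()
    labels = np.full(len(X), -1)
    for _ in range(max_iter):
        # Assignment step: nearest center by squared Euclidean distance
        sq_dists = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        new_labels = sq_dists.argmin(axis=1)
        if np.array_equal(new_labels, labels):
            break                      # assignments stopped changing
        labels = new_labels
        # Update step: each center moves to the mean of its members
        for j in range(k):
            members = X[labels == j]
            if len(members) > 0:       # keep the old center if a cluster is empty
                centers[j] = members.mean(axis=0)
    return centers, labels

rng = np.random.default_rng(1)
X = np.vstack([rng.normal(loc=c, scale=0.5, size=(100, 2))
               for c in ([0, 0], [5, 5], [0, 5])])
centers, labels = hard_kmeans(X, k=3)
```

Because the result depends on the random initialization, running with several values of `seed` and comparing the outcomes is one way to see local optima in practice.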
Hard K-Means: Exercise 3 Solution • 16:22
Hard K-Means Objective: Theory • 13:01
Hard K-Means Objective: Code • 5:13
Complete the hard k-means objective by plotting the cost per iteration during training, using the squared Euclidean distances to the cluster means to verify convergence.
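A sketch of tracking the cost across iterations, under the assumption that the cost is the sum of squared distances from each point to its assigned center. The helper name `kmeans_cost` and the toy data are hypothetical.

```python
import numpy as np

def kmeans_cost(X, centers, labels):
    """Sum of squared Euclidean distances from each point to its assigned center."""
    return float(((X - centers[labels]) ** 2).sum())

rng = np.random.default_rng(0)
X = np.vstack([rng.normal(loc=c, scale=0.5, size=(50, 2))
               for c in ([0, 0], [4, 4])])
centers = X[rng.choice(len(X), size=2, replace=False)].copy()

costs = []
for _ in range(10):
    # Assignment step, then record the cost for the current centers
    sq_dists = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    labels = sq_dists.argmin(axis=1)
    costs.append(kmeans_cost(X, centers, labels))
    # Update step
    for j in range(2):
        if np.any(labels == j):
            centers[j] = X[labels == j].mean(axis=0)
```

Plotting `costs` (for example with `plt.plot(costs)`) should give a curve that never goes up, since neither step can increase this cost.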
Soft K-Means • 5:41
The Soft K-Means Objective Function • 1:39
Explore the soft k-means objective function J for unsupervised learning in Python, where responsibilities weight squared distances to cluster means, optimized by coordinate descent with guaranteed decrease and convergence.
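A rough sketch of one common formulation of soft k-means, in which responsibilities are a softmax over negative squared distances. The stiffness parameter `beta` and all function names are assumptions for illustration; the course's exact definition may differ.

```python
import numpy as np

def squared_distances(X, means):
    """Squared Euclidean distance from every row of X to every row of means."""
    return ((X[:, None, :] - means[None, :, :]) ** 2).sum(axis=2)

def responsibilities(X, means, beta=1.0):
    """Soft assignment of each point to each cluster; rows sum to 1."""
    sq = squared_distances(X, means)
    # Shift by the row minimum before exponentiating for numerical stability
    weights = np.exp(-beta * (sq - sq.min(axis=1, keepdims=True)))
    return weights / weights.sum(axis=1, keepdims=True)

def soft_cost(X, means, R):
    """Objective J: squared distances weighted by responsibilities."""
    return float((R * squared_distances(X, means)).sum())

def update_means(X, R):
    """Each mean becomes the responsibility-weighted average of the data."""
    return (R.T @ X) / R.sum(axis=0)[:, None]

X = np.array([[0.0, 0.0], [0.5, 0.0], [4.0, 4.0], [4.5, 4.0]])
means = np.array([[0.0, 1.0], [4.0, 3.0]])
R = responsibilities(X, means)
cost_before = soft_cost(X, means, R)
means = update_means(X, R)
cost_after = soft_cost(X, means, R)   # same R, updated means
```

Holding `R` fixed, the weighted average is the minimizer of J with respect to the means, so `cost_after` cannot exceed `cost_before`.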
Soft K-Means in Python Code • 10:03
How to Pace Yourself • 3:19
Visualizing Each Step of K-Means • 2:18
Examples of where K-Means can fail • 7:32
Examine how k-means can fail in real-world clustering by exploring the donut problem, elongated Gaussians, and clusters of different densities. Analyze the cost function outcomes and visualizations to understand its limitations.
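The donut problem can be reproduced with synthetic data. The radii, noise level, and helper name `ring` below are illustrative assumptions. The sketch only shows why a centroid-based method struggles here: both rings have nearly the same mean.

```python
import numpy as np

rng = np.random.default_rng(0)

def ring(radius, n, noise=0.3):
    """Sample n noisy points around a circle of the given radius."""
    angles = rng.uniform(0, 2 * np.pi, size=n)
    radii = radius + rng.normal(scale=noise, size=n)
    return np.column_stack([radii * np.cos(angles), radii * np.sin(angles)])

inner = ring(radius=3.0, n=500)
outer = ring(radius=10.0, n=500)
X = np.vstack([inner, outer])    # the "donut" data set

# Both groups are centered near the origin, so no pair of centroids
# can represent "inner ring" versus "outer ring".
inner_mean = inner.mean(axis=0)
outer_mean = outer.mean(axis=0)
```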
Disadvantages of K-Means Clustering • 2:13
Examine the disadvantages of k-means clustering, including choosing k, sensitivity to initialization, and convergence to local minima. It also struggles with non-spherical shapes and ignores data density.
How to Evaluate a Clustering (Purity, Davies-Bouldin Index) • 6:33
Using K-Means on Real Data: MNIST • 5:00
One Way to Choose K • 5:15
K-Means Application: Finding Clusters of Related Words • 8:38
Learn to apply k-means clustering to text by building a term-document matrix and tf-idf, reducing to two dimensions for visualization, and printing cluster word lists.
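A toy sketch of the term-document and tf-idf step. The three-sentence corpus is made up, and the weighting used (raw count times log of N over document frequency) is one common tf-idf variant, not necessarily the one used in the lecture.

```python
import numpy as np

docs = ["the cat sat", "the dog sat", "the dog barked"]   # toy corpus

vocab = sorted({w for doc in docs for w in doc.split()})
word_index = {w: i for i, w in enumerate(vocab)}

# Term-document matrix: one row per word, one column per document
counts = np.zeros((len(vocab), len(docs)))
for j, doc in enumerate(docs):
    for w in doc.split():
        counts[word_index[w], j] += 1

# Inverse document frequency: words that appear everywhere get weight 0
doc_freq = (counts > 0).sum(axis=1)
idf = np.log(len(docs) / doc_freq)

tfidf = counts * idf[:, None]
```

Each row of `tfidf` is then a vector describing one word, which is what gets clustered when looking for related words.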
Clustering for NLP and Computer Vision: Real-World Applications • 6:58
Suggestion Box • 3:10

Visual Walkthrough of Agglomerative Hierarchical Clustering • 2:35
Agglomerative Clustering Options • 3:38
Learn about the different possible distance metrics that can be used for both k-means and agglomerative clustering, and what constitutes a valid distance metric. Learn about the different linkage methods for hierarchical clustering, like single linkage, complete linkage, UPGMA, and Ward linkage.
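The linkage methods differ only in how they score the distance between two clusters. A small sketch with hypothetical function names, using Euclidean distance; the Ward score here is the increase in within-cluster sum of squares caused by a merge, which libraries may rescale.

```python
import numpy as np

def pairwise_dists(A, B):
    """Euclidean distance between every point in A and every point in B."""
    return np.sqrt(((A[:, None, :] - B[None, :, :]) ** 2).sum(axis=2))

def single_linkage(A, B):
    return float(pairwise_dists(A, B).min())    # closest pair

def complete_linkage(A, B):
    return float(pairwise_dists(A, B).max())    # farthest pair

def upgma(A, B):
    return float(pairwise_dists(A, B).mean())   # average over all pairs

def ward_increase(A, B):
    """Increase in within-cluster sum of squares if A and B are merged."""
    n_a, n_b = len(A), len(B)
    diff = A.mean(axis=0) - B.mean(axis=0)
    return n_a * n_b / (n_a + n_b) * float(diff @ diff)

A = np.array([[0.0, 0.0], [1.0, 0.0]])
B = np.array([[3.0, 0.0], [5.0, 0.0]])
scores = (single_linkage(A, B), complete_linkage(A, B),
          upgma(A, B), ward_increase(A, B))
```

For these two toy clusters the four scores come out as 2.0, 5.0, 3.5, and 12.25 respectively.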
Using Hierarchical Clustering in Python and Interpreting the Dendrogram • 4:38
Explore hierarchical clustering in Python using a library, comparing Ward, single, and complete linkage, interpreting the dendrogram to identify three natural clusters and understand the chaining effect.
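A minimal sketch using SciPy's `scipy.cluster.hierarchy` module, assuming that is the library in question. The three synthetic clouds are illustrative and are not the lecture's data.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

rng = np.random.default_rng(0)
X = np.vstack([rng.normal(loc=c, scale=0.3, size=(20, 2))
               for c in ([0, 0], [5, 5], [0, 5])])

results = {}
for method in ("ward", "single", "complete"):
    # Z encodes the merge tree: one row per merge, n - 1 rows in total
    Z = linkage(X, method=method)
    # Cut the tree so that at most 3 flat clusters remain
    results[method] = fcluster(Z, t=3, criterion="maxclust")
```

With matplotlib installed, `scipy.cluster.hierarchy.dendrogram(Z)` draws the tree. On data this well separated all three linkages agree; the chaining effect of single linkage tends to show up when clusters are connected by a trail of intermediate points.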
Application: Evolution • 14:00
Application: Donald Trump vs. Hillary Clinton Tweets • 18:34
Apply hierarchical clustering to Donald Trump and Hillary Clinton tweets using tf-idf vectorization to form two clusters and assess their purity, extracting top tf-idf words.
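Purity can be computed as in the sketch below, which uses the common hard-assignment definition (the fraction of points that belong to the majority class of their cluster). The course may use a different variant, and the label arrays are hypothetical.

```python
import numpy as np

def purity(true_labels, cluster_labels):
    """Fraction of points that match the majority true label of their cluster."""
    true_labels = np.asarray(true_labels)
    cluster_labels = np.asarray(cluster_labels)
    total = 0
    for c in np.unique(cluster_labels):
        members = true_labels[cluster_labels == c]
        _, counts = np.unique(members, return_counts=True)
        total += counts.max()          # size of the majority class in cluster c
    return total / len(true_labels)

true_labels = [0, 0, 0, 1, 1, 1]       # hypothetical author ids
cluster_labels = [0, 0, 1, 1, 1, 1]    # hypothetical cluster assignments
score = purity(true_labels, cluster_labels)
```

Here cluster 0 is pure (2 of 2) and cluster 1 has a majority of 3 out of 4, so the score is 5/6.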

Gaussian Mixture Model (GMM) Algorithm • 15:31
Explore the Gaussian mixture model as a generalization of k-means, learn maximum likelihood estimation for Gaussian parameters, and master the expectation maximization algorithm with E and M steps and responsibilities.
Write a Gaussian Mixture Model in Python Code • 18:54
Implement a Gaussian mixture model in Python using the two-step EM algorithm: generate data from three Gaussian clouds, compute responsibilities, and update the means, covariances, and priors.
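A compact sketch of EM for a GMM in NumPy, written for illustration rather than taken from the course. The function names, the fixed iteration count, and the small `reg` term added to each covariance are assumptions.

```python
import numpy as np

def gaussian_pdf(X, mean, cov):
    """Multivariate normal density evaluated at each row of X."""
    d = X.shape[1]
    diff = X - mean
    solved = np.linalg.solve(cov, diff.T).T            # cov^{-1} (x - mean)
    exponent = -0.5 * (diff * solved).sum(axis=1)
    norm = np.sqrt(((2 * np.pi) ** d) * np.linalg.det(cov))
    return np.exp(exponent) / norm

def fit_gmm(X, k, n_iter=50, reg=1e-6, seed=0):
    n, d = X.shape
    rng = np.random.default_rng(seed)
    means = X[rng.choice(n, size=k, replace=False)].copy()
    covs = np.array([np.eye(d) for _ in range(k)])
    priors = np.full(k, 1.0 / k)
    for _ in range(n_iter):
        # E-step: responsibility of each component for each point
        weighted = np.column_stack([
            priors[j] * gaussian_pdf(X, means[j], covs[j]) for j in range(k)
        ])
        R = weighted / weighted.sum(axis=1, keepdims=True)
        # M-step: re-estimate priors, means, and covariances
        Nk = R.sum(axis=0)
        priors = Nk / n
        means = (R.T @ X) / Nk[:, None]
        for j in range(k):
            diff = X - means[j]
            # reg * I is an assumed safeguard against singular covariances
            covs[j] = (R[:, j, None] * diff).T @ diff / Nk[j] + reg * np.eye(d)
    return priors, means, covs, R

rng = np.random.default_rng(1)
X = np.vstack([rng.normal(loc=c, scale=0.5, size=(100, 2))
               for c in ([0, 0], [5, 5], [0, 5])])
priors, means, covs, R = fit_gmm(X, k=3)
```

Like k-means, the result depends on the initialization, so different values of `seed` can lead to different fits.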
Practical Issues with GMM / Singular Covariance • 9:07
Identify why singular covariance occurs in Gaussian mixture models when a cluster contains a single point or has near-zero variance; apply covariance restrictions (full, diagonal, spherical, or tied) to prevent it.
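The failure itself is easy to reproduce. The sketch below shows a component that has collapsed onto one point, then applies a small constant added to the diagonal. That fix is a different, commonly used safeguard and is swapped in here for brevity; it is not one of the covariance restrictions named in the lecture description, and the value of `eps` is an arbitrary assumption.

```python
import numpy as np

# A component that has collapsed onto a single (repeated) point
points = np.array([[2.0, 3.0], [2.0, 3.0], [2.0, 3.0]])
mean = points.mean(axis=0)
diff = points - mean
cov = diff.T @ diff / len(points)      # every entry is zero

# Determinant 0 means the covariance cannot be inverted,
# so the Gaussian density is undefined for this component.
det_before = np.linalg.det(cov)

# Assumed safeguard: add a small constant to the diagonal
eps = 1e-3
cov_fixed = cov + eps * np.eye(2)
det_after = np.linalg.det(cov_fixed)
```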
Comparison between GMM and K-Means • 3:55
Kernel Density Estimation • 6:24
GMM vs Bayes Classifier (pt 1) • 9:28
GMM vs Bayes Classifier (pt 2) • 11:30
Expectation-Maximization (pt 1) • 11:45
Expectation-Maximization (pt 2) • 2:24
Expectation-Maximization (pt 3) • 8:09
Apply the full expectation-maximization algorithm to update Gaussian mixture parameters, deriving the E-step and M-step for pi, mu, and Sigma, with notes on derivatives and constraints.
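For reference, the standard textbook form of these updates is given here. The notation (responsibilities written as gamma, N data points, K components) is an assumption and may differ from the lecture's.

```latex
% E-step: responsibility of component k for point x_n
\gamma_{nk} = \frac{\pi_k \, \mathcal{N}(x_n \mid \mu_k, \Sigma_k)}
                   {\sum_{j=1}^{K} \pi_j \, \mathcal{N}(x_n \mid \mu_j, \Sigma_j)}

% M-step: re-estimate the parameters from the responsibilities
N_k = \sum_{n=1}^{N} \gamma_{nk}, \qquad
\pi_k = \frac{N_k}{N}, \qquad
\mu_k = \frac{1}{N_k} \sum_{n=1}^{N} \gamma_{nk} \, x_n, \qquad
\Sigma_k = \frac{1}{N_k} \sum_{n=1}^{N} \gamma_{nk} \, (x_n - \mu_k)(x_n - \mu_k)^{\top}
```

The update for pi comes from maximizing the expected log-likelihood subject to the constraint that the pi values sum to 1, typically handled with a Lagrange multiplier.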
Future Unsupervised Learning Algorithms You Will Learn • 1:01
Explore future unsupervised learning algorithms in Python, latent variables and mixture models. Study hidden Markov models for sequence likelihoods and apply them to DNA and text in deep learning frameworks.

How to Succeed in this Course (Long Version) • 10:24
Is this for Beginners or Experts? Academic or Practical? Fast or slow-paced? • 22:04
Clarify who should take this course by detailing prerequisites and perspectives, and contrast academic rigor with practical coding, Python, APIs, and real-world data applications.
Machine Learning and AI Prerequisite Roadmap (pt 1) • 11:18
Machine Learning and AI Prerequisite Roadmap (pt 2) • 16:07

Requirements

Know how to code in Python and Numpy
Install Numpy and Scipy
Matrix arithmetic, probability

Description

Cluster analysis is a staple of unsupervised machine learning and data science.

It is very useful for data mining and big data because it automatically finds patterns in the data, without the need for labels, unlike supervised machine learning.

In a real-world environment, you can imagine that a robot or an artificial intelligence won’t always have access to the optimal answer, or maybe there isn’t an optimal correct answer. You’d want that robot to be able to explore the world on its own, and learn things just by looking for patterns.

Do you ever wonder how we get the data that we use in our supervised machine learning algorithms?

We always seem to have a nice CSV or a table, complete with Xs and corresponding Ys.

If you haven’t been involved in acquiring data yourself, you might not have thought about this, but someone has to make this data!

Those “Y”s have to come from somewhere, and a lot of the time that involves manual labor.

Sometimes, you don’t have access to this kind of information or it is infeasible or costly to acquire.

But you still want to have some idea of the structure of the data. If you're doing data analytics, automating pattern recognition in your data would be invaluable.

This is where unsupervised machine learning comes into play.

In this course we are first going to talk about clustering. This is where instead of training on labels, we try to create our own labels! We’ll do this by grouping together data that looks alike.

There are 2 methods of clustering we’ll talk about: k-means clustering and hierarchical clustering.

Next, because in machine learning we like to talk about probability distributions, we’ll go into Gaussian mixture models and kernel density estimation, where we talk about how to "learn" the probability distribution of a set of data.

One interesting fact is that under certain conditions, Gaussian mixture models and k-means clustering are exactly the same! We’ll prove why this is the case.

All the algorithms we’ll talk about in this course are staples in machine learning and data science, so if you want to know how to automatically find patterns in your data with data mining and pattern extraction, without needing someone to put in manual work to label that data, then this course is for you.

All the materials for this course are FREE. You can download and install Python, Numpy, and Scipy with simple commands on Windows, Linux, or Mac.

This course focuses on "how to build and understand", not just "how to use". Anyone can learn to use an API in 15 minutes after reading some documentation. It's not about "remembering facts", it's about "seeing for yourself" via experimentation. It will teach you how to visualize what's happening in the model internally. If you want more than just a superficial look at machine learning models, this course is for you.

"If you can't implement it, you don't understand it"

Or as the great physicist Richard Feynman said: "What I cannot create, I do not understand".
My courses are the ONLY courses where you will learn how to implement machine learning algorithms from scratch.
Other courses will teach you how to plug in your data into a library, but do you really need help with 3 lines of code?
After doing the same thing with 10 datasets, you realize you didn't learn 10 things. You learned 1 thing, and just repeated the same 3 lines of code 10 times...

Suggested Prerequisites:

matrix addition, multiplication
probability
Python coding: if/else, loops, lists, dicts, sets
Numpy coding: matrix and vector operations, loading a CSV file

WHAT ORDER SHOULD I TAKE YOUR COURSES IN?

Check out the lecture "Machine Learning and AI Prerequisite Roadmap" (available in the FAQ of any of my courses, including the free Numpy course)

Who this course is for:

Students and professionals interested in machine learning and data science
People who want an introduction to unsupervised machine learning and cluster analysis
People who want to know how to write their own clustering code
Professionals interested in data mining big data sets to look for patterns automatically

Course content

Introduction to Unsupervised Learning • 6 lectures • 32min

K-Means Clustering • 22 lectures • 2hr 30min

Hierarchical Clustering • 5 lectures • 43min

Gaussian Mixture Models (GMMs) • 11 lectures • 1hr 38min

Appendix / FAQ Finale • 1 lecture • 4min

Setting Up Your Environment (FAQ by Student Request) • 3 lectures • 42min

Extra Help With Python Coding for Beginners (FAQ by Student Request) • 4 lectures • 42min

Effective Learning Strategies for Machine Learning (FAQ by Student Request) • 4 lectures • 1hr

Appendix / FAQ Finale • 1 lecture • 6min
