
Explore sum and count on dataframes, using numeric_only to sum numeric columns, inspect totals such as total sales, total bedrooms, and Titanic survival counts, and count present values across datasets.
Learn how to compute mean, median, and mode with data frame methods that calculate these values for every column in a data frame, with examples from houses and Titanic.
Explore basic DataFrame methods in pandas, including min, max, mean, head, mode, and describe, to analyze bestseller data and identify top authors like Jeff Kinney.
Explore how a pandas series, a one-dimensional array with axis labels, differs from a data frame and use common methods like sum, min, and max on a column.
Explore value_counts to count unique values in a series or data frame, yielding a labeled counts series you can sort, ascending, top-n, and plot to reveal distributions.
Demonstrate how sorting text columns in Pandas handles string comparisons, revealing case sensitivity, and use a key function (often a lambda) to lowercase values before sorting for consistent order.
Sort values and sort index on series and value counts to shape bar plots. Use Titanic class data and bedroom counts to practice ascending and descending orders.
Practice indexing, sorting, and retrieval on a Pokemon dataset to compute averages, identify top attackers, and plot attack values, using in-place operations and name-based indexing.
Filter dataframes with boolean series in pandas, using comparison operators to select Titanic rows by conditions like sex, survived, or class, via bracket indexing.
Filter Titanic data by sex to compare survival rates and visualize with value counts and bar or pie plots; then filter King County real estate for top multimillion-dollar zip codes.
Learn to create dynamic columns in pandas by combining siblings and spouses with parents and children for Titanic data, then analyze via sorting and simple plots.
Import Joe Biden tweets dataset, index by tweet id, drop url and a row, add a user column, compute ratio and interactions, rank ten by ratio and eight by interactions.
Learn to rename columns and index labels in a data frame using a mapper dictionary with the rename method, applying changes via columns or index and optional in-place updates.
Demonstrates a pandas solution for updating a Netflix titles dataset: read with pipe separators, set index to show_id, update director and duration, rename columns, and mark favorites.
Welcome to (what I think is) the web's best course on Pandas, Matplotlib, Seaborn, and more! This course will level up your data skills to help you grow your career in Data Science, Machine Learning, Finance, Web Development, or any tech-adjacent field.
This is a tightly structured course that covers a ton, but it's all broken down into human-sized pieces rather than an overwhelming reference manual that throws everything at you at once. After each and every new topic, you'll have the chance to practice what you're learning and challenge yourself with exercises and projects. We work with dozens of fun and real-world datasets including Amazon bestsellers, Rivian stock prices, Presidential Tweets, Bitcoin historic data, and UFO sightings.
If you're still reading, let me tell you a little about the curriculum.. In the course, you'll learn how to:
Work with Jupyter Notebooks
Use Pandas to read and manipulate datasets
Work with DataFrames and Series objects
Organize, filter, clean, aggregate, and analyze DataFrames
Extract and manipulate date, time, and textual information from data
Master Hierarchical Indexing
Merge datasets together in Pandas
Create complex visualizations with Matplotlib
Use Seaborn to craft stunning and meaningful visualizations
Create line, bar, box, scatter, pie, violin, rug, swarm, strip, and other plots!
What makes this course different from other courses on the same topics? First and foremost, this course integrates visualizations as soon as possible rather than tacking it on at the end, as many other courses do. You'll be creating your first plots within the first couple of sections! Additionally, we start using real datasets from the get go, unlike most other courses which spend hours working with dull, fake data (colors, animals, etc) before you ever see your first real dataset. With all of that said, I feel bad trash talking my competitors, as there are quite a few great courses on the platform :)
I think that about wraps it up! The topics in this courses are extremely visual and immediate, which makes them a joy to teach (and hopefully for you to learn). If you have even a passing interest in these topics, you'll likely enjoy the course and tear through it quickly. This stuff might seem intimidating, but it's actually really approachable and fun! I'm not kidding when I say this is my favorite course I've ever made. I hope you enjoy it too.