
Explore basic series arithmetic, creating series from dictionaries, and index handling in pandas, including isnull checks, null value detection, and named series and index; plus adding series with aligned indexes.
Modify a data frame by assigning to a column and using a series, create new columns, delete columns, and build data frames from dictionaries for Italy and Germany across years.
Demonstrate deleting data from the axis in Pandas with the drop method on series and dataframe, removing indexes and columns using axis controls.
Explore indexing, selection, and filtering in pandas series, including label and position retrieval, slicing, boolean masking, and updating values such as B and C, similar to numpy.
Explore indexing, selection, and filtering in a data frame, including selecting columns and rows with lists, boolean masks, and conditional updates.
explore arithmetics with series and data frames by adding pandas objects and aligning by index and columns. see how sums handle missing indices or columns with not a number.
Sort data in pandas by index and by columns using sort_index, and order values with sort_values. Learn to sort by a specific column (axis=1) or by index across rows.
Explore how pandas handles indexes with duplicate values by inspecting a series and a data frame, and verify index uniqueness using is unique while selecting entries by index label.
Demonstrate a pandas workflow by building a country population dataframe, adding female and male columns, computing 70% of male, calculating percent of male, and removing the dump column.
Describe how pandas handles non-numerical data, showing count, unique, top, and freq, and summarize methods like min, max, argmin, argmax, idxmin, idxmax, sum, mean, median, var, std, diff, and pct_change.
Explore using the isin method to filter unique values and build a boolean mask, then apply pandas value_counts across Q1–Q3 with zeros for non-numeric values.
Learn to manipulate missing data in pandas by filtering with dropna, using a series containing none values, and observing how dropna drops missing entries while preserving valid data.
Learn how to fill missing data in pandas using fillna with scalar values, per-column fills via a dictionary, and mean (average) imputation for a series in a data frame.
Explore hierarchical indexing in pandas, using multi-level indexes and unstack to transform a multi-index series into a data frame, with practical examples of indexing, selecting, and unstacking.
Create a pandas data frame from cattle, chicken, and sheep, indexed by country; use lambda to find min and max, then compute sum and average; sort by index and sheep.
Learn to read and write data from files with pandas, including multi-index creation using a list of index columns, handling missing values with na_values, and skipping rows with read_csv.
Learn to perform partial reading of large text files with pandas by reading in chunks using read_csv and chunk size, then compute value counts by key.
Read excel files into pandas data frames using xlrd and openpyxl, load a file, parse sheet one, and produce a data frame with columns A, B, C, D, and message.
Learn json data concepts and handling in python by using the json library to load and dump data, and pandas to build dataframes from json with selected fields.
Learn to load three year data frames with pandas read_csv, filter by gender, sum totals, and count names like Grace and Alice.
Explore creating a pie graph with matplotlib using a one-dimensional data array, customizing with labels, explode, colors from seaborn palettes, auto percent, and an optional shadow.
Read macro data and marker data csv files into data frames, create frames a, b, and c with year, real gdp, and real cons, and plot the graph for 1960–1963.
Read macro data from csv files with pandas to build data frames. Visualize real GDP and related metrics using line plots and horizontal bar charts.
Master data wrangling in Python with Pandas by reading, cleaning, transforming, and rearranging data, and learn how to combine datasets with merge and concat operations.
Merge data frames using pandas' merge method to link rows by keys. Learn inner, left, right, and outer joins and how to use on, left_on, and right_on for column names.
Explore merging dataframes by index and by column in pandas, using left_on and right_index, including multi-index joins and the outer join to align all indexes.
Explore how to concatenate data frames in pandas using pd.concat, control join behavior with axis and inner join, and build hierarchical indexes with keys.
Learn how hierarchical indexing enables data frame rearrangement in pandas, using stack and unstuck to rotate between rows and columns.
Learn to replace values in pandas series using the replace method, including replacing with numpy NaN, handling multiple replacements, and using a dict to map old values to new ones.
Group a dataframe by a dictionary mapping columns to red and blue, then sum values to compare red (a, b) and blue (c, d) groups.
Explore grouping with functions in Pandas, using len to group by index length and compute sums or minimum. Build multi index results; relate functions to lists, arrays, dictionaries, and series.
Explore pivot tables in pandas to summarize and aggregate data by multiple keys, using pivot_table and group by, with margins and aggregate functions like count and sum.
Learn to generate date time indexes with pandas date_range, using start, end, or periods, and apply daily (D) or last business day (BM) frequencies.
Learn to convert period index objects to different frequencies in pandas, transitioning from annual December periods to monthly start and end indexes, and creating series with month-end and month-start indexes.
Explore resampling and frequency conversion in pandas, focusing on downsampling high-frequency data to lower frequencies and upsampling to higher frequencies, with mean and sum aggregations.
Draw time series in pandas by loading stock data from csv and parsing dates; set a date index, select AAPL, MSFT, XOM, SP, fill missing values forward, and plot trends.
The course title is “Data analysis and visualization using Python” and it is using the pandas library.
It is divided into 7 chapters.
Chapter 1 talk about creation of pandas objects such as: Series, DataFrame, Index. This chapter includes basic arithmetic with pandas object. Also it describes other operations with pandas object such as: reindexing, deleting data from axis, filtering, indexing and sorting.
Chapter 2 describes statistical methods applied in pandas objects and manipulation with data inside pandas object. It describes pandas operations such as: unique values, value counting, manipulation with missing data, filtering and filling missing data.
Chapter 3 talks about reading and writing data from text file format and Microsoft Excel. Partial reading of large text files is also described with an example.
Chapter 4 describes data visualization using matplotlib library. It has example about the following graphs: line, scatter, bar and pie. Setting title, legend and labels in the graph is also describes with some practical examples. Drawing from pandas object is also described.
Chapter 5 talks about data wrangling. Merging Series object and DataFrame object is described with practical examples. Combining pandas objects and merging them is part of this chapter.
Chapter 6 talks about various forms of data aggregation and grouping. Creating and using pivot tables is also described.
Chapter 7 talks about time Series creation and manipulation. Classes DatetimeIndex and Period are included in the description of the chapter. Indexing and selection is described with practical examples.