Teach on Udemy

Turn what you know into an opportunity and reach millions around the world.

Learn More

Your cart is empty.

Keep shopping

Python for Beginners: Learn Data Science in Google Colab

Name: Python for Beginners: Learn Data Science in Google Colab
Rating: 4.7 (21 reviews)

Learn Python Basics in Google Colab to Start Your Machine Learning and Data Science Journey

Created byMd Ahshanul Haque

Last updated 5/2026

English

What you'll learn

Understand the basics of Python programming, including variables, data types, and functions.
Learn how to use Google Colab for writing, running, and sharing Python code in the cloud.
Understand and work with Python variables, data types, and arithmetic operators
Import, clean, and explore datasets such as CSV files in Google Colab
Understand core concepts of machine learning and deep learning
Apply classification techniques to categorize data accurately

Course content

11 sections • 58 lectures • 3h 46m total length

Course Overview: What You Will Learn1:15
In this lecture, you will get a complete overview of the course, including what you will learn and how the course is structured from basics to machine learning.
How to Start Using Google Colab4:07
In this video, you will get a comprehensive introduction to Google Colab, a free, cloud-based platform that allows you to write and run Python code directly in your browser. We will explore the Google Colab interface, learn how to create and save notebooks, and understand why Colab is an excellent tool for beginners in data analysis and machine learning. By the end of this video, you’ll be ready to start your coding journey without any software installation. Perfect for beginners!
Why We Use Google Colab for Python1:47
In this lecture, you will learn why we use Google Colab for Python programming and how it makes learning easier, especially for beginners.
Important Message: Scope of This Course and What You Can Do Next0:34
In this lecture, we clarify the scope of the course and set clear expectations. This course is designed for beginners, so the focus is on building a strong foundation rather than covering all advanced models in detail. You will understand what is included, what is not, and how to move forward after completing the course. By the end, you will be confident in using the Help menu to explore and analyze a wide range of additional models on your own.

Variables and Data Types in Python1:57
Learn the fundamentals of Python by understanding variables and different data types. This lecture explains how data is stored and used in Python with simple, real-world examples.
Data Types in Python (Hands-on Practice)4:20
Practice working with common Python data types including integers, floats, strings, and booleans. This session focuses on practical examples to build a strong foundation.
Operators in Python - Overview0:38
Understand how operators work in Python. This lecture introduces arithmetic, relational, and logical operators with clear explanations for beginners.
Arithmetic Operators in Python (Practical Guide)2:05
Learn how to perform mathematical operations in Python using arithmetic operators. Includes hands-on examples for addition, subtraction, multiplication, and division.
Relational Operators in Python (Comparison Made Easy)2:45
Explore how to compare values in Python using relational operators. Understand conditions like greater than, less than, and equality with real examples.
Logical Operators in Python (Practical Examples)1:10
Master logical operators such as AND, OR, and NOT in Python. This lecture uses simple examples to explain how decisions and conditions work in real data analysis.

Introduction to Data Import and Dataset Overview3:00
In this lecture, you will get an overview of how data is imported and explored in Python using Google Colab. We will introduce the workflow of loading datasets, understanding their structure, and preparing for analysis.

Python data import, Google Colab tutorial, pandas introduction, dataset overview, beginner Python data analysis
Importing CSV Data in Python (Google Colab)2:22
Learn how to import CSV files into Python using the pandas library in Google Colab. This lecture covers file upload methods and how to load data into a DataFrame for analysis.
read_csv pandas, import CSV Python, Google Colab CSV, pandas tutorial, Python data analysis
Importing Excel Data in Python (Google Colab)1:53
This lecture explains how to import Excel files into Python. You will learn how to read Excel sheets using pandas and prepare your data for further analysis.
read_excel pandas, import Excel Python, pandas Excel tutorial, Google Colab Excel, data analysis Python
Understanding Dataset Structure (Shape, Columns, Data Types)2:41
Understand the structure of your dataset by checking its dimensions, column names, and data types. This lecture introduces key pandas functions to explore and understand your data.
df.shape pandas, df.columns pandas, df.dtypes, pandas info function, dataset structure Python
Basic Summary Statistics in Python3:58
Generate summary statistics such as mean, standard deviation, and distribution of variables using pandas. This helps you quickly understand the characteristics of your dataset.
Checking Missing Values and Duplicates2:36
Learn how to identify missing values and duplicate records in your dataset using pandas. This is an essential step to ensure data quality before analysis.
Find the Missing Value4:05
In this video, you will learn how to identify missing values in your dataset using Python's Pandas library. We will show you simple commands to check for null or missing data, count them, and understand how they may affect your data analysis process.
Rename Variable2:39
In this video, you will learn how to rename variables (columns) in your dataset using Python’s Pandas library in Google Colab. Renaming variables helps make your data more understandable and easier to work with during analysis. We will cover different methods to rename single or multiple columns efficiently.
Python for Data Encoding8:32
You've learned the basics of Python and pandas—now you're ready for the real-world challenge of data science. The most common roadblock? Categorical data.
Machine learning models cannot understand text like "Male" or "New York." You need to transform this data into a numerical format, and this process is called data encoding. This course is your complete, hands-on guide to mastering the essential data encoding techniques that every data scientist and machine learning engineer must know.
We will go beyond the basics, exploring various encoding methods and learning when to use each one to get the best performance from your models. By the end of this course, you will be able to confidently handle any type of categorical data, preparing your datasets for powerful machine learning algorithms.
Variable Transformation in Python4:28
Raw data is rarely ready for analysis or machine learning. In the real world, you'll need to clean, format, and transform your data into a usable format. This process, known as variable transformation, is a critical skill that separates a beginner from a professional data scientist.
This course is your complete guide to mastering variable transformation in Python. We will move beyond simple data loading and dive into the practical techniques used by data professionals every day. From creating new features like BMI to handling messy dates and categorizing data, you'll learn to prepare any dataset for a powerful machine learning model.
We'll use industry-standard libraries like pandas and NumPy to efficiently manipulate your data. By the end of this course, you'll have the confidence to tackle real-world datasets and make them "machine-learning ready."

Uploading Data from a Computer Folder to Google Drive2:38
In this lecture, you will learn how to upload a dataset from your computer folder to Google Drive. This is the first step before importing the data into Google Colab for analysis.
For the lecture title, your current title is good:
Uploading Data from a Computer Folder to Google Drive
For resources, you can upload the sample dataset if you want students to practice with the same file. If the dataset is already shown in the video, adding it as a resource is helpful.
Importing Data from Google Drive into Google Colab Using Python7:24
In this lecture, students will learn how to import data from Google Drive into Google Colab using Python. This step helps them access their dataset directly inside Colab for data analysis.

Creating Histograms in Python | A Google Colab3:48
#-----
import pandas as pd
df=pd.read_csv('/content/MyData.csv')
#----
import matplotlib.pyplot as plt
plt.hist(df['momheight'])
plt.show()
#-----
import matplotlib.pyplot as plt
plt.figure(figsize=(16,9))
sns.histplot(df['momheight'], bins=10, kde=True, color='blue')
plt.xlabel('Maternal body height, cm')
plt.ylabel('Frequency')
plt.show()

#-----

Histograms are an essential tool for any data analyst or scientist. They provide a powerful way to understand the distribution of your data, helping you uncover patterns, identify outliers, and make informed decisions.
In this video, you will learn to master the art of creating effective histograms using Python's matplotlib library within the collaborative environment of Google Colab. We'll cover everything from the basics of what a histogram is and why it's so useful, to the practical steps of writing the code.
By the end of this lesson, you will be able to:
Understand what a histogram is and when to use one.
Write Python code to create basic histograms from scratch.
Customize your histograms by adjusting the number of bins, colors, and labels for maximum clarity.
Interpret the results of your histograms to gain insights into your data's frequency and distribution.
This hands-on guide will equip you with a fundamental data visualization skill, empowering you to better explore and present your datasets. No complex setup is required—just a browser and your Google account.
Creating Box Plots in Python | A Google Colab Guide2:17
#----

plt.figure(figsize=(16,9))
sns.boxplot(df['income'], color='red')
plt.title('Distribution of Maternal Height')
plt.xlabel('Maternal Height (cm)')
plt.show()

#---

Box plots, also known as box-and-whisker plots, are a powerful data visualization tool that provides a concise summary of a dataset's distribution. They are indispensable for quickly identifying key statistical measures and spotting potential outliers.
In this video, you will learn to create and interpret box plots using Python's matplotlib and seaborn libraries within the accessible Google Colab environment. We'll start with the fundamentals of what a box plot represents, including the median, quartiles, and whiskers. You will then move on to writing the code to generate your own plots.
By the end of this lesson, you will be able to:
Understand the key components of a box plot: the median, interquartile range (IQR), and outliers.
Use Python to generate box plots from your datasets.
Customize the appearance of your box plots for better clarity and presentation.
Interpret box plots to understand the spread and skewness of your data.
This hands-on guide will give you a core data visualization skill, enabling you to perform exploratory data analysis more effectively.
Creating Box Plots by another variables in Python | A Google Colab Guide8:26
Learn how to explore and summarize data in Python using boxplots. Understand distributions, detect outliers, and visualize patterns with clear labeling and hands-on examples.
Violin Plots in Python | Google Colab1:29
#---
plt.figure(figsize=(16,9))
sns.violinplot(df['momheight'], color='green')
plt.title('Distribution of Maternal Height')
plt.xlabel('Maternal Height (cm)')
plt.show()
#--

Violin plots are a powerful data visualization tool that combines the benefits of a box plot with a kernel density estimate. They are excellent for visualizing the distribution of numerical data and are particularly useful for comparing distributions across different groups.
In this video, you will learn to create and interpret violin plots using Python's seaborn and matplotlib libraries within the convenient Google Colab environment. We'll start by understanding what a violin plot reveals, including the median, quartiles, and the full shape of the data distribution.
Pie Charts Made Easy2:36
Learn how to create simple and clear pie charts to visualize your data quickly and effectively

#----

import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt
#
df=pd.read_csv('/content/MyData.csv')
df.columns
#
wealth_counts=df['wealth'].value_counts()
plt.pie(wealth_counts, labels=wealth_counts.index, autopct='%1.1f%%', startangle=90)
plt.title('Wealth Distribution')
plt.show()
Simple Bar Graphs1:45
Master creating bar graphs to show numbers, counts, and estimates effectively

#---

import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt
#
df=pd.read_csv('/content/MyData.csv')
df.columns

#

plt.figure(figsize=(8, 6))
sns.countplot(data=df, x='wealth', order=df['wealth'].value_counts().index, palette='viridis')

plt.title('Wealth Category Distribution')
plt.xlabel('Wealth Category')
plt.ylabel('Count')
plt.show()
Heatmaps in Python: Visualize Your Data Like a Pro5:44
Discover how to turn raw data into meaningful visuals. Create correlation heatmaps, customize labels, and understand patterns in your datasets.

Showing the Sample Boxplot We Will Create1:09
In this lecture, students will first see a sample boxplot that will be created during the course. This will help them understand the final output before learning the step-by-step process of creating a publication-ready boxplot in Python.
Importing a New Dataset in Python for Publication-Ready Boxplots3:21
In this lecture, students will learn how to import a new dataset in Python for creating publication-ready boxplots. They will understand how to load the data, check the dataset structure, and prepare the variables needed for boxplot visualization.
Writing Python Code for a Boxplot3:52
In this lecture, students will learn how to write Python code to create a boxplot. They will use the imported dataset, select the required variables, and generate a clear boxplot for data visualization. This lecture will help students understand the basic coding steps before improving the plot for publication-ready presentation.
Data type of Main Variable for a Boxplot
Formatting the Boxplot in Python: Properties6:24
In this lecture, students will learn how to format important boxplot properties in Python. They will customize elements such as figure size, box width, colors, line style, marker style, and overall appearance to make the boxplot more professional and publication-ready.
Axis and Legend Labels in Python Boxplots10:56
In this lecture, students will learn how to customize axis labels and legend labels in Python boxplots. They will understand how clear labels make the boxplot easier to read, interpret, and present in a publication-ready format.

Univariate Descriptive Statistics | mean, SD, and median, Frequency, Proportion6:15
Learn how to calculate and interpret mean, standard deviation, and median for numerical variables, as well as frequency and proportion for categorical variables using Python.
Bi-variate Descriptive Statistics | Cross tab & group mean or median1:59
Learn how to explore the relationship between two variables in Python using cross-tabulation, frequency, and proportion for categorical data, as well as comparison of means and medians across groups for numerical data.
Bi-variate Analysis with P-Values | Exporting the findings4:40
Learn how to compare two variables in Python using bivariate analysis. This includes cross-tabulation with frequency and proportion for categorical data, group comparisons of mean, standard deviation, and median for numerical data, and significance testing with p-values.

Simple Logistic Regression: An Introduction and Practical Example8:22
In this video, we will introduce you to Simple Logistic Regression, one of the most commonly used techniques in machine learning for binary classification. We will walk you through the theory behind logistic regression, followed by a hands-on example using Python and Google Colab. By the end of this video, you will understand how to implement logistic regression in real-world data analysis projects.
Simple and Multiple Logistic Regression with Table Preparation (Odds Ratio)8:54
In this video, we'll walk you through the process of implementing Simple and Multiple Logistic Regression models in Python using Google Colab. We’ll start by preparing a table with the necessary data, followed by training the models for binary classification. You'll learn how to clean and prepare data for logistic regression, how to build both simple and multiple models, and how to evaluate their performance. We'll also cover the interpretation of coefficients, odds ratios, and key metrics such as accuracy, confusion matrix, and ROC curve. This hands-on guide will help you develop the skills to apply logistic regression to your own datasets and research projects.

Overview of Machine Learning2:45
Master data science and machine learning using Python in Google Colab. Learn data analysis, visualization, statistical modeling, regression, and modern machine learning techniques. Hands-on projects, practical coding, and real-world examples make this course ideal for beginners, students, and researchers.
Python data science, Google Colab for beginners, Data analysis with Python, Machine learning in Python, Biostatistics with Python
Tabular Data Analysis & Machine Learning with Python in Google Colab1:03
Master tabular data analysis and machine learning using Python in Google Colab. Learn how to clean, explore, visualize, and model real-world datasets (CSV/Excel). Build predictive models with regression, classification, and evaluation techniques. Designed for beginners, students, and researchers who want hands-on experience in applied data science.
Tabular data analysis Python, Google Colab for data science, Machine learning with tabular data, CSV Excel data in Python, Data preprocessing and cleaning Python
Data Management for Tabular Data6:27
Learn data management and machine learning step by step using Python in Google Colab. This course teaches how to import, clean, transform, and manage tabular data (CSV/Excel), followed by visualization, statistical analysis, regression, and predictive modeling. Ideal for beginners, students, and researchers working with real-world datasets.
Importing Data in Python for Machine Learning (Google Colab)5:41
In this lecture, you will learn how to import datasets into Python using Google Colab. We will cover reading Excel files with Pandas, exploring the data structure, and preparing it for machine learning models like Random Forest.
One-Hot Encoding for Machine Learning in Python3:57
In this lecture, you will learn how to transform categorical variables into numerical values using one-hot encoding. We will use Pandas and Scikit-Learn in Google Colab to create dummy variables and prepare datasets for Random Forest and other machine learning models.
Defining Class and Features in Machine Learning4:25
In this lecture, you will learn how to define the class variable (target) and the features (independent variables) in a dataset. Using Python and Google Colab, we will separate input variables from the output variable to prepare data for Random Forest and other machine learning models.
Splitting Data for Machine Learning in Python (Google Colab)4:02
In this lecture, you will learn how to split your dataset into training and testing sets using Python and Google Colab. We will use Scikit-Learn’s train_test_split function to ensure your Random Forest and other machine learning models are trained and evaluated properly.
Machine Learning Common Libraries or Packages in python (Google Colab)2:57
n this lecture, you will discover how to use the Random Forest library in Python inside Google Colab. You will practice importing the library, building a Random Forest model, and understanding how this machine learning technique improves accuracy with ensemble learning.

Building a Random Forest Model in Python Google Colab9:07
Learn how to create and train a Random Forest model in Python using Scikit-Learn. This lecture guides you step by step through model initialization, training with your dataset, and preparing it for evaluation in Google Colab.
Confusion Matrix in Python Python Google Colab for Random Forest3:08
Learn how to assess your machine learning model with a confusion matrix. This lecture explains how to create and interpret a confusion matrix in Python for Random Forest, helping you measure accuracy, precision, recall, and overall performance.
Identify Key Features with Random Forest in Python Google Colab4:52
Learn how to analyze and rank the importance of features in your dataset using a Random Forest model. This lecture shows how to use Python and Google Colab to interpret which variables impact predictions the most.
Final Model Evaluation with Sensitivity & Specificity3:08
Learn how to assess the performance of your Random Forest model with sensitivity (recall) and specificity. This lecture demonstrates how to calculate these metrics from your predictions in Python using Google Colab, helping you interpret your model’s strengths and weaknesses.
ROC Curve for Random Forest in Python Google Colab3:40
In this lecture, you will learn how to generate and interpret the ROC curve for a Random Forest model in Python using Google Colab. We will cover the steps of fitting the model, plotting the ROC curve, and calculating the Area Under the Curve (AUC) to evaluate model performance. By the end, you will understand how to use ROC curves to assess classification accuracy.
Precision–Recall Curve in Python Google Colab2:33
In this lecture, you will learn how to plot and interpret the Precision–Recall curve for classification models in Python using Google Colab. We will explain the meaning of precision and recall, show how to generate the curve, and calculate the Area Under the Curve (AUC-PR). By the end, you will understand how to evaluate model performance on imbalanced datasets using the Precision–Recall curve.

Requirements

A computer or laptop with a stable internet connection (Google Colab runs online)
No prior programming or data analysis experience needed
A willingness to learn and a positive mindset
A free Google account to access Google Colab and Google Drive

Description

Are you a complete beginner who has never written a line of code? This course is designed specifically for absolute beginners who want to learn Python from scratch and take their first step into data science—without fear, confusion, or complex theory.

You will learn Python using Google Colab, a free, browser-based platform that requires no installation or setup. If you can open a web browser, you can start coding immediately.

This course is ideal for:

Students with no programming background
Public health, social science, and research beginners
Anyone curious about data science but unsure where to start
Learners who feel Python courses are “too advanced”

We begin with the absolute basics, explained slowly and clearly:

What Python is and how it works
Writing your first Python code
Variables, numbers, and text (strings)
Lists and dictionaries (explained with simple examples)
Basic operators and comparisons

You will then gently move into:

Simple if–else conditions
Easy loops with real-life logic
Reading CSV files step by step
Understanding datasets (rows, columns, values)
Handling missing values in a beginner-friendly way

Throughout the course, you will practice by typing and running code yourself, so learning feels natural and confidence-building.

You will also learn how Google Colab helps beginners by offering:

Instant code execution
Clear error messages
Automatic saving
Easy access to Google Drive

By the end of this course, you will:

Understand Python basics clearly
Feel confident reading and writing simple Python code
Be ready to continue learning pandas, NumPy, or data analysis

Important: This is a very beginner-level course. It is not a complete Python programming course and does not cover advanced topics. Its purpose is to give you a strong, stress-free foundation to move forward confidently.

Who this course is for:

Absolute beginners/Intermediate who want to learn Python programming from scratch
Anyone interested in data analysis using Python and Google Colab
Students, professionals, or hobbyists looking to start their journey in data science or machine learnin

Python for Beginners: Learn Data Science in Google Colab

What you'll learn

Explore related topics

Course content

Introduction to Python Using Google Colab4 lectures • 8min

Python Fundamentals: Variables, Data Types, and Operators6 lectures • 13min

Data Import and Basic Data Checking in Python10 lectures • 36min

Importing Data from Google Drive2 lectures • 10min

How to Create Graphs in Python using Google Colab7 lectures • 26min

Publication-Ready Boxplots in Python5 lectures • 26min

Descriptive statistics and inferential Statistics3 lectures • 13min

Estimating Odds Ratios Using Logistic Regression Analysis | Biostatistics2 lectures • 17min

Introduction to Machine Learning - Definition, Data description and objective8 lectures • 31min

Machine Learning Technique: Random Forest in Python Google Colab6 lectures • 26min

Requirements

Description

Who this course is for: