Teach on Udemy

Turn what you know into an opportunity and reach millions around the world.

Learn More

Your cart is empty.

Keep shopping

Data Science Mastery with Python: Comprehensive course

Name: Data Science Mastery with Python: Comprehensive course
Rating: 4.7 (588 reviews)

Transform Data into Insights with Python, Machine Learning, and Advanced Analytics – Gateway to a Data-Driven Career

Created bySelfcode Academy

Last updated 6/2024

English

What you'll learn

Perform high-level mathematical and technical computing using the NumPy and SciPy packages and data analysis with the Pandas package
Gain an in-depth understanding of Data Science processes: data wrangling, data exploration, data visualization, hypothesis building, and testing
Master the essential concepts of Python programming, including data types, tuples, lists, dicts, basic operators, and functions.
Apply knowledge and actionable insights from data across a broad range of application domains.

Course content

10 sections • 81 lectures • 20h 13m total length

Getting Started with Data Science12:05

Let's Start with Statistics10:14
Data Quality Issues5:06
Types of Statistics7:51
Explore descriptive statistics, including measures of central tendency, spread, and shape, and learn how mean, median, and mode summarize data and reveal outliers.
Measures of Spread9:20
Explore measures of spread—range, interquartile range, variance, and standard deviation—to assess data variability, identify outliers, and monitor control systems.
Measures of Shapes5:36
Explore measures of shape to analyze the distribution's symmetry, left and right skewness, and the normal distribution; compare mean, median, and mode, and assess outliers with practical examples.
Plots Visualisation7:58
Inferential Statistics4:04
Probability10:28
Explore the core concept of probability, estimating event likelihood from prior information, with examples on rain, stock trends, cricket match outcomes, loan decisions, and independent versus dependent events.
Conditional Probability2:17
Random Variables5:05
Normal Probability Distribution9:02
Central Limit Theorem4:16
Hypothesis Testing for Decision Making7:40
Explore hypothesis testing for decision making by examining null and alternative hypotheses, sufficient evidence, p-values, and real-world examples from drug trials, court rulings, manufacturing, and baseball.

Python for Data Science4:45
Python Installation - Google Collab6:42
Python Basics3:11
Identifiers in Python5:03
Comments in Python5:48
Learn to add clear Python comments with hash for single lines and triple quotes for multi-line notes, and use docstrings to document functions.
Python Indentation5:53
Explore how Python uses indentation to define code blocks with four spaces. Learn to write readable, error-free code and recognize indentation rules in loops and ranges.
Python Statements3:44
Variables in Python7:11
Understand how Python stores values in variables using identifiers and assignment operator. See integers, decimals, and strings, plus multiple assignment, and note that different names can share the same memory.
Data Types & Related Stuffs in Python33:32
Conversion of Data Types in Python10:03
Python I/O functions2:25
Output Formatting11:48
Master output formatting in Python by formatting strings, printing with proper spacing and line breaks, and exploring practical examples that demonstrate clean, readable console output.
User Input in Python5:38
Operators in Python23:22
Control Flow in Python52:51
Functions in Python20:43
Types of Functions in Python39:12
Recursive Functions in Python13:14
Argument in a Function14:57
Lambda or Anonymous Functions in Python8:29

Advance Programming in Python56:36
Advance Programming in Python: Part 21:34:39
Data Visualisations34:20
Bivariate Plotting29:38
Master bivariate plotting to uncover relationships between two variables using scatterplots and linear regression. Interpret positive, negative, and neutral correlations and visualize them with heatmaps and KDE.
Multivariate Plotting47:11
Explore multivariate plotting techniques for categorical and numerical data, using scatter plots, grouped box plots, heat maps, and parallel coordinates, with data standardization, normalization, and interactive visualization options.

Linear Regression31:11
How to use Linear Regression44:17
Learn to use linear regression to predict sales from tv, radio, and newspaper ads, with data preparation, assumptions, standardization, train-test split, and MAE, MSE, RMSE, and R-squared evaluation.
Logistic Regression30:39
Logistic Regression on Titanic Data Set19:07
Preprocess Titanic data with missing-value handling and feature engineering (family size, gender class), then train a logistic regression model and evaluate with accuracy and confusion matrix.
Decision Tree4:27
Algorithms used in Decision Treee17:17
Gini Index9:52
Issues with Decision Tree6:12
Identify issues in decision trees, such as underfitting and overfitting, and how training data accuracy, bias, and variance influence splitting, leaf nodes, and pruning.
Applications of Decision Tree2:13
Explore how decision trees predict customer churn, support retention, fraud detection, and credit risk scoring. See how amount, transactions per day, area code, and age form leaf node rules.
Working on Titanic Data Set10:12
Random Forest3:55
Types of Random Forest0:53
Explore how a random forest uses multiple decision trees to tackle classification and regression tasks, employing majority voting for classification and averaging for regression.
Why Random Forest1:52
Understand why random forest uses an ensemble of decision trees to reduce overfitting and achieve high accuracy, even with missing data, for classification and regression.
Application of Random Forest2:40
Explore how random forest uses multiple decision trees to identify loyal and fraud customers in banking, with references to C4.5, information entropy, and pruning low-importance branches.
Random Forest Implementation on Titanic Data Set8:37
Explore random forest implementation on the Titanic dataset, including preprocessing, feature engineering, and model tuning with grid and randomized searches to boost accuracy.
Model Evaluation Technique5:28
Concept of R-Squared4:18
Linear Regression8:54
Classification4:08
Evaluate classification performance using accuracy, confusion matrix, precision and recall, and learn when accuracy fails on imbalanced data through the classification report.
Confusion Matrix6:21
Learn how a confusion matrix summarizes classification results with true/false positives and negatives, and balance false positives and false negatives for tasks like medical screening or spam filtering.
Recall / Sensitivity / True Rate of Positive4:08
FB score7:48
Balance precision and recall with the F1 score, a harmonic mean reflecting both metrics for your business case, and explore specificity and false positive rate.
AUC/ ROC curve6:35
Model Evaluation recall Curve11:42
Learn how to build a credit card transaction model with random forest, address data imbalance, and evaluate with precision, recall, and AUC to minimize false negatives.

Big Data11:54
Explore big data concepts, learn the five Vs: volume, variety, value, velocity, veracity, and how Hadoop, MapReduce, and data warehouses enable storage, processing, and insights for data science.
Intro to Hadoop8:10
Intro to Tableu10:50
Explore data visualization with Tableau to create interactive dashboards and visual analytics that drive storytelling, enable instant insights, and support fast, data-driven business decisions across structured and unstructured data.
Intro to Business Analytics9:00
Explore how business analytics transforms data into wisdom through the lifecycle from data collection to decision making, and master descriptive, diagnostic, predictive, and prescriptive methods.

Project: Part 1: Let's get our system ready15:36
Kick off the data science lifecycle with data abstraction by installing and configuring MongoDB (local or Atlas), connecting via Python, and exploring telecom data as document-based collections.
Project: part 29:02
Project: Part 317:23
Project: part 415:43
Project: Let's Finalise it16:36
This project guides you through deploying a churn-prediction model with Flask, linking frontend and backend, using a virtual environment, and testing with a live app link.

Requirements

An understanding of the fundamentals of Python programming
Basic knowledge of statistics

Description

Today Data Science and Machine Learning are used in almost every industry, including automobiles, banks, health, telecommunications, telecommunications, and more.

As the manager of Data Science and Machine Learning, you will have to research and look beyond common problems, you may need to do a lot of data processing. test data using advanced tools and build amazing business solutions. However, where and how will you learn these skills required in Data Science and Machine Learning?

DATA SCIENCE COURSE-OVERVIEW

Getting Started with Data Science

Define Data
- Why Data Science?
- Who is a Data Scientist?
- What does a Data Scientist do?
- The lifecycle of Data Science with the help of a use case
- Job trends
- Data Science Components
- Data Science Job Roles

Math Basics
- Multivariable Calculus
- Functions of several variables
- Derivatives and gradients
- Step function, Sigmoid function, Logit function, ReLU (Rectified Linear Unit) function
- Cost function
- Plotting of functions
- Minimum and Maximum values of a function

Linear Algebra
- Vectors
- Matrices
- Transpose of a matrix
- The inverse of a matrix
- The determinant of a matrix
- Dot product
- Eigenvalues
- Eigenvectors

Optimization Methods
- Cost function/Objective function
- Likelihood function
- Error function
- Gradient Descent Algorithm and its variants (e.g., Stochastic Gradient Descent Algorithm)

Programming Basics

R Programming for Data Science
- History of R
- Why R?
- R Installation
- Installation of R Studio
- Install R Packages.
- R for business
- Features of R
- Basic R syntax
- R programming fundamentals
- Foundational R programming concepts such as data types, vectors arithmetic, indexing, and data frames
- How to perform operations in R including sorting, data wrangling using dplyr, and data visualization with ggplot2
- Understand and use the various graphics in R for data visualization.
- Gain a basic understanding of various statistical concepts.
- Understand and use hypothesis testing method to drive business
- decisions.
- Understand and use linear, non-linear regression models, and
- classification techniques for data analysis.
- Working with data in R
- Master R programming and understand how various statements are executed in R.

Python for Data Science
- Introduction to Python for Data Science
- Introduction to Python
- Python Installation
- Python Environment Setup
- Python Packages Installation
- Variables and Datatypes
- Operators
- Python Pandas-Intro
- Python Numpy-Intro
- Python SciPy-Intro
- Python Matplotlib-Intro
- Python Basics
- Python Data Structures
- Programming Fundamentals
- Working with data in Python
- Object-oriented programming aspects of Python
- Jupyter notebooks
- Understand the essential concepts of Python programming such as data types, tuples, lists, dicts, basic operators and functions
- Perform high-level mathematical computing using the NumPy package and its vast library of mathematical functions
- Perform scientific and technical computing using the SciPy package and its sub-packages such as Integrate, Optimize, Statistics, IO, and Weave
- Perform data analysis and manipulation using data structures and tools provided in the Pandas package
- Gain an in-depth understanding of supervised learning and unsupervised learning models such as linear regression, logistic regression, clustering, dimensionality reduction, K-NN and pipeline
- Use the matplotlib library of Python for data visualization
- Extract useful data from websites by performing web scraping using

Python

Integrate Python with MapReduce

Data Basics

Learn how to manipulate data in various formats, for example, CSV file, pdf file, text file, etc.
Learn how to clean data, impute data, scale data, import and export data, and scrape data from the internet.
Learn data transformation and dimensionality reduction techniques such as covariance matrix plot, principal component analysis (PCA), and linear discriminant analysis (LDA).

Probability and Statistics Basics

Important statistical concepts used in data science
Difference between population and sample
Types of variables
Measures of central tendency
Measures of variability
Coefficient of variance
Skewness and Kurtosis
Inferential Statistics
Regression and ANOVA

Exploratory Data Analysis

Data visualization
Missing value analysis

Introduction to Big Data
Introduction to Hadoop
Introduction to Tableau
Introduction to Business Analytics
Introduction to Machine Learning Basics

Supervised vs Unsupervised
Time Series Analysis
Text Mining

Data Science Capstone Project

Science and Mechanical Data require in-depth knowledge on a variety of topics. Scientific data is not limited to knowing specific packages/libraries and learning how to use them. Science and Mechanical Data requires an accurate understanding of the following skills,

Understand the complete structure of Science and Mechanical Data

Different Types of Data Analytics, Data Design, Scientific Data Transfer Features and Machine Learning Projects

Python Programming Skills which is the most popular language in Science and Mechanical Data

Machine Learning Mathematics including Linear Algebra, Calculus and how to apply it to Machine Learning Algorithms and Science Data

Mathematics and Mathematical Analysis of Data Science

Data Science Data Recognition

Data processing and deception before installing Learning Machines

Machine learning

Ridge (L2), Lasso (L1), and Elasticnet Regression / Regularization for Machine Learning

Selection and Minimization Feature for Machine Learning Models

Selection of Machine Learning Model using Cross Verification and Hyperparameter Tuning

Analysis of Machine Learning Materials Groups

In-depth learning uses the most popular tools and technologies of today.

This Data Science and Machine Learning course is designed to consider all of the above, True Data Science and Machine Learning A-Z Course. In most Data Science and Machine Learning courses, algorithms are taught without teaching Python or this programming language. However, it is very important to understand language structure in order to apply any discipline including Data Science and Mechanical Learning.

Also, without understanding Mathematics and Statistics it is impossible to understand how other Data Science and Machine Learning algorithms and techniques work.

Science and Mechanical Data is a set of complex linked topics. However, we strongly believe in what Einstein once said,

"If you can't explain it easily, you didn't understand it well enough."

As a teacher, I constantly strive to reach my goal. This is one comprehensive course in Science and Mechanical Data that teaches you everything you need to learn Science and Mechanical Data using simple examples with great depth.

As you will see from the preview talks, some of the more complex topics are explained in simple language.

Some important skills you will learn,

Python Programming

Python is listed as the # 1 language for Data Science and Mechanical Data. It is easy to use and rich with various libraries and functions required to perform various Data Science and Machine Learning activities. In addition, it is the most widely used and automated language for the use of many Deep Learning frameworks including Tensorflow and Keras.

Advanced Mathematics Learning Machine

Mathematics is the foundation of Data Science in general and Learning Machines in particular. Without understanding the meanings of Vectors, Matrices, their operations and understanding Calculus, it is impossible to understand the basics of Data Science and Machine Learning. The Gradient Declaration of Basic Neural Network and Mechanical Learning is built on the foundations of Calculus and Derivatives.

Previous Statistics for Data Science

It is not enough to know only what you are saying, in the middle, the mode, etc. Advanced Techniques for Science and Mechanical Data such as feature selection, size reduction using PCA are all based on previous Distribution and Statistical Significance calculations. It also helps us to understand the operation of the data and use the appropriate machine learning process to get the best results from various Data Science and Mechanical Learning techniques.

Data recognition

As they say, the picture costs a thousand words. Data identification is one of the most important methods of Data Science and Mechanical Data and is used for Analytical Data Analysis. In that, we analyze the data visually to identify patterns and styles. We will learn how to create different sites and charts and how to analyze them for all practical purposes. Feature Selection plays an important role in Machine Learning and Visualization Data is its key.

Data processing

Scientific Data requires extensive data processing. Data Science and Machine Learning specialists spend more than 2/3 of their time analyzing and analyzing data. Data can be noisy and never in good condition. Data processing is one of the most important ways for Data Science and Mechanics to learn to get the best results. We will be using Pandas which is a well-known Python data processing library and various other libraries for reading, analyzing, processing and cleaning data.

Machine learning

Heart and Soul Data Science is a guessing skill provided by algorithms from the Deep Learning and Learning Machines. Machine learning takes the complete discipline of Data Science ahead of others. We will integrate everything we have learned in previous sections and build learning models for various machines. The key features of Machine Learning are not only ingenuity but also understanding of the various parameters used by Machine Learning algorithms. We will understand all the key parameters and how their values affect the outcome in order to build the best machine learning models.

Who this course is for:

For Complete Beginners to Data Sciecne, which will make you Hero in the Data Science Field.

Data Science Mastery with Python: Comprehensive course

What you'll learn

Explore related topics

Course content

Introduction1 lecture • 12min

Basic Maths Required for Data Science13 lectures • 1hr 29min

Python for Data Science20 lectures • 4hr 39min

Advance Python5 lectures • 4hr 22min

Let's dig deeper3 lectures • 2hr 43min

Let's Explore in to Machine Learning3 lectures • 26min

Module Seven24 lectures • 4hr 13min

Module Eight3 lectures • 16min

Featured Topics in Java4 lectures • 40min

Project: Telecom Churn Production5 lectures • 1hr 14min

Requirements

Description

Who this course is for: