Teach on Udemy

Turn what you know into an opportunity and reach millions around the world.

Learn More

Your cart is empty.

Keep shopping

Supervised Learning - Regression Models

Name: Supervised Learning - Regression Models
Rating: 4.8 (22 reviews)

Created byAISPRY TUTOR

Last updated 9/2023

English

English [Auto],

What you'll learn

Understanding the purpose and applications of regression models in various fields, such as economics, finance.
Exploring the basic concept of simple linear regression, where one dependent variable is modeled against a single independent variable
Understanding how to fit polynomial functions to data, allowing for nonlinear relationships between variables.
Extending the concepts of linear regression to multiple independent variables and learning how to interpret the coefficients of each predictor.

Course content

21 sections • 69 lectures • 13h 45m total length

Introduction About Tutor3:15
Introduces the tutor's profile, highlighting a 16-year background as an industrial revolution 4.0 implementer and data science leader across HSBC, ITC Infotech, Infosys, and Deloitte, with edtech and analytics ventures.

Agenda and stages of Analytics1:02
Identify the agenda and stages of analytics within the data science training program, and learn a practical project management methodology to gain a high-level view of real-world, real-time analytics projects.
What is Diagnoistic Analytics?1:21
Explore diagnostic analytics by answering why events occur, using covid case trends to illustrate how factors like lockdowns and vaccination explain rises and drops.
What is Predictive Analytics ?1:57
Learn how predictive analytics uses current data to forecast future outcomes, such as covid-19 cases, recoveries, and vaccination proportions, and consider how time horizon and changing conditions affect validity.
What is Prescriptive Analytics?11:41
Discover prescriptive analytics and how what-if analyses turn predictions into actions, as you explore descriptive, diagnostic, predictive, and prescriptive stages with real-world health and manufacturing examples.
What is CRISP-ML(Q)3:08
Master CRISP-ML(Q), the cross-industry standard process for data science, covering business and data understanding, data preparation, model building, evaluation, deployment, monitoring, and maintenance.

Business Understanding - Define Scope Of Application18:44
Define the scope and objective to minimize loan defaulters, then train a model with inputs x and output y to predict default and apply survival analytics under business constraints.
Business Understanding -Define Success Criteria8:13
Define business success criteria before applying machine learning, align objectives with key performance indicators such as loan default rates under 5%, and evaluate accuracy, performance, and return on investment.
Business Understanding - Use Cases9:59
Identify business objectives and use cases to balance fraud prevention with customer convenience, using multi-objective reasoning across credit card and agriculture scenarios like drone-driven precision farming.

Agenda Data Understanding0:49
Explore data understanding concepts, including data types, scales of measurement, essential terminology, and primary and secondary data collection techniques for supervised learning regression models.
Introduction to Data Understanding ?6:18
Analyze data, the basis for modeling, predictions, and optimization, to support management decisions and understand how what-if analysis tests levers like marketing spend and sales resources.
Data Types - Continuous Vs Discrete11:18
Compare continuous and discrete data, learn how decimals define continuity, and distinguish numeric from categorical data, with examples like time, money, height, weight, and count data.
Categorical Data Vs Count Data6:45
Differentiate categorical data from count data with binary and multiple category examples, including churn and default scenarios, and outline nominal, ordinal, interval, and ratio classifications.
Pratical Data Understanding using Realtime Examples11:15
Explore practical data understanding through real-time examples, identifying nominal, ordinal, interval, and ratio data with travel scenarios, temperatures, and prices.
Scale of Measurement3:34
Explore scale of measurement across nominal, ordinal, interval, and ratio data, highlighting counts, frequencies, percentages, and why ratio data enables broad statistical analysis for data science.
Quantitave Vs Qualitative5:04
Compare quantitative and qualitative data, distinguish continuous, count, and categorical data, and recognize structured versus unstructured data to inform decision making in regression modeling.
Structure Vs Unstructured Data13:04
Compare structured data in tabular form with unstructured data like videos, images, audio, text. Transform raw data into structured formats via frames, pixels, Mel frequency cepstral coefficient.
Big Data vs Non Big Data9:44
We compare big data and non big data using the three v's and discuss storage and compute needs for structured versus unstructured data, with SQL and NoSQL options.

What is Data Collection?4:12
Explore data collection, distinguishing primary and secondary data sources, and differentiate output variables (response, dependent) from input variables (explanatory, predictors) in structured data for machine learning.
Understanding Primary Data Sources22:15
illustrates how primary data sources and outwards data improve decision making in credit risk, using bank loan example and facebook data, while highlighting data privacy and secondary vs primary data.
Understanding Secondary Data Sources13:31
Explore secondary data sources, distinguish primary from secondary data, and learn to blend internal data with Google Maps and drone analytics to improve data-driven telecom insights.
Understanding Data Collection Using Survey6:46
Examine how to collect data using surveys to diagnose business realities, identify root causes, and frame decision problems into clear research objectives. Break down a multidimensional problem into one-dimensional aspects and craft survey questions on time, constraints, and strengths to assess customer preferences, purchase intentions, and price elasticity.
Understanding Data Collection Using DoE7:15
Learn how design of experiments guides data collection for marketing trials, testing discount levels, expiry, and customer radius to optimize coupon redemption.
Understanding possible errors in Data Collection Stage16:21
Identify and mitigate errors in data collection, including random and systematic errors, faulty measuring devices, and measurement procedures to ensure unbiased, representative data for regression models.
Understanding Bias and Fairness5:17
Understand bias and fairness in supervised learning, ensure models yield fair results, and emphasize data collection with diverse data, avoiding race or gender as predictors in loan default prediction models.

Introduction to CRISP-ML(Q) Data preparation & Agenda2:08
Explore the crisp-ml(q) data preparation framework, outlining six phases, with phase one on business and data understanding, data types, data collection (secondary to primary), and errors.
What is Probability?5:33
Learn the probability formula: number of interested events divided by total events, with die-based examples and evaluations such as greater than three or smaller than four.
What is Random Variable?12:00
Define a random variable by splitting it into random and variable, then show a variable's output varies. Each value has a probability and probabilities sum to one, forming a distribution.
Understanding Probability and its Application, Probability Discussion13:17
Explore the power of probability, from understanding probability distributions to modeling a random variable like daily iPad sales, using discrete versus continuous data and table or graph representations.

Understanding Normal Distribution15:42
Explore the normal distribution as a continuous probability distribution, using heights or profits to illustrate its shape, area under the curve equals one, and zero probability for a single value.
What is Inferential Statistics?10:41
Explore inferential statistics by drawing inferences about a population from a sample using simple random sampling and sampling frames. Learn about hypothesis testing and compare parametric and nonparametric approaches.
Understanding Standard Normal Distribution & what is Z Scores?28:16
Understand the standard normal distribution and z-scores, including how mean and standard deviation shape the curve. Learn standardization with z = (x - mean) / standard deviation and sigma rules.
Understanding Measures of central tendency ( First moment business decision)26:45
Explore mean, median, and mode as first moment business decision measures, and compare population parameters with sample statistics, noting outliers and data contexts.
Understanding Measures of Dispersion ( Second moment business decision)10:54
Analyze measures of dispersion and the second moment to compare volatility across markets, use averages and bands to forecast profits, and identify outliers via control charts.
Understanding Box Plot(Diff B-w Percentile and Quantile and Quartile)6:17
Explore how box plots relate percentiles, quantiles, and quartiles, defining q1–q4 and their connections to min, median, and max with practical examples.
Understanding Graphical Techniques-Q-Q-Plot8:41
Use the normal q-q plot to assess normality by comparing sample quantiles with theoretical quantiles and standardized values; if points lie on a straight line, data are normally distributed.
Understanding about Bivariate Scatter Plot35:36
Understand how a bivariate scatter plot reveals the direction and strength of a relationship between two numeric variables, using R to gauge linear, nonlinear, and exponential patterns, outliers, and clusters.

Python Installation6:07
Download Python from python.org, install version 3.10.7 on any OS, note that Python is open source and free for individuals and organizations, and consider Anaconda for a nicer interface.
Anakonda Installation7:00
Learn how to download and install the Anaconda distribution across Windows, macOS, Linux, and Unix, with pre-installed libraries and OS independence that save data scientists setup time.
Understand about Anakonda Navigator, Spyder & Python Libraries24:30
Explore Anaconda Navigator, Spyder, and popular Python libraries (numpy, pandas, scikit-learn, matplotlib) for reading CSV data, executing code, and visualizing results in a practical learning workflow.
Understanding about Jupyter and Google Colab8:41
Explore how to launch Jupyter and Google Colab, run Python code, import pandas, read CSV files, and leverage Colab's hardware accelerators for faster data experiments.

Understanding Data Cleansing Typecasting10:32
Cleanse and organize data, perform typecasting to correct numeric and categorical types, and transform unstructured log data into structured formats using Python for data preprocessing.
Understanding Data Cleansing Typecasting using python15:42
Learn data cleansing and typecasting with Python and pandas, using read_csv and astype to prepare datasets for regression models by converting columns and validating data types.

Understanding Handling Duplicates10:48
Learn how to identify and handle duplicates using master data management and data quality concepts, including consolidating records, removing duplicate rows and columns, and recognizing high correlation.
Understanding Handling Duplicates using Python25:26
Learn how to identify and remove duplicate records in a dataset using Python and pandas, and explore keep parameters like first, last, and false to drop duplicates and clean data.

Requirements

Basic Mathematics: You should be comfortable with algebra, calculus (especially derivatives and integrals), and basic statistics.
Probability and Statistics: Familiarity with probability theory and basic statistical concepts is important for understanding the underlying principles of regression modeling.
Linear Algebra: Basic knowledge of linear algebra is helpful, as regression models often involve matrix operations and understanding concepts like vectors and matrices.
Programming Skills: Some regression modeling courses might require programming knowledge in a statistical language such as R or Python. Proficiency in data manipulation, visualization, and basic statistical analysis in these languages will be beneficial.

Description

The Comprehensive Regression Models course is designed to provide students with an in-depth understanding of regression analysis, one of the most widely used statistical techniques for analyzing relationships between variables. Through a combination of theoretical foundations, practical applications, and hands-on exercises, this course aims to equip students with the necessary skills to build, interpret, and validate regression models effectively. Students will gain a solid grasp of regression concepts, enabling them to make informed decisions when dealing with complex data sets and real-world scenarios. This course is intended for advanced undergraduate and graduate students, as well as professionals seeking to enhance their statistical knowledge and analytical abilities.

To ensure students can fully engage with the course material, a strong background in statistics and basic knowledge of linear algebra is recommended. Prior exposure to introductory statistics and familiarity with data analysis concepts (e.g., hypothesis testing, descriptive statistics) will be advantageous.

The Comprehensive Regression Models course empowers students to become proficient analysts and decision-makers in their academic and professional pursuits, making informed choices based on evidence and data-driven insights. Armed with this valuable skillset, graduates of this course will be better positioned to contribute meaningfully to research, policy-making, and problem-solving across various domains, enhancing their career prospects and their ability to drive positive change in the world.

Course Objectives:

Understand the Fundamentals: Students will be introduced to the fundamentals of regression analysis, including the different types of regression (e.g., linear, multiple, logistic, polynomial, etc.), assumptions, and underlying mathematical concepts. Emphasis will be placed on the interpretation of coefficients, the concept of prediction, and assessing the goodness-of-fit of regression models.
Regression Model Building: Participants will learn the step-by-step process of building regression models. This involves techniques for variable selection, handling categorical variables, dealing with collinearity, and model comparison. Students will be exposed to both automated and manual methods to ensure a comprehensive understanding of the model building process.
Model Assessment and Validation: Evaluating the performance and validity of regression models is crucial. Students will explore diagnostic tools to assess model assumptions, identify outliers, and check for heteroscedasticity.
Interpreting and Communicating Results: Being able to interpret regression results accurately and effectively communicate findings is essential. Students will learn how to interpret coefficients, measure their significance, and communicate the practical implications of the results to various stakeholders in a clear and concise manner.
Advanced Topics in Regression: The course will delve into advanced topics, including time series regression, nonlinear regression, hierarchical linear models, and generalized linear models. Students will gain insights into when and how to apply these techniques to tackle real-world challenges.
Real-world Applications: Throughout the course, students will be exposed to real-world case studies and examples from various disciplines such as economics, social sciences, healthcare, and engineering. This exposure will enable students to apply regression analysis in different contexts and understand the relevance of regression models in diverse scenarios.
Statistical Software: Hands-on experience is a critical aspect of this course. Students will work with popular statistical software packages (e.g., R, Python, or SPSS) to implement regression models and perform data analysis. By the end of the course, participants will have gained proficiency in using these tools for regression modeling.

Course Conclusion:

In conclusion, the Comprehensive Regression Models course offers an in-depth exploration of regression analysis, providing students with the necessary tools and knowledge to utilize this powerful statistical technique effectively. Throughout the course, students will gain hands-on experience with real-world datasets, ensuring they are well-equipped to apply regression analysis to a wide range of practical scenarios. By mastering regression techniques, students will be prepared to contribute to various fields, such as research, business, policy-making, and more, making data-informed decisions that lead to positive outcomes. Whether pursuing further studies or entering the workforce, graduates of this course will possess a valuable skillset that is highly sought after in today's data-driven world. As the demand for data analysis and predictive modeling continues to grow, this course will empower students to become proficient analysts and problem solvers, capable of making a significant impact in their respective domains.

By the end of this course, participants will be able to:

Understand the theoretical underpinnings of various regression models and their assumptions.
Build and validate regression models using appropriate techniques and tools.
Interpret regression results and communicate findings to different stakeholders.
Apply regression analysis to solve complex problems in diverse fields.
Confidently use statistical software for data analysis and regression modeling.

Who this course is for:

Students and Researchers: Those studying or conducting research in fields like statistics, economics, social sciences, and data science, where regression analysis is a fundamental tool.
Data Analysts and Data Scientists: Professionals who work with data and want to gain a deeper understanding of regression techniques to analyze relationships between variables and make predictions.
Individuals working in business analytics, marketing, finance, or any other domain where data-driven decision-making is crucial.
Those with programming skills who want to expand their knowledge to include regression modeling for data analysis and prediction tasks.

Supervised Learning - Regression Models

What you'll learn

Explore related topics

Course content

Introduction1 lecture • 3min

Introduction About Basics5 lectures • 19min

Business Understanding Phase3 lectures • 37min

Data Understanding Phase - Data Types9 lectures • 1hr 8min

Data Understanding Phase - Data Collection7 lectures • 1hr 16min

Understanding Basic Statistics4 lectures • 33min

Data Preparation Phase - Exploratory Data Analysis (EDA)8 lectures • 2hr 23min

Python Installation and Setup4 lectures • 46min

Data Preparation Phase | Data Cleansing- Type Casting2 lectures • 26min

Data Preparation Phase | Data Cleansing- Handling Duplicates2 lectures • 36min

Requirements

Description

Who this course is for: