Predictive Analytics & Modeling: R | Minitab | SPSS | SAS

Rating: 4.4 (24 reviews)

Master predictive analytics and become a data expert with our all-inclusive course on R, Minitab, SPSS, and SAS!

Created by EDUCBA Bridging the Gap

Last updated 7/2024

English

What you'll learn

Data Importing and Preparation: Learn how to import, clean, and prepare datasets in R, Minitab, SPSS, and SAS for predictive analysis.
Information Value (IV) Calculation: Understand how to calculate Information Value (IV) and use it to assess the predictive power of variables in R
Model Building and Optimization: Gain proficiency in building and optimizing logistic regression models, decision tree models, and other predictive models
Data Visualization: Master data visualization techniques using tools like ggplot2 in R and various plotting options in Minitab, SPSS, and SAS
Descriptive Statistics and Graphical Representations: Perform and interpret measures of dispersion, descriptive statistics, and create graphical presentations
Hypothesis Testing and ANOVA: Conduct hypothesis testing, ANOVA, and other statistical analyses to make informed decisions based on data.
Control Structures and Functions in R: Learn to write functions, use control structures, and implement loops in R programming for efficient data manipulation
Advanced Statistical Techniques: Apply advanced statistical techniques such as non-linear regression, logistic regression, and multivariate analysis
Predictive Modeling with SAS Enterprise Miner: Use SAS Enterprise Miner to build predictive models, select input data nodes, and perform variable selection
Hands-On Projects: Gain practical experience through hands-on projects, such as card purchase prediction in R, to reinforce learning and apply skills

Course content

9 sections • 360 lectures • 49h 46m total length

Overview of R Programming 2:04
Explore the R programming language and its packages for statistical analysis, data visualization, data science, and machine learning. Engage in hands-on exercises and real-world projects guided by industry experts.
Downloading and Installing R Studio 3:29
Download and install RStudio on your local machine after installing R by visiting rstudio.com and selecting the free desktop license for a single-user PC.
How to use R Studio 7:57
Navigate RStudio’s windows, set a working directory, write and run R scripts, and manage variables using the console, environment, history, files, plots, packages, and help tools.
How to use R Studio Continues 9:16
Learn to manage the working directory in R Studio with getwd and setwd, create and run R scripts, and load libraries or install packages like ggplot2.
R Studio Basics 1:45
Basic Data Type R 9:24
Explore the basic data types in R—numeric, integer, logical, character, and complex—along with type checking and conversions using class and as.numeric.
Vectors 10:04
More on Vector 9:16
Create and manipulate vectors in R, check length, access elements with indices or the colon operator, perform element‑wise arithmetic, and name vector members for easy retrieval.
Matrix 10:12
Matrix Continues 9:01
Explore matrix operations in R, including element-wise arithmetic, dimension checks with dim, transposing matrices, combining with cbind and rbind, and naming matrix rows and columns.
What is List 9:33
Explore how to create and manipulate lists in R, naming and accessing elements, combining lists, and handling mixed data types.
What is List Continues 5:28
Master list manipulation in R by removing elements with NULL, accessing specific members like mathematics within subjects, and merging lists with simple c() syntax.
Data Frame in R 9:22
Learn data frames in R: create with data.frame from equal-length vectors, expand with new elements, then access columns by name or with c for multiples, using the Boston data set.
Data Frame in R Sub Clip 4:57
Decision Making 10:34
Explore decision making in R through if, if else, and switch statements, with practical examples calculating employee bonuses based on months worked and base salary.
Conditional Statements 12:22
Loops in R 9:53
Master loops in R, including for, while, and repeat loops with break and next, to automate tasks and iterate through vectors efficiently.
Implementing Loop with Practical Examples 10:04
Explore practical for loops in R by calculating employee bonuses based on months of experience and salary, using a data frame with five employees and conditional logic.
While Loop 7:24
Demonstrate implementing the while loop in R and comparing it to the for loop, using a Fibonacci series as a practical example driven by a test expression.
Break Statement 11:37
Explore how break, next, and repeat loops control execution in R by terminating, skipping, and repeating iterations, with practical examples using for, while, and vector operations.
Functions 11:36
Learn to define functions in R, use built-in and user-defined functions, call functions with arguments, and set default values, illustrated by BMI, print, head, and arithmetic switch examples.
Alternative Loops 8:28
Learn to avoid loops in R by using vectorization, the vectorized if-else, and the apply family, with hands-on examples using normal distributions.
Alternative Loops Continue 9:05
User Define Function 9:11
Learn to write a user-defined function inside apply in R to add values to a data frame, comparing column-wise and row-wise results and using transpose.
Power of GGPLOT 3:09
Explore the power of ggplot2 in R for data exploration and visualizations, and learn to create three basic plots—scatter plots, line graphs, and histograms—using the ggplot object and geom layers.
GGPLOT 2 Visuals 8:51
Explore ggplot2 visuals by building dot plots, line graphs, and histograms from the mtcars and pressure data, using mpg, horsepower, temperature, and color by cylinders to reveal trends and distributions.
Use of Function 10:29
Explore the syntax of functions in R, including name, arguments, default values, and the body. Understand local versus global scope, lazy evaluation, and returning values by expression or return.
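
The While Loop lecture in this section builds a Fibonacci series with a loop driven by a test expression. A minimal sketch of that idea follows, written in Python rather than R purely for illustration. The function name `fibonacci_up_to` and its `limit` default are assumptions, not taken from the course.

```python
def fibonacci_up_to(limit=100):
    """Return the Fibonacci numbers that do not exceed `limit` (hypothetical helper)."""
    series = []
    a, b = 0, 1
    # The test expression `a <= limit` is re-checked before every pass,
    # so the loop stops as soon as the next term would exceed the limit.
    while a <= limit:
        series.append(a)
        a, b = b, a + b
    return series


print(fibonacci_up_to(10))
```

The default value for `limit` also illustrates the default-argument behaviour mentioned in the Functions lecture: calling `fibonacci_up_to()` with no argument uses 100.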

Introduction and Importing Dataset 9:27
Explore a propensity model to predict card purchases by importing a dataset, examining variables like gender, country, income, and scores, and evaluating model performance with event rate.
IV Calculation 8:30
Explore IV calculation in R by checking missing values and unique levels, computing information value with weight of evidence, and using backward elimination to refine a logistic regression model.
Plotting Variables 6:52
Explore categorical variables with ggplot2 by creating bar plots that compare card offer across gender and country region, encode categories, and interpret event rates around 15%.
Splitting 9:13
Split data into training and test sets with caTools, then explore binning via decision trees and weight of evidence binning, optionally scale numeric variables, and fit logistic regression.
Building Logistic Model 8:28
Making Optimal Model 11:33
Apply backward elimination with a GLM to identify key variables: country, income, holding balance, and credit score, and build an optimal model assessed by AIC and ROC AUC.
Making Lift Chart for Training Set 12:27
Explore building a lift chart or gain chart for the training set to evaluate predictive models, using deciles and quantiles to assess score distributions and performance.
Checking Model Performance 9:32
Explore decile-based model evaluation with lift and gain charts, calculate event and cumulative percentages for goods and bads, and assess training versus test set performance to refine targeting.
Model Performance in Test Set 9:12
Assess model performance on the test set by computing test scores, analyzing lift in the fourth decile and case, and evaluating out-of-time stability.
Saving Model in R 11:15
Save your logistic regression model in R and load it for scoring new data. Compare training and test performance, decile lift, and scores to assess model quality.
Fitting Decision Tree Model 7:51
Fit a decision tree model to compare its performance with logistic regression, noting that scaling is not required and missing values are handled automatically in training and test sets.
Fitting Decision Tree Model Continue 6:04
Fit and interpret a decision tree classifier using splits like preferred customer score < 0.5546 and estimated income to predict purchases, and visualize the model with plotting options.
Prediction of Decision Tree and Model Performance 4:15
Learn to generate predictions with a decision tree, interpret class probabilities, evaluate model performance with a confusion matrix and accuracy, and explore pruning and forests for better results.
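
The IV Calculation lecture in this section computes information value from weight of evidence. The sketch below shows one common convention, WoE = ln(% non-events / % events) per bin and IV as the sum of (% non-events − % events) × WoE. It is written in Python for illustration, not in the R used by the course. The function name, the input layout, and the choice of convention are assumptions, and the code assumes every bin has at least one event and one non-event.

```python
import math


def information_value(bins):
    """Compute IV for one variable (hypothetical helper).

    `bins` is a list of (events, non_events) counts, one pair per bin.
    Assumes no count is zero, so the logarithm is always defined.
    """
    total_events = sum(events for events, _ in bins)
    total_non_events = sum(non_events for _, non_events in bins)
    iv = 0.0
    for events, non_events in bins:
        pct_events = events / total_events
        pct_non_events = non_events / total_non_events
        # Weight of evidence for this bin under the assumed convention.
        woe = math.log(pct_non_events / pct_events)
        iv += (pct_non_events - pct_events) * woe
    return iv


# Two bins whose event shares mirror each other.
print(information_value([(30, 10), (10, 30)]))
```

A variable whose bins all have the same event rate yields an IV of zero, which is why IV is used to screen out variables with little predictive power.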

Overview and History of R 11:12
Explore the overview and history of R, from the S language roots to an open-source statistical programming language with CRAN and Bioconductor packages and active community.
Datatypes and Basic Operations - Part1_1 part 01 7:18
Explore explicit coercion in R with the as.* functions and NA results, then create, convert by dimension change, inspect, and bind matrices using matrix, dim, and cbind/rbind.
Datatypes and Basic Operations - Part1_1 part 02 5:37
Datatypes and Basic Operations - Part1_2 Part 01 5:41
Explore data types and basic operations in R, including numeric, character, integer, complex, and logical objects, vectors, lists, missing values, data frames, factors, and attributes, with assignment and print functions.
Datatypes and Basic Operations - Part1_2 Part 02 6:31
Explore evaluation and printing in R, including auto vs explicit print, console, and vector creation with colon notation and c() function across numeric, logical, character, integer, complex types, and coercion.
Datatypes and Basic Operations - Part1_2 Part 03_part01 6:26
Handle missing values in data with R by distinguishing NA from NaN, testing with is.na and is.nan, and working with data frames, vectors, and matrices.
Datatypes and Basic Operations - Part1_2 Part 03_part 02 summary 0:32
Explore data types in R such as numeric, logical, character, integer, and complex; examine vectors, lists, factors, missing values, and data frames, and learn how to name objects.
Datatypes and Basic Operations - Part2_1 10:45
Master subsetting in R for lists and matrices using single brackets, double brackets, and the dollar operator, explore partial matching and vectorized operations, and use drop to preserve matrix form when extracting values.
Datatypes and Basic Operations - Part2_2 6:29
Explore subsetting nested lists and vectors in R, including dollar versus double-bracket indexing, partial matching, removing NA values with complete.cases, and performing vectorized and matrix operations.
ReadingData-1 4:50
ReadingData-2 3:01
Set the working directory in R and read a sample txt file into a table; R infers types, places data in columns a to e, and skips hash lines.
ReadingData-3 7:04
Improve data handling in R by using read.table to load large tables, estimate memory, set colClasses to speed up reads, and load only top rows or in parts.
ReadingData-4a 9:40
Assess system capacity for large data in R by estimating memory needs, confirming RAM and 64-bit OS, and using dput, dump, and dget for version-control friendly data.
ReadingData-4b 10:48
Learn to read and write data in R using dump and readLines, create and manage connections to local files and URLs, and handle gzip-compressed data.
Debugging-1 7:59
ControlStructures 11:55
Explore control structures in programming, including if else, for loops, while loops, repeat, break, skip, next, and return, with syntax and flow-control examples.
Functions Part 01 6:51
Define functions in R as first-class objects, show their syntax and how to pass, nest, and return values. Explain formal arguments, named versus positional matching, defaults, and partial matching.
Functions Part 02 6:26
Explore defining functions in R, including default and NULL arguments, lazy evaluation, and error handling when arguments are missing; learn how the ellipsis (...) argument enables flexible argument passing.
ScopingRules1 Part 01 7:46
Explore how R binds values to symbols through lexical scoping, the search list, global environment, and namespaces, and why package order affects lookups.
ScopingRules1 Part 02 5:58
Explore how R environments store symbol-value pairs and resolve free variables. Understand closures and lexical scoping through nested functions in practice.
ScopingRules2 10:31
Explore how function environments drive variable lookup with lexical and dynamic scoping in R, illustrated by y, f, and g. See how lexical scoping enables efficient optimization of likelihoods.
Looping1 11:13
Discover looping in R with lapply, sapply, apply, tapply, and mapply, using split and anonymous functions to drive results and understand when lapply yields a list versus sapply's simplification.
Looping2 9:35
Explore how to apply functions across matrix margins with apply, tapply, and split in R, computing means, sums, quantiles, and group ranges for predictive analytics.
Looping3 9:15
Simulation_part-1 7:07
Simulation_part-2 7:52
Simulate a linear model y = 0.5 + 2x + e with normal, binomial, and Poisson inputs, and use set.seed and sampling to ensure reproducibility.
Plotting1 10:34
Learn to create plots in R using base graphics, lattice, and grid, including hist and box plots, and to choose devices (X11, pdf, png) and customize with par parameters.
Plotting2 10:00
Plotting3_part-1 8:09
Explore ggplot2 and the grammar of graphics to map data to aesthetics like color, shape, and size, and compare ggplot2 with base and lattice plotting systems in R.
Plotting3_part-2 7:53
Explore how to create and customize plots with qplot in ggplot2, including point and smooth lines, histograms, facets, and density curves for grouped data.
Plotting4 6:41
Explore ggplot2 as an implementation of the Grammar of Graphics, building layered plots from a data frame with aesthetic mappings, geoms, facets, scales, and coordinates.
Plotting5 9:02
Master ggplot2 basics by annotating plots with xlab, ylab, labs, and ggtitle, and by customizing themes, colors, and point aesthetics while illustrating outliers and smoothing with lm.
Plotting Colors 1 6:13
Plotting Colors 2 6:51
Date and Time Part1 and Date and Time Part2 11:58
Date and Time Part3 8:05
RegEx1 7:05
RegEx2 11:28
Master regex basics by using meta characters to mark line starts and ends, character classes, and alternation with | to extract patterns from text like social media feeds.
RegEx3_part-1 6:13
Master regex patterns that match terms like flood or earthquake, extract sentences starting with good or bad, and apply alternation, grouping, star, and optional metacharacters, with a backslash escape to treat dots literally.
RegEx3_part-2 6:59
Explore how regular expressions, with metacharacters like star, plus, and brackets, enable repetition and data extraction in text analytics, useful beyond R.
Classes and Methods1_part-1 8:37
Explore object-oriented programming in R, focusing on S3 and S4 classes, the methods package, and how setClass, setMethod, and setGeneric enable method dispatch.
Classes and Methods1_part-2 3:52
Classes and Methods2_part-1 6:57
Explore how generic functions dispatch to S3 and S4 methods based on an object's class, including default and trace methods, and how getMethod and getS3method retrieve code.
Classes and Methods2_part-2 7:20
Debugging Part2 6:31
Practice part 2 debugging by writing test cases, comparing expected and actual results, reproducing the problem, and using traceback, debug, browser, trace, and recover to step through code.
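
The Simulation lecture in this section simulates the linear model y = 0.5 + 2x + e and relies on a fixed seed for reproducibility. The sketch below mirrors that setup in Python for illustration, not in the R used by the course. Drawing x and e from standard normal distributions, the sample size, the seed value, and both function names are assumptions.

```python
import random


def simulate_linear_model(n=1000, seed=42):
    """Simulate y = 0.5 + 2x + e with standard normal x and e (assumed inputs)."""
    rng = random.Random(seed)  # a fixed seed makes every run identical
    xs = [rng.gauss(0, 1) for _ in range(n)]
    ys = [0.5 + 2 * x + rng.gauss(0, 1) for x in xs]
    return xs, ys


def least_squares(xs, ys):
    """Return (intercept, slope) of the ordinary least squares line."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


xs, ys = simulate_linear_model()
intercept, slope = least_squares(xs, ys)
print(intercept, slope)  # estimates should land near 0.5 and 2
```

Calling `simulate_linear_model()` twice with the same seed returns identical data, which is the property the lecture attributes to set.seed in R.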

Introduction to Minitab 5:09
Learn to use Minitab for descriptive and inferential statistics to study data and support sound business decisions. Understand population, sample, frame, gap, and the difference between non-probability and probability samples.
Types of Data 9:41
Explore data types, including attribute (categorical or count) and measurement (continuous), with nominal, ordinal, interval, and ratio scales, then learn mean, median, mode, range, and interquartile range.
Measure of Dispersion 3:17
Descriptive Stats 10:11
Explore descriptive statistics in Minitab, computing mean, standard deviation, variance, range, quartiles, and interquartile range from neck, BMI, and body fat data, with grouping by gender for comparison.
Data Sorting 5:10
Sort data in Minitab to organize a randomized dataset of 100 Rosewood High students by gender, ethnicity, and body mass index, storing results in new columns.
Histograms 5:06
Explore how to create a histogram in Minitab with a 36-month data set of working days. Learn to add data labels, adjust titles, colors, and display class intervals with cut points.
Pie Charts 8:47
Learn to create and customize pie charts in Minitab, displaying ethnicity and subject distributions with slice labels and colors, and saving graphs for external use.
Bar Charts 5:07
Create bar charts in Minitab to display counts of categorical data, like ethnicity and subjects, add data labels, edit titles, customize colors, and chart data from tables.
Line Graphs 3:39
Learn how to create line graphs in Minitab by plotting graduation year against the percentage of students admitted to tier one universities, revealing time-based trends.
Scatter plots 3:31
Box Plot 3:48
Discrete Random Variable 10:31
Compute the mean and standard deviation of a discrete random variable using a binomial distribution in Minitab, and determine the expected value for a group of 14.
Binomial Distribution 9:03
Explore the binomial distribution with Minitab by computing event probabilities for a 15-birth dataset, including at most ten and at least twelve boys, using a 0.5 success probability.
Normal Distribution 10:14
Explore normal distribution probabilities with Minitab, using a mean of 95.7 and a standard deviation of 4.9 to compute areas below 89.6, above 102, and 5% tails for fasting blood sugar levels.
Normality Test 6:05
Learn how to check normality in Minitab using three methods on lab testing time data. Validate normal distribution with Anderson-Darling test, probability plots, and normality test, interpreting p-values around 0.119.
Data Transformation 6:03
Explore transforming non-normal data into normal using the Box-Cox method in Minitab, illustrated with baking time samples; test normality with Anderson-Darling and confirm transformed data follows a normal distribution (p>0.05).
Sampling and Sample Size 5:32
Minitab demonstrates generating a suitable sample from a class of 100 students' gender, ethnicity, and BMI data and determining the ideal sample size for analyses.
Sample Size for Estimation 8:27
Learn how to determine sample size for estimation of proportions and means using Minitab, guided by examples with confidence levels, margin of error, historical data, and planning values.
Parameter Estimation 8:59
Learn parameter estimation with Minitab by calculating proportions, means, and standard deviations, and constructing 95% confidence intervals using exact, chi-square, and Bonett methods.
Power Analysis 11:35
Explore power analysis with Minitab for proportions, means, and standard deviation, using real IVF, heart-rate, and processing-time examples to determine sample sizes and test power.
Measurement System Analysis 8:08
Apply measurement system analysis in a Six Sigma project using Minitab, ensuring valid data before decisions. Explore gage R&R for continuous data, covering accuracy, repeatability, reproducibility, linearity, stability, and thresholds.
MSA Gage R and R 3:53
Conduct a Gage R&R study in Minitab using ANOVA to analyze parts and operators, read R and x bar charts, and evaluate variation and distinct categories for acceptance with caution.
MSA Attribute Agreement Analysis 10:57
Assess a measurement system for discrete data with attribute agreement analysis in Minitab, using multiple appraisers and a standard to evaluate within-appraiser, between-appraiser, and team accuracy.
Process Capability Analysis 10:20
Analyze process capability with Cp, Cpk, Pp, and Ppk for continuous data in Six Sigma, using Minitab; the example indicates the process is not capable in the short term.
Hypothesis Testing 11:06
Hypothesis Testing Mean 8:21
Paired-T Test 7:39
Apply the paired t test in Minitab to compare dependent means, using burger patty frying times with and without the additive to test whether the difference exceeds seven minutes at 95% confidence.
Anova 5:26
Pareto Analysis 7:45
Correlation 7:15
Explore how the Pearson correlation coefficient measures the linear relationship between calories consumed and weight gained, interpret r from -1 to 1, and assess statistical significance with p-values in Minitab.
Regression 4:55
Link regression to correlation by deriving a simple linear model y = b0 + b1 x + e from historical data to predict future y and measure fit with r-squared.
Regression Continue 11:09
Control Charts 10:36
Learn how control charts drive Six Sigma improvements by distinguishing common and assignable variation, monitoring attribute and variable data, and catching out-of-control conditions before defects.
P-Chart 9:41
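
The Binomial Distribution and Normal Distribution lectures in this section compute probabilities for 15 births with a 0.5 success probability and for fasting blood sugar with mean 95.7 and standard deviation 4.9. The sketch below reproduces those calculations with the Python standard library, for illustration only, since the course performs them in Minitab. The variable names are assumptions, and reading "5% tails" as the cutoff below which the lowest 5% of values fall is also an assumption.

```python
from math import comb
from statistics import NormalDist


def binomial_pmf(k, n, p):
    """Probability of exactly k successes in n independent trials."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


n_births, p_boy = 15, 0.5
# P(at most ten boys) and P(at least twelve boys) out of 15 births.
p_at_most_10 = sum(binomial_pmf(k, n_births, p_boy) for k in range(0, 11))
p_at_least_12 = sum(binomial_pmf(k, n_births, p_boy) for k in range(12, n_births + 1))

blood_sugar = NormalDist(mu=95.7, sigma=4.9)
p_below_89_6 = blood_sugar.cdf(89.6)        # area to the left of 89.6
p_above_102 = 1 - blood_sugar.cdf(102)      # area to the right of 102
lower_5pct_cutoff = blood_sugar.inv_cdf(0.05)  # assumed reading of "5% tails"

print(p_at_most_10, p_at_least_12)
print(p_below_89_6, p_above_102, lower_5pct_cutoff)
```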

Introduction of Predictive Modeling 9:15
Non Linear Regression 10:51
Anova and Control Charts 9:57
Apply one-way ANOVA and basic regression concepts, explore scatter plots and regression analysis in Minitab, and import data to perform descriptive statistics and graphical summaries.
Understanding, Interpretation and implementation using Minitab 11:00
Explore descriptive statistics, means, standard deviations, t tests, and skewness and kurtosis using Minitab, with practical data from mutual fund returns.
Continue on Interpretation and implementation using Minitab 10:40
Explore descriptive statistics in Minitab to assess fund returns, including mean, standard deviation, variance, skewness, and kurtosis. Interpret how confidence level and graphical summaries inform risk comparisons and investment decisions.
Observation 11:37
Observe how standard deviation measures risk and volatility across funds, guiding investment choices to match a listener's risk appetite, with data organized using Minitab and Excel.
Results for NAV Prices 6:47
The lecture analyzes NAV price statistics, including mean, standard deviation, and range, noting higher volatility for ICICI Prudential Tech Fund, Banking and Financial Services Fund, and HDFC Equity Fund.
NAV Prices - Observations 10:27
Assess fund volatility by examining standard deviation and range; identify X equity and HD cap as having the lowest volatility, while IBF and Itec show higher volatility.
Descriptive Statistics 8:09
Explore descriptive statistics in Minitab across finance, medical, and energy data, using mean, standard deviation, range, and skewness to interpret risk and volatility.
Customer Complaints-Observations 9:57
Apply descriptive statistics to customer complaints and resting heart rate data to highlight mean, median, standard deviation, and skewness, and interpret three shifts and activity effects.
Resting Heart Rate Observations 8:30
Compare before and after resting heart rate using descriptive statistics, noting minimal mean and median differences, and highlight data quality, interpretation, and loan applicant skewness for predictive modeling.
Results for Loan Applicant MTW 9:30
Analyze loan applicant data by examining income distribution, education level, age, savings, and debt, noting high income variance, negative skewness, and typical credit card ownership.
More Details on Results for Loan Applicant MTW 8:48
Analyze loan applicant data through predictive analytics, noting income variability, high savings dependent on spending, low debt, and credit card usage up to six, with R, Minitab, SPSS, and SAS.
Features of T- Test 9:33
Explore the features of the t test in predictive modeling, including single-sample and two-sample t tests, p-values, and interpretation using resting heart rate data in Minitab.
Loan Applicant 6:16
Use a paired t test in Minitab to assess if debt depends on income for a loan applicant, interpreting t and p values from the income and debt data.
Paired T - Test 6:47
Apply paired t tests and two-sample tests to determine if savings affect debt, interpreting t and p values in predictive modeling with Minitab, SAS, SPSS, or R.
Understanding and Implementation of ANOVA 10:25
Explore one-way ANOVA to determine if mutual fund return means differ, using Minitab to compute p-values, r-squared, and confidence intervals for hypothesis testing.
Pairwise Comparisons 7:55
Assess pairwise comparisons in ANOVA by evaluating p values, r-squared, and confidence intervals; conclude that not all means are equal when p < 0.05, rejecting the null hypothesis.
Features of Chi - Test 11:19
Explains how to compare observed and expected frequencies using a chi-square (g square) test, including degrees of freedom, null and alternative hypotheses, and p-values, with a practical umbrella handles example.
Preference and Pulse Rate 9:57
The instructor explains g square chi-square tests to compare observed versus expected frequencies for smoking preferences and pulse rates before and after running, with 95% confidence and p-values.
Diffe. btw Growth Plan and Dividend Plan in MF 7:06
Analyze differences between growth and dividend plans in mutual funds by comparing NAV and repurchase prices, and test observed versus expected prices using chi-square tests.
Checking NAV Price and Repurchase Price 6:18
Illustrates using the chi-square goodness-of-fit test in Minitab to compare NAV price and repurchase price, test null vs alternative hypotheses, and interpret p-values and critical values to decide on rejection.
Basic Correlation Techniques 8:33
Explore basic correlation techniques to understand positive, negative, and zero relationships, interpret correlation coefficients between -1 and 1, and apply these concepts with sample data in Minitab.
More on Basic Correlation Techniques 5:50
CT Implementation Using Minitab 10:05
Explore how to compute Karl Pearson's and Spearman's rho in Minitab, build a store matrix from data, and understand why the unitary matrix omits diagonal correlations in a 4x4 matrix.
Continue on Implementation using Minitab 3:19
Continue on implementation using Minitab demonstrates arranging variables, exploring correlations, and interpreting values like 0.853, 0.778, and 0.015, with color-coded results in predictive analytics.
Interpretation of Correlation Values 6:05
Interpret correlation values within a 5x5 matrix, highlighting diagonal 100% correlations and positive or negative links. Learn how these patterns inform diversification and predictive modeling with Minitab.
Results for Return 8:42
Calculate Pearson correlations with Minitab to distinguish positive, negative, and zero relationships in mutual fund returns. Note self-correlation of 100% and the use of r and p-values in interpreting associations.
Correlation Values - Observations 5:55
Correlation Values - Interpretations 8:11
Explore how correlation values reveal diversification benefits across sectoral funds, with AI tech and IBF showing strong diversification; learn credible predictive interpretations for portfolio decisions.
Heart Beat - Objective 5:53
Explore how resting heart rate varies before and after rest using Pearson correlation, revealing a strong positive relationship (r = 0.716) between the two measures.
Heart Beat - Interpretation 5:19
Interpret heartbeat variations before and after rest and practice deciding when prediction is meaningful, then perform correlation analysis on income, savings, and debt in a demographics dataset.
Demographics and Living Standards 6:07
Analyze correlations among income, savings, and debt using a tabulated matrix, noting a positive correlation between income and savings and negative correlations between income and debt and savings and debt.
Demographics and Living Standards - Observation 6:07
Analyze observed correlations among income, savings, and debt, noting a positive link between income and savings and negative links with debt, organized in tabulations for demographics and living standards.
Graphical Implementation 9:02
Add Regression Fit 8:46
Scatterplot with Regression 5:39
Explore scatterplots with regression to analyze relationships, compute correlation values such as positive 0.21 between income and savings and negative correlations with debt, using multiple graphs for panels.
Scatterplot of Rhdeq vs Rhcap 4:36
Explore scatterplots with regression to reveal strong positive correlations across heartbeat data and HDFC equity, and interpret correlation values such as 71.6% in predictive analytics and modeling.
Introduction to Regression Modeling 8:47
Explore regression modeling from simple linear regression y = mx + c to interpreting r squared, t values, and p values, and predict outcomes using Minitab.
Identify Independent Variable 8:32
Identify the independent variable as weight (a continuous predictor) and the dependent variable as heartbeat after run, to fit a regression model in Minitab, interpreting the regression equation and r-squared.
Regression Equation 7:45
Tabulating the Values 6:11
Tabulating these values highlights relevant variables, noting the y intercept t value is 8.93 with a zero p value, and weight is insignificant for the after run heart pulse.
Interpretation and Implementation on Data Sets 7:57
Analyze how a smoker's heartbeat depends on weight using a regression equation, interpret t and p values, and predict pulse from weight with Minitab and Excel.
Continue on Interpretation on Database 8:31
Explore how smoker weight affects post-run heartbeat, with higher weight linked to lower heart rate and a negative weight coefficient. Assess regression significance and fit for before-run and after-run heartbeats.
Significant Variable 7:40
Explore how regression outputs are written as y = mx + c and interpreted alongside r square and p values, with weight shown as insignificant and descriptive statistics for height.
Calculating Corresponding Values 8:55
Compute corresponding y values from given weights and heights to assess before-run smoker pulse, then use Minitab to create scatter plots with regression lines and interpret weight- and height-related trends.
Identify Dependent Variable 9:03
Identify dependent and independent variables in a regression model, showing how energy consumption depends on machine energy settings, and fit a simple linear regression in Minitab to derive the equation.
Generate Descriptive Statistics 8:41
Explore descriptive statistics for machine setting as the independent variable and its impact on energy consumption, including mean, min, max, range, standard deviation, and a regression relation.
Scatterplot of Energy Consumption 6:33
Explore scatterplot analysis and regression modeling to assess energy consumption relationships and copper expansion with temperature, using Minitab to interpret r-squared, p-values, and model fit.
Identity Equation 8:00
Analyze a simple regression model that explains about 69% of the variance, using Kelvin as predictor with y = 0.021060 Kelvin + 7.449, and significant t and p values.
P - Value and T - Value 7:11
Explore p-value and t-value in the context of simple linear regression modeling copper expansion as a function of temperature in kelvin, using Excel to compute the regression equation.
Changes in Tem. and Expansion 8:17
Explore how temperature changes drive copper expansion through regression, with kelvin as predictor, expansion as dependent variable, and a scatter plot showing a 0.83 correlation and 68.95% r-squared.
Objective of Stock Prices 9:19
This lecture uses a finance example to test whether Reliance and Infosys stock returns depend on BSE Sensex returns, using Minitab to fit regression and report r-squared and p-values.
Interpretations of Example 5 8:40
Analyze example 5 interpretations, noting R square 50.16% and Sensex returns predicting Reliance returns with significant t and p values, and apply the regression equation in Excel for unit-term predictions.
Reliance Return Change8:26
Generate Predicted Values7:36
Apply the regression equation to generate predicted Infosys returns from BSE Sensex changes, using Excel, and interpret results with R-squared, t and p values, plus a regression scatter plot.
Scatterplot Return RIL7:21
Explore linear and simple regression, interpret R-squared, p-values, and t-values; create regression equations and scatter plots for Reliance and Infosys against Sensex.
Basic Multiple Regression8:36
Explore how density and temperature influence the stiffness of a plastic board through multiple regression, using Minitab to model predictors and interpret the t, p values and r-squared.
Basic Multiple Regression Continues8:25
Explore continuing multiple regression with an example predicting stiffness of a plastic board using density and temperature; interpret regression coefficients, r square, and p values, noting density as significant.
Basic Multiple Regression - Interpretation8:36
Explore multiple regression with density and temperature predicting stiffness in a plastic board, using Minitab to identify the dependent variable, predictors, and model statistics.
Generate Basic Statistics7:22
Build a basic regression model predicting stiffness from density and temperature, compare models with and without temperature, and interpret confidence intervals, t and p values, plus a scatter plot.
Working on Scatterplot4:02
Dependent Variable Objective11:30
Concept of Multicollinearity9:20
Identify Dependent Variable Y11:41
Identify the dependent variable y as total heat flux and model it with predictors insolation, east, north, south, and time of day using regression, checking multicollinearity and noting r square.
Outputs and Observation11:57
Explore regression outputs and observations, interpret p-values for predictors (insolation, east, north, south) and time of day, and interpret heat flux implications with an r-squared of 89.88%.
Interpretations - Example 3 10:23
Time of day remains insignificant in the current regression model; the lecture covers two regression equations (with and without time of day) and Excel-based descriptive statistics to compare predictions.
Calculate with and without Flux7:09
Analyze how insolation and time of day shape total heat flux using a regression model and predict values from the regression equation, with and without time of day.
Scatterplot of Heat Flux Vs Insolation6:13
Generate scatter plots of heat flux against insolation, east, south, north, and time of day, interpret regression outputs, note correlations and multicollinearity, and apply to cotton wrinkle resistance.
Interpretation of Datasets12:06
Analyze how formaldehyde concentration, catalyst ratio, temperature, and time affect cotton's durable press wrinkle resistance through a four-variable regression in Minitab, yielding an R-squared of about 73% and an interpretable regression equation.
Implementation of Datasets7:22
Explain regression outputs by interpreting r-squared 72.98%, identifying significant predictors like concentration and temperature, noting insignificant constants and time, and evaluating p-values, t-values, and f-values for best-fit models.
Example 4 Observations9:30
Learn to build a regression model predicting wrinkle resistance rating from concentration, ratio, temperature, and time; assess intercept and variable significance and explore 90% and 75% confidence intervals with Excel.
Display Descriptive Statistics6:41
Display descriptive statistics by showing min and max values for concentration, temperature, and time, and show how inputs drive predicted values in predictive modeling.
Predicted Values Example 4 9:55
Scatterplot of Example 4 5:23
Explore scatterplots and regression analyses for ferrite, aluminate, silicate, and trisilicate, comparing simple and multiple regression amid multicollinearity, with r-squared and p-value interpretations.
Calculating IV - Multiple Regression9:39
Use a regression-based approach to calculate density from known temperature and stiffness by adapting the regression equation, enabling density estimates for specified stiffness and temperature values.
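Adapting the regression equation means rearranging it algebraically so that one predictor becomes the unknown. A minimal sketch, assuming a fitted model of the form stiffness = b0 + b_density × density + b_temp × temperature; the coefficient values in the usage lines are hypothetical placeholders, not the Minitab output from the lecture.

```python
def solve_for_density(stiffness: float, temperature: float,
                      b0: float, b_density: float, b_temp: float) -> float:
    """Rearrange stiffness = b0 + b_density*density + b_temp*temperature
    to isolate density for a target stiffness and a given temperature."""
    if b_density == 0:
        raise ValueError("density coefficient must be non-zero to invert the equation")
    return (stiffness - b0 - b_temp * temperature) / b_density


# Hypothetical coefficients, for illustration only.
density = solve_for_density(stiffness=65.0, temperature=30.0,
                            b0=10.0, b_density=2.0, b_temp=0.5)
print(density)
```

The result is the density the fitted equation implies, so it inherits the model's error; it is not a measured value.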
Calculating Independent Multiple Regression4:20
Explore how to compute a specific independent variable in a multiple regression model, predicting density across varying temperatures and stiffness values, with a preview of logistic regression in the next session.
Understanding Basic Logistic Scatter Plot10:23
Explore logistic regression with dichotomous and categorical variables, modeling how smoking affects after running heart pulse across gender, using height and weight as predictors.
Basic Logistic Scatter Plot Continues8:15
Extend regression analysis across gender by deriving separate equations for females and males, using height and weight to predict after running heart pulse, with Minitab demonstration on a 90-respondent dataset.
Generation of Regression Equation11:29
Generate and interpret regression equations for heart pulse using height and weight as continuous predictors, with gender and smoking as predictors, evaluating model fit with p-values, t-values, and r-squared.
Tabulated Values7:20
Explore tabulated values and regression outputs to determine variable significance and predictability, highlighting ambiguous outputs, weak r square, and lack of strong correlation in scatter plots.
Interpretation and Implementation on Dataset10:31
Apply regression modeling to predict sales from client count and years, using group-based dummy variables and separate equations to compare three company segments in Minitab.
Interpretation and Implementation on dataset Continues7:48
Analyze regression results across three groups, interpreting r-squared at 81.69%, the t-values and p-values, and the role of clients versus years in business, while noting multicollinearity.
Output and Observation - Tabulated Values8:41
Business Metrics Example6:46
Interpret regression results with an r square of 81.69%, assess variable significance via t tests, and show how sales rise with client count and years in business across three groups.
Example Two and Three Interpretations6:51
Interprets regression-based sales predictions by analyzing how client counts and years affect revenue across three groups, using Excel equations to estimate and compare projected sales.
Regression Equation Group7:44
Examine how a regression equation uses clients and years in business to predict group sales across three groups, illustrating how changing inputs alters the predicted values.
Interpretation and Implementation of Scatter Plot9:14
Interpret how to plot sales against clients and years across groups, analyze scatter plots with regression, and interpret positive correlations despite limited data points.
More on Implementation of Scatter Plot5:51
Learn to implement scatter plots and generate predicted values, handling categorical predictors and regression outputs, including R-squared, t-values, and p-values, with examples using temperature, strength, and manufacturers.
Plastic Case Strength11:01
Learn how to model plastic case strength as a function of temperature with manufacturers as a categorical predictor, derive regression equations for each manufacturer, and interpret r-squared around 0.59.
Separate Equations10:59
Analyze separate regression equations for manufacturers A and B, with significant constant and temperature terms, using r square to judge fit and exploring predicted strength values.
Generation of Predicted Values10:30
Generate predicted strength values for manufacturers A and B in Excel using the equation strength = 7349 - 11.47 × temperature, and show how temperature between 183 and 208 shifts strength.
Scatter Plot Strength Vs Temp10:13
Analyze a scatter plot of plastic strength versus temperature for manufacturers A and B; manufacturer B declines sharply with heat, while manufacturer A preserves strength better.
Data of Cereal Purchase11:27
Explore how logistic regression analyzes cereal purchase decisions using income, whether viewers have children, and ad exposure, yielding four equations to predict buying behavior.
Children Viewed and RE10:18
Develop and interpret regression equations linking income and ad exposure to children viewed across four scenarios, using Excel to format, compute, and compare predicted outcomes.
Predicted Values for Individual Customers11:47
Explore how to derive predicted values for individual customers across four regression equations conditioned on income, ad exposure, and parental status, and interpret R square, t values, and p values.
Income Independent Variable9:22
Evaluate regression results by examining income as an independent variable, interpreting p-values, t-values, and R-squared to assess significance and predictive ambiguity, with scatter plots illustrating outcomes.
Example of Credit Card Issuing11:13
Apply logistic regression to determine how age, education, debt, and savings influence income and guide credit card granting decisions for users and non-users.
Example Five - Tabulated Values9:05
Explore a regression-based credit card grant decision, using tabulated values and outputs like r-squared and p-values to evaluate age, education, debt, and savings as predictors.
Generating Outputs8:31
Example Five Interpretations11:17
Explore how regression interpretations vary across credit card scenarios, with changes in education, age, and savings shaping income, while assessing intercept shifts and predictive value using Excel.
Situations Income9:34
Explains building a predictive model for credit card approvals across four situations (nn, ny, yy, yn) using age, education, savings, and debt, and highlights debt's insignificance.
Scatterplot7:16
Explore how to interpret scatterplots in predictive analytics, comparing predicted values with debt factor scenarios to understand income level effects and positive or negative correlations.
Scatter Plot Scale8:31
Using Data Analysis Toolpak6:38
Develop predictive modeling in Excel with the Data Analysis Toolpak, covering ANOVA, t-test, F-test, regression, correlation, and descriptive statistics.
Implementation of Descriptive Statistics8:14
Activate the Excel data analysis toolpak and run descriptive statistics for height and weight. Set input ranges and outputs, apply a 95% confidence level, and format results to two decimals.
Descriptive statistics - Input Range7:13
Implementation of ANOVA6:25
Learn how to implement single-factor ANOVA in Microsoft Excel using the Data Analysis Toolpak, including setting alpha to 0.05 and interpreting p-values and F statistics for comparing multiple groups.
Implementation of T - Test5:50
Implementation Using Correlation10:17
Learn to implement correlation in predictive modeling with Excel, using data analysis tools and the correlation formula to compute correlation coefficients between variables.
Implementation Using Regression11:51
Implement linear regression in Excel using the data analysis toolpak, selecting y and x ranges, interpreting R-squared and ANOVA, and reviewing line fit plots and residuals for prediction tasks.

Implementation using SPSS11:59
Apply descriptive statistics in SPSS to identify the highest and lowest crime incidences across states, using larceny, murder, and robbery examples like Arizona, Mississippi, Nevada, and New York.
Implementation using SPSS Continues8:06
Analyze descriptive statistics in SPSS using the gasoline dataset, importing data, selecting length and faults, and interpreting variance, standard deviation, skewness, and kurtosis to assess quality and normality.
Importing Datasets in Text and CSV6:12
Import text and csv datasets in SPSS, then apply descriptive statistics, correlations, and regression modeling for predictive insights.
Other Concepts of Understanding Mean SD10:50
Examine descriptive statistics for datasets, computing mean, standard deviation, variance, kurtosis, range, minimum and maximum, and interpret implications for volatility and returns in predictive modeling.
Software Menus4:52
Explore descriptive statistics and graphing in SPSS using stock returns as examples, including data view, variable types, scatter plots, and setup for regression modeling in predictive analytics.
Understanding Mean Standard Deviation10:57
Explore descriptive statistics in SPSS, including mean, standard deviation, skewness, kurtosis, and range, and interpret the output from sample data sets.
Understanding User Operating Concepts5:25

Introduction of SAS Enterprise Miner11:35
Explore the basics of predictive modeling with SAS Enterprise Miner, create a new project, configure data sources, and navigate menus and nodes to build early models.
Select a SAS Table9:54
Select the rest data table, review its properties and metadata, and set the target variable for binary modeling in SAS Enterprise Miner, then create a process flow diagram.
Creating Input Data Node12:58
Create an input data node from a data source, apply data partitioning and filtering for outliers, and connect nodes to build a predictive analytics workflow in SAS Enterprise Miner.
Metadata Advisor Options8:53
Configure metadata for time series analysis by defining fields such as month, product, state, and sales as target variable, creating a transaction data source, and applying seasonal decomposition.
Add More Data Sources11:01
Extend predictive modeling with SAS Enterprise Miner by adding more data sources, building exploration diagrams, and using the StatExplore node to assess chi-square and Cramer's V for variable importance.
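Cramer's V rescales the chi-square statistic of a contingency table to the range 0 to 1, which is why it is convenient for ranking categorical inputs. A minimal sketch of the textbook formula in plain Python; it is not the Enterprise Miner implementation, and the function name and sample tables are illustrative.

```python
import math


def cramers_v(table: list[list[float]]) -> float:
    """Cramer's V = sqrt(chi2 / (n * min(rows - 1, cols - 1)))
    for a contingency table of observed counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    if n == 0 or min(row_totals) == 0 or min(col_totals) == 0:
        raise ValueError("every row and column needs at least one observation")
    k = min(len(row_totals), len(col_totals)) - 1
    if k == 0:
        raise ValueError("table needs at least two rows and two columns")
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n  # count under independence
            chi2 += (observed - expected) ** 2 / expected
    return math.sqrt(chi2 / (n * k))


print(cramers_v([[10, 0], [0, 10]]))  # perfectly associated table
print(cramers_v([[5, 5], [5, 5]]))    # independent table
```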
Sample Statistics10:22
Explore a three-variable data set (month-to-month saving balance changes, interest rate differential, and ads expenses) using the StatExplore, MultiPlot, and Graph Explore nodes to assess Pearson and Spearman correlations and generate interactive plots.
Trial report9:32
Explore visualizing a binary response with scatter plots, examine age and income as variables, and perform variable clustering and regression steps in predictive analytics.
Properties of Cluster Node8:06
Learn to run and interpret a cluster node, including standardization, Ward's clustering, segment plots, and associated statistics, plus a glimpse into variable selection and data partition steps.
Variable Selection9:13
Input Variable9:38
Master variable selection and transformation in SAS Enterprise Miner for predictive modeling across continuous and binary targets with numeric or nominal inputs, including variable clustering and before/after transformations.
Input Variable Continues9:43
Demonstrates a SAS Enterprise Miner workflow from input and data partition to variable selection and regression, identifying a continuous interval target and selecting high-impact variables using r square criterion.
Values of R-Square9:22
More on Variable Selection8:55
Explore variable selection in SAS Enterprise Miner, identifying five key inputs that drive the regression model and reveal their impact on the target variable.
Binary Target Variable8:41
Explore how to select input variables for a binary target using chi-square or r-squared criteria, with a bank email response example, data partitioning, and regression modeling.
Variable and Effect Summary9:16
Examine predictive analytics and modeling workflows, tracking selected versus rejected inputs, 12 of 282 variables, and assessing model performance with chi-square tests and variable importance visuals for binary targets.
Variable Selection - Variable ID's8:39
Variable Frequency Table9:12
Examine the variable frequency table across 19 clusters, including selector variables, cluster plots, and a dendrogram, then see regression modeling with stepwise selection and model comparison for credit line.
Variable S - Updating Model Comparison8:47
Compare two regression models via variable clustering and a decision tree node, updating inputs from both paths to assess training, validation, and test performance with mean and max predicted.
Run Data Partition Node8:14
Build a predictive analytics workflow by connecting a decision tree to a regression model, using variable clustering and selection to form leaf segments, and comparing models with output statistics.
Variable Selection - Fit Statistics9:22
Explore variable selection and fit statistics in predictive modeling, comparing regression and decision tree approaches, examining training and validation results and model comparisons.
Understanding Transformation of Variables9:37
Explore transformation of variables using the transform variables node to convert inputs into regression-ready features, and compare transforming before versus after variable selection in a binary target workflow.
Score Ranking Overlay Res.9:08
Explore regression modeling with score rankings overlay, variable selection before and after transformation, and a final model linking inputs to outputs via lift, R-squared, and training and validation metrics.
Update Transformation of Variables9:46
Update variable transformations with optimal binning and merged inputs, run a regression model to predict credit score, and review SAS code and training and validation results.
Combination of Different Models8:44
Compare and combine decision trees, regression, and neural networks to evaluate models against binary and ordinal targets, using a flow diagram, data partitions, and model comparison analysis.
Properties of Neural Network8:33
Analyzing the Output Variable12:16
Analyze how a binary output variable is modeled using decision trees, neural networks, and regression methods, and interpret predictions, node rules, variable importance, and training versus validation performance.
Combination of Regression Model7:42
Combination - Result of Regression Node10:29
Review the regression node outputs for a normal target with loss frequency; examine parameter estimates, standard errors, and fit statistics, then explore the decision tree results and SAS code execution.
Subseries Plot11:55
Analyze a subseries plot with the attrition target variable's variable importance and fit statistics, and compare decision tree and gradient boosting using ROC curves and SAS code.
Creating Densemble Diagram9:40
Create an ensemble diagram by combining logistic regression, decision tree, and neural network models with an ensemble node, using the attrition target and data partition and impute steps.
SAS Code11:55
Decision Tree Model10:06
Explore decision tree modeling in SAS Enterprise Miner to predict a binary response and a continuous loss frequency, using data partition node, profit matrix, and SAS code node.
Run and Update Decision Tree Model10:10
Run and update a decision tree model on partitioned data, interpret input and target variables, review fit statistics and tree structure, and execute SAS code.
Creating Dscore Node8:37
Apply the score node to a prospect dataset, predicting the probability of response with a decision tree, data partitioning, and model comparison in the D score diagram.
DT - Result of Model Comparison10:19
Leaf Statistics and Tree Map10:16
Analyze a regression tree model's output, depth, and leaf statistics. Compare training, validation, and test results within SAS Enterprise Miner.
Interactively Decision Trees9:22
Interactively build and modify a decision tree workflow from a root node, using an input node, partition node, three decision trees, and a model comparison to evaluate results.
Result Node Data Partition9:14
Change the data split and rebuild the model to produce an interactive decision tree, showing target and input variables, leaf nodes, and fit statistics across train, validation, and test sets.
Interactively Trees Window9:06
Explore creating and refining interactive decision trees: split nodes via right-click, apply nominal and ordinal rules, and train branches from scratch in the interactive window.
Building a Decision Tree8:50
Build a decision tree from scratch in Enterprise Miner by iteratively splitting nodes using entropy and log values, handling missing data, and expanding to the maximum tree; assess performance.
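An entropy-based split picks the partition that most reduces impurity in the target. A minimal sketch of the textbook definitions (base-2 entropy and information gain) in Python; it illustrates the idea only and is not Enterprise Miner's own code, and the sample labels are invented.

```python
import math
from collections import Counter


def entropy(labels: list) -> float:
    """Shannon entropy in bits: -sum(p * log2(p)) over class proportions."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def information_gain(parent: list, children: list[list]) -> float:
    """Parent entropy minus the size-weighted entropy of the child nodes."""
    n = len(parent)
    weighted = sum(len(child) / n * entropy(child) for child in children)
    return entropy(parent) - weighted


# Invented binary target: a split that separates the classes perfectly.
parent = [0, 0, 1, 1]
print(information_gain(parent, [[0, 0], [1, 1]]))
```

A candidate split with higher information gain produces purer child nodes, so it would be preferred when growing the tree.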
Neural Network Model10:10
Explore neural network models in SAS Enterprise Miner to predict response and risk for auto insurance, using a two-layer network with hidden units and probabilistic outputs.
Neural Network Model Output9:40
Learn how a neural network model runs and is evaluated, from binary inputs and a binary target to training iterations, average squared error, misclassification rate, and SAS-based model comparison.
Model Weight History12:28
Explore how a neural network derives optimal weights across iterations by viewing weights history and final weights, noting the average squared error on the validation set peaks around iteration 49.
Neural Network - Final Weight6:08
Explore how a neural network outputs final weights and performance metrics, including iteration weights, error plots, and SAS code generated by Enterprise Miner, with emphasis on binary target prediction.
ROC Chart7:41
Score and compare neural network models with SAS, performing model comparison and scoring, and examining fit statistics, lift, and cumulative performance across training, validation, and test data.
Neural Network - Iteration Plot8:45
Explore the neural network iteration plot to inspect training metrics such as average and root mean square errors, misclassification rates, and weight histories, with SAS and RSP scoring insights.
Neural Network - SAS Code10:08
Compare neural network results in SAS code by examining lift, gain, percentage response, and errors, and interpret final weights and model history across iterations.
Neural Network - Cumulative Lift6:23
Explore how changing neural network models affects cumulative lift and related performance metrics, comparing eight networks through model comparison plots, SAS code, and scoring across training, validation, and test sets.
Decision Processing6:22
Compare the DMNeural, AutoNeural, and Dmine Regression nodes by building a diagram, configuring data, targets, weights, and partitions, then run and compare models.
Results of Auto Neural Node7:01
Explore the auto neural node results, including weights, iterations, and training settings, in SAS Enterprise Miner. Review the generated SAS code, scoring, lift, gain, and model diagnostics to assess performance.
Run Model Comparison8:00
Compare DM neural, deep neural regression, and auto neural; auto neural delivers the lowest average squared error, lower misclassification rate, and highest cumulative captured response and profit.
DEX - Variable ID's10:48
Create a binary target variable with prior probabilities, partition data, and build multiple neural networks (multilayer perceptron, radial width variants, auto neural) to compare performance.
Average Square Error6:14
Examine how switching to the average squared error as the error function alters iteration plots, misclassification rates, weights history, and SAS-backed neural network results.
Score Rating overlay - Event5:41
Explore how the score rating overlay analyzes iteration plots, error metrics, and weight updates from the auto neural node to reveal cumulative lift, gain percentage, and SAS code generation.
Run Dmine Regression Node5:53
Run the Dmine Regression node and review outputs such as training proportions, predicted variables, fit statistics, and ROC comparison, noting the Dmine Regression model performs best.
Regression with Binary Target7:59
Explore regression models with binary and continuous targets using SAS Enterprise Miner, including regression node properties like link function, selection model, and criteria, applied to mail campaign response prediction.
Regression - Table Effect Plots7:43
In regression analysis with SAS Enterprise Miner, learn to read table effect plots, interpret intercepts and effects, and assess lift, gain, and cumulative response for binary and ordinal targets.
Result of Regression Model8:53
Explore building and evaluating a regression model to predict ordinal and nominal targets using Enterprise Miner, covering data partitioning, model diagnostics, and SAS code generation.
Update Regression Node8:56
Creating Flow Diagram8:40
Create a flow diagram to build a logistic regression with backward selection in SAS Enterprise Miner, using RSP as target, partitioning, training/validation/test sets, and reporting odds ratios and SAS code.
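An odds ratio in a logistic regression report is the exponential of the fitted coefficient. A minimal sketch of that conversion in Python; the variable names and coefficient values are hypothetical and are not taken from the course's output.

```python
import math


def odds_ratios(coefficients: dict[str, float]) -> dict[str, float]:
    """Convert logistic regression coefficients (log-odds) to odds ratios."""
    return {name: math.exp(beta) for name, beta in coefficients.items()}


# Hypothetical coefficients, for illustration only.
fitted = {"age": 0.0, "prior_purchases": math.log(2.0)}
for name, ratio in odds_ratios(fitted).items():
    print(name, round(ratio, 3))
```

An odds ratio of 1 means the input leaves the odds of the event unchanged, while a ratio of 2 means each one-unit increase doubles the odds, holding other inputs fixed.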

What is Predictive Modelling2:25
Explain how predictive modeling uses historical and current data, applies techniques, and generates futuristic data. Identify trends and patterns, and recognize meaningful information as data used for prediction.
Predictive Modelling4:15
Explore predictive modeling, a machine learning approach that uses statistical and mathematical techniques to transform data into models that produce accurate future estimates, patterns, and trends.
How to Build A Predictive Model1:55
Learn to build a predictive model by assembling customer attributes into a dataset, plotting two variables like age and items purchased, and respecting data quality in a multidimensional feature space.
Types of Variables1:22
Explore the types of variables, distinguishing dependent from independent variables, and see how observed values like age, gender, zip code, and purchase counts inform predictive modeling.
Difference Between Variables2:40
Compare independent and dependent variables, noting that the independent variable can be manipulated to explain effects on the dependent variable, while gender cannot, with examples from age brackets and generations.
Other Types - Extraneous Variables2:07
Identify extraneous variables beyond price, including control, moderating, and intervening types; control keeps price constant, moderating relates to returns, and intervening infers unquantified effects on customer behavior.
How to Build A Predictive Model Steps6:20
Algorithms1:36
Explore 13 predictive modeling algorithms, including time series, regression, association, clustering, outlier detection, decision trees, neural networks, Naive Bayes, SVM, uplift, and survival analysis, to forecast data and reveal trends.
Forecasting Methods2:12
Compare qualitative and quantitative forecasting methods, using data mining and statistical analysis to identify trends and predict future events, with qualitative relying on expert judgment for new products and technology.
What is Time Series10:36
Time series uses historical data at regular intervals to forecast future values and reveal trend, cyclical, seasonal, and irregular patterns, with smoothing methods like moving averages.
Smoothing Methods - Moving Averages9:03
Master smoothing for time series with moving averages and weighted moving averages, plus single, double (Holt's), and triple (Winters') exponential smoothing, using alpha and past values to forecast.
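The two simplest smoothers named above can be sketched in a few lines of Python. This is a minimal illustration, assuming the common convention that the first smoothed value equals the first observation (other initializations exist); the function names and sample series are invented.

```python
def moving_average(values: list[float], window: int) -> list[float]:
    """Simple moving average: the mean of each run of `window` observations."""
    if window <= 0 or window > len(values):
        raise ValueError("window must be between 1 and len(values)")
    return [sum(values[i - window:i]) / window
            for i in range(window, len(values) + 1)]


def exponential_smoothing(values: list[float], alpha: float) -> list[float]:
    """Single exponential smoothing: s_t = alpha * x_t + (1 - alpha) * s_(t-1)."""
    if not 0 < alpha <= 1:
        raise ValueError("alpha must be in (0, 1]")
    if not values:
        return []
    smoothed = [values[0]]  # assumed initialization: s_0 = x_0
    for x in values[1:]:
        smoothed.append(alpha * x + (1 - alpha) * smoothed[-1])
    return smoothed


series = [10.0, 20.0, 30.0, 40.0, 50.0]  # invented data
print(moving_average(series, 3))
print(exponential_smoothing(series, 0.5))
```

A larger alpha weights recent observations more heavily, so the smoothed series reacts faster but filters less noise.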
Smoothing Methods - Double Exponential Smoothing9:08
Explore double exponential smoothing and the trend smoothing constant beta for time series forecasting, then survey five regression algorithms from linear to multiple linear regression.
Regression Algorithms - Exponential9:01
Clustering Algorithms - Definition6:10
Explore clustering algorithms that group unlabeled data by similarity or descriptive concepts, using distance-based, exclusive, overlapping, hierarchical, and probabilistic approaches, including k-means, fuzzy c-means, and mixtures of gaussians.
Clustering Algorithms - Fuzzy C Means Clustering6:49
Explore fuzzy C means clustering with degrees of membership and iterative center updates, and note hierarchical clustering, Gaussian mixture models, decision trees, and outlier detection.
Neural Network Algorithm10:50
Explore neural networks and learning models, including Kohonen self-organizing maps, Hopfield nets, bump tree network, Monte Carlo analysis, factor analysis, and Naive Bayes theorem.
Support Vector Machines10:19
Learn support vector machines and uplift modeling to predict treatment effects and customer behavior, then apply survival analysis and Bayes theorem to time-to-event outcomes.

Introduction to Eviews Training7:34
Explore econometrics-focused finance modeling using Eviews, with hands-on exploration of descriptive statistics, correlogram, and cointegration test, plus regression, autocorrelation, and ARCH models.
Eviews GUI4:25
Navigate the Eviews GUI, start the software, and import foreign data from formats like WF1, DBF, Excel, SAS, SPSS, and text; Minitab data isn't supported.
Eviews GUI Continues6:37
Explore the Eviews GUI for estimating regression equations and viewing outputs like R-squared and Durbin-Watson. Generate returns, create volatility graphs, and perform descriptive statistics.
Generating Log Returns9:54
Generate and interpret log returns and descriptive statistics, including standard deviation, with a t test in Eviews across five mutual fund data sets to explore econometrics in financial markets.
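A log return is the natural logarithm of the ratio of consecutive prices. A minimal sketch in Python of that calculation, outside Eviews; the function name and price series are invented for illustration.

```python
import math


def log_returns(prices: list[float]) -> list[float]:
    """r_t = ln(P_t / P_(t-1)) for each pair of consecutive prices."""
    if any(p <= 0 for p in prices):
        raise ValueError("prices must be positive")
    return [math.log(curr / prev) for prev, curr in zip(prices, prices[1:])]


nav = [100.0, 110.0, 99.0, 104.5]  # invented fund values
returns = log_returns(nav)
print([round(r, 4) for r in returns])
```

One convenient property is that log returns add up over time: their sum equals the log of the last price divided by the first.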
Example of Descriptive8:08
Generate descriptive statistics for fund returns using Eviews, including mean, median, maximum, standard deviation, and Jarque-Bera; interpret kurtosis and volatility to assess risk.
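The Jarque-Bera statistic combines skewness and kurtosis into one normality check: JB = n/6 × (S² + (K − 3)²/4). A minimal sketch in Python using population (divide-by-n) moments; small-sample conventions differ between packages, so values may not match a given tool exactly, and the sample data are invented.

```python
def jarque_bera(values: list[float]) -> tuple[float, float, float]:
    """Return (skewness, kurtosis, JB) using divide-by-n central moments."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two observations")
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    if m2 == 0:
        raise ValueError("values have zero variance")
    m3 = sum((v - mean) ** 3 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    skewness = m3 / m2 ** 1.5
    kurtosis = m4 / m2 ** 2          # a normal distribution has kurtosis 3
    jb = n / 6 * (skewness ** 2 + (kurtosis - 3) ** 2 / 4)
    return skewness, kurtosis, jb


s, k, jb = jarque_bera([-2.0, -1.0, 0.0, 1.0, 2.0])  # invented, symmetric data
print(round(s, 4), round(k, 4), round(jb, 4))
```

A JB value near zero is consistent with normally distributed returns; large values point to skewed or fat-tailed returns.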
Interpretation and Graphs5:50
Learn to interpret descriptive statistics and their investment significance. Use Jarque-Bera and standard deviation to assess volatility and risk, illustrated by HDFC Equity Fund and HDFC Mid Cap Opportunities Fund.
Interpretation and Graphs Continues7:02
Explore volatility analysis by generating spike-based volatility graphs, interpreting Jarque-Bera and standard deviation to compare fund risk, and linking descriptive statistics to econometric data interpretation.
Generating Log Returns and Descriptive6:34
Apply descriptive analysis to stock indices by generating log returns and comparing close prices of BSE Sensex, mid cap, and small cap using Eviews.
Generating Log Returns and Descriptive Continue7:49
Generate log returns for large cap, mid cap, and small cap indices in Eviews, then compute descriptive statistics and compare volatility across groups.
Example of Interpretations11:11
Volatility Graphs9:31
Generating returns Interpretation and Graphs10:05
Analyze foreign-exchange data from AUD, GBP, and euro, computing log returns and generating descriptive statistics and volatility graphs. Note Brexit-driven moves and their impact on Jarque-Bera and kurtosis.
Generating returns Interpretation Continues8:48
Explore how the Brexit effect drives GBP and euro volatility and skewness, shown through descriptives and volatility graphs, with notes on correlation and regression modeling.
Basic Correlation Theory11:38
Define correlation as an indicator of the relationship between two variables. Show that r lies between -1 and 1, with its sign marking positive or negative correlation and a value of zero marking no linear correlation.
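The bounds on r follow from its definition as the covariance divided by the product of the standard deviations. A minimal sketch of the Pearson coefficient in plain Python; the function name and sample data are invented for illustration.

```python
import math


def pearson_r(xs: list[float], ys: list[float]) -> float:
    """Pearson correlation: sum of co-deviations over the root of the
    product of the squared deviations."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length series with at least two points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant series")
    return sxy / math.sqrt(sxx * syy)


print(pearson_r([1, 2, 3], [2, 4, 6]))  # perfectly increasing relationship
print(pearson_r([1, 2, 3], [6, 4, 2]))  # perfectly decreasing relationship
print(pearson_r([1, 2, 3], [1, 2, 1]))  # no linear relationship
```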
Generating Correlation Matrix in Eviews8:53
Learn to generate a correlation matrix in EViews using log returns for five funds, then interpret the results to assess diversification and investment viability.
Generating Correlation Matrix in Eviews Continues7:22
Mutual Funds Correlation Matrix Percentage8:43
Analyze the correlation matrix of sectoral and non-sectoral mutual funds to identify high correlations within sector funds and low or negative links with ELSS, diversified equity, and mid-cap funds.
Scatter Plots Using Eviews10:36
Learn to create scatter plots in Eviews to visualize positive, negative, and zero correlations, interpret regression lines, and align visuals with the correlation matrix.
Generating Correlation Matrix11:42
Analyze the correlation of Sensex stocks by computing a three by three correlation matrix from log returns of Reliance, Infosys, and the BSE Sensex, to identify the best investment scrip.
Scatter Plots and Volatility Graphs12:36
Explore scatter plots and volatility graphs to compare Reliance, Infosys, and the BSE Sensex, identify weak and strong positive correlations, and view regression lines for insight.
Generating Correlation Matrix and Interpretations 10:33
Generate a correlation matrix for a multi-asset price dataset, calculate logarithmic returns, and interpret relationships among gold, natural gas, and the Swiss franc to inform hedging and portfolio decisions.
Generating Correlation Interpretations 8:34
Learn to generate log returns and interpret a correlation matrix to build risk-aware portfolio combinations using gold, the Swiss franc, natural gas, and the Nifty.
Generating Correlation Interpretations Continues 5:41
Analyze correlation interpretations to guide risk management, diversification, and hedging decisions across gold, the Swiss franc, and the Nifty via derivatives.
Scatter Plots 11:51
Explore scatter plots and a scatter plot matrix of multi-asset returns, including Nifty, gold, natural gas, and Swiss franc, showing regression lines and near-unity correlations.
Working on Scatter Plots 12:07
Explore volatility patterns using scatter plots and volatility graphs, highlighting Swiss franc and natural gas spikes. Assess the relative safety of the Nifty and learn to present these insights clearly.
Basic Regression Modelling Theory 3:00
Learn the basics of linear regression (y = mx + c), with x as the predictor and y as the response, including simple, multiple, logistic, and polynomial forms and statistics such as the t-statistic, p-value, and R-squared.
Generating Returns and Estimation Output 11:24
More on Generating Returns 5:46
Explore regression analysis of returns generated from stock data, detailing the dependent variable, constant, coefficients, and outputs such as p-values, R-squared, the F-statistic, and the Akaike, Schwarz, and Hannan-Quinn information criteria.
Understanding Estimation Output 6:56
Learn to read the regression equation and estimation output, and interpret t-statistics, p-values, and R-squared to assess predictor significance using stock returns for Reliance and the BSE Sensex.
Understanding Estimation Output Continues 6:11
Example of Interpretations 11:11
Analyze descriptive statistics and volatility patterns across the BSE Sensex, mid-cap, and small-cap indices, using the mean, standard deviation, and Jarque-Bera statistic to inform investment decisions.
Generating Estimation Output 11:43
Generate and interpret an ordinary least squares estimation output for Tata Motors log returns against Sensex movements using EViews, including the regression equation, t-statistics, and R-squared insights.
Interpretations and Volatility Scatter Plots 8:10
Interpret the regression output and the significance of the coefficients, compare the correlation graphs for Tata Motors and the Sensex, and discuss R-squared and volatility factors.
More on Volatility Scatter Plots 5:34
Analyze volatility scatter plots and regression lines in EViews to compare the volatility of stocks such as Reliance and Tata Motors, with emphasis on interpreting correlation in one year of financial data.
Estimation Output Interpretations and Graphs 9:56
Analyze estimation outputs and graphs for a mutual fund case study, comparing SBI Pharma Fund to the BSE Healthcare index using log returns and least squares regression.
Estimation Output Interpretations and Graphs Continues 8:42
Interpret estimation outputs to reveal insignificant predictors, near-zero R-squared, and flat regression plots, guiding judgment on best-fit models in predictive analytics using R, Minitab, SPSS, and SAS.
Example 3 - NAV Price Study 9:34
Working on Volatility Graphs 7:00
Analyze volatility graphs showing spikes from news events like Fed rate delays, Brexit, and Chinese market swings, and explain why the models are not a good fit for predicting fund movements.
Correlation Matrix 8:48
Explore how the GBP and euro influence the Australian dollar through correlation matrices and regression equations, using log returns and EViews to compare bidirectional effects.
Correlation Matrix Continues 9:20
Explore regression-based estimation outputs for AUD, GBP, and the euro, detailing coefficients and how GDP and other factors influence currency values.
Example 4 - Estimation Output 8:45
Investigate how the Australian dollar, euro, and pound sterling interact through regression equations and estimation outputs, using scatter plots and regression lines to compare reserve currency roles against the US dollar.
Basic Regression Modelling 8:11
Basic Regression Modelling Continues 7:42
Clean the data by removing observations and reimporting, then run a regression with P/E, PEG, and growth as variables. PEG and growth are insignificant, and R-squared is low.
Interpretations and Scatterplot Analysis 7:30
Study how the Swiss franc and natural gas influence gold through regression analysis on log returns, and interpret the estimation output in EViews, which shows a modest R-squared of 6%.
More on Scatterplot Analysis 7:02
Analyze scatterplot matrices and regression outputs to explore relationships among gold, the Swiss franc, and natural gas. The Swiss franc is the only significant independent variable for predicting gold, with an R-squared of 6%.
Equation Estimation 8:18
Estimate a multiple regression of gold prices on the Singapore Nifty, natural gas, and the Swiss franc, updating the equation and noting that the Swiss franc significantly affects gold while the others show limited impact.
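
The steps these EViews lectures repeat (convert prices to log returns, measure correlation, then fit a least-squares line y = mx + c) can be sketched in a few lines of code. The following is a minimal illustration in plain Python rather than EViews, using invented prices. The function names and the data are hypothetical and are not part of the course material.

```python
import math

def log_returns(prices):
    """Return ln(P_t / P_{t-1}) for each consecutive pair of prices."""
    return [math.log(b / a) for a, b in zip(prices, prices[1:])]

def mean(xs):
    return sum(xs) / len(xs)

def pearson_r(x, y):
    """Pearson correlation coefficient; always lies between -1 and 1."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def simple_ols(x, y):
    """Fit y = c + m*x by ordinary least squares; return (c, m, r_squared)."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    m = sxy / sxx
    c = my - m * mx
    # With a single predictor, R-squared equals the squared correlation.
    return c, m, pearson_r(x, y) ** 2

# Hypothetical prices, invented for illustration only.
index_prices = [100.0, 102.0, 101.0, 105.0, 107.0, 106.0]
stock_prices = [50.0, 51.5, 50.5, 53.5, 55.0, 54.0]

r_index = log_returns(index_prices)
r_stock = log_returns(stock_prices)

print("correlation:", pearson_r(r_index, r_stock))
print("intercept, slope, R-squared:", simple_ols(r_index, r_stock))
```

A low R-squared from such a fit, like the 6% reported for the gold regression above, means the predictor explains little of the variation in the response even when its coefficient is statistically significant.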

Requirements

Basic Understanding of Statistics: Familiarity with basic statistical concepts such as mean, median, mode, standard deviation, and hypothesis testing.
Basic Knowledge of Programming: Some experience with programming concepts, especially in R, is beneficial but not mandatory.
Access to Software Tools: Participants should have access to R, Minitab, SPSS, and SAS software. Instructions for downloading and installing these tools will be provided.
Computer Skills: Proficiency in using a computer, including managing files, installing software, and navigating operating systems.
Mathematical Skills: A basic understanding of algebra and calculus can be helpful for grasping the mathematical foundations of predictive modeling.
English Proficiency: Proficiency in English to follow the course instructions, lectures, and reading materials.

Description

Introduction

Welcome to the comprehensive course "Predictive Analytics & Modeling with R, Minitab, SPSS, and SAS". This course is meticulously designed to equip you with the knowledge and skills needed to excel in data analysis and predictive modeling using some of the most powerful tools in the industry. Whether you are a beginner or an experienced professional, this course offers in-depth insights and hands-on experience to help you master predictive analytics.

Section 1: R Studio UI and R Script Basics

This section introduces you to the R programming environment and the basics of using R Studio. You will learn how to download, install, and navigate R Studio, along with understanding basic data types, vectors, matrices, lists, and data frames in R. The section also covers decision making, conditional statements, loops, functions, and the power of ggplot2 for data visualization. By the end of this section, you will have a solid foundation in R programming and the ability to perform essential data manipulation and visualization tasks.

Section 2: Project on R - Card Purchase Prediction

In this section, you will embark on a practical project to predict card purchases using R. The journey begins with an introduction to the project and importing the dataset. You will then delve into calculating Information Value (IV), plotting variables, and data splitting. The course guides you through building and optimizing a logistic regression model, creating a lift chart, and evaluating model performance on both training and test sets. Additionally, you will learn to save models in R and implement decision tree models, including making predictions and assessing their performance. This hands-on project is designed to provide you with real-world experience in predictive modeling with R.

Section 3: R Programming for Data Science - A Complete Course to Learn

Dive deeper into R programming with this comprehensive section that covers everything from the history of R to advanced data science techniques. You will explore data types, basic operations, data reading, debugging, control structures, and functions. The section also includes scoping rules, looping, simulation, and extensive plotting techniques. You will learn about date and time handling, regular expressions, classes, methods, and more. This section is designed to transform you into a proficient R programmer capable of tackling complex data science challenges.

Section 4: Statistical Analysis using Minitab - Beginners to Beyond

This section focuses on statistical analysis using Minitab, guiding you from beginner to advanced levels. You will start with an introduction to Minitab and types of data, followed by measures of dispersion, descriptive statistics, data sorting, and various graphical representations like histograms, pie charts, and scatter plots. The section also covers probability distributions, hypothesis testing, sampling, measurement system analysis, process capability analysis, and more. By the end of this section, you will be adept at performing comprehensive statistical analyses using Minitab.

Section 5: Predictive Analytics & Modeling using Minitab

Building on your statistical knowledge, this section delves into predictive modeling with Minitab. You will explore non-linear regression, ANOVA, and control charts, along with understanding and interpreting results. The section includes practical examples and exercises on descriptive statistics, correlation techniques, regression modeling, and multiple regression. You will also learn about logistic regression, generating predicted values, and interpreting complex datasets. This section aims to enhance your predictive modeling skills and enable you to derive actionable insights from data.

Section 6: SPSS GUI and Applications

In this section, you will learn about the graphical user interface of SPSS and its applications. You will cover the basics of using SPSS, importing datasets, and understanding mean and standard deviation. The section also explores various software menus, user operating concepts, and practical implementation of statistical techniques. By the end of this section, you will be proficient in using SPSS for data analysis and interpretation.

Section 7: Predictive Analytics & Modeling with SAS

This section introduces you to SAS Enterprise Miner for predictive analytics and modeling. You will learn how to select SAS tables, create input data nodes, and utilize metadata advisor options. The section covers variable selection, data partitioning, transformation of variables, and various modeling techniques, including neural networks and regression models. You will also explore SAS coding and create ensemble diagrams. This section provides a thorough understanding of using SAS for complex predictive analytics tasks.

Conclusion

"Predictive Analytics & Modeling with R, Minitab, SPSS, and SAS" is a comprehensive course designed to provide you with the skills and knowledge needed to excel in the field of data analytics. From foundational programming in R to advanced statistical analysis in Minitab, SPSS, and SAS, this course covers all the essential tools and techniques. By the end of the course, you will be equipped to handle real-world data challenges and make data-driven decisions with confidence. Enroll now and take the first step towards mastering predictive analytics!

Who this course is for:

Data Analysts: Seeking to enhance their predictive modeling skills using industry-standard tools.
Business Analysts: Interested in leveraging predictive analytics to make data-driven decisions.
Statisticians: Looking to apply statistical models to predict outcomes.
Researchers: Wanting to use predictive modeling in their research projects.
Graduate Students: Pursuing studies in data science, statistics, or related fields.
Professionals: From diverse domains interested in using predictive analytics for problem-solving.
Anyone Interested: In learning and applying predictive modeling techniques using R, Minitab, SPSS, and SAS.

Course content

R Studio UI and R Script Basics • 27 lectures • 3hr 45min

Project on R - Card Purchase Prediction • 13 lectures • 1hr 55min

R Programming for Data Science - A Complete Course to Learn • 45 lectures • 5hr 49min

Statistical Analysis using Minitab - Beginners to Beyond • 34 lectures • 4hr 17min

Predictive Analytics & Modeling using Minitab • 111 lectures • 15hr 41min

SPSS GUI and Applications • 7 lectures • 58min

Predictive Analytics & Modeling with SAS • 60 lectures • 9hr 11min

Predictive Modeling Training • 17 lectures • 1hr 37min

EViews - Introductory Econometrics Modeling • 46 lectures • 6hr 34min
