
Explore predictive analytics by forecasting future outcomes using current data, and assess the validity of predictions amid changing conditions and varying time horizons.
Define the scope of application, set business objectives and constraints, and use input and output data to build a survival analytics model that reduces defaulters and increases profits.
Explore business understanding and use cases, from fraud detection in credit card transactions to precision farming with drones, balancing fraud minimization, customer convenience, and cost constraints.
Explore data understanding by identifying data types, scales of measurement, and key terms, and compare primary and secondary data collection techniques.
Explore data understanding by measuring data, analyzing it to build models, and using what-if analysis to optimize sales under constraints for informed management decisions.
Explore practical data understanding by comparing real-world examples to describe nominal, ordinal, interval, and ratio data, including absolute zero and subjective versus objective interpretations.
Differentiate quantitative data from qualitative data by showing numbers and continuous or count data. Identify qualitative data as descriptive or categorical and note its role in decision making.
Understand how to translate business reality into survey data collection, identify root causes, and design construct-based questions to gauge training enrollment and price elasticity of demand.
Apply design of experiments to food promotions by testing expiry timing and distance; same-day expiry with near customers yields higher redemptions, revealing how expiry and distance affect consumer response.
Understand bias and fairness in data science, avoiding sensitive variables and biased data collection. Emphasize business understanding, proper data collection, and disciplined use of algorithms.
Explore the standard normal distribution, z-scores, and Six Sigma concepts; relate mean and standard deviation to a symmetric bell curve, and learn z-score standardization with mu and sigma.
Explore the fourth moment and kurtosis using Python, linking excess kurtosis to normality, skewness, and business insights through retail and e-commerce examples.
Explore univariate data with bar plots and histograms in Python to convert raw numbers into meaningful insights, using bins, normal distribution checks, and simple examples like GMAT scores.
Explore creating a bivariate scatter plot in Python to relate waist circumference and adipose tissue, and interpret the correlation coefficient value of 0.81 and covariance.
This program will help aspirants getting into the field of data science understand the concepts of project management methodology. This will be a structured approach in handling data science projects. Importance of understanding business problem alongside understanding the objectives, constraints and defining success criteria will be learnt. Success criteria will include Business, ML as well as Economic aspects. Learn about the first document which gets created on any project which is Project Charter. The various data types and the four measures of data will be explained alongside data collection mechanisms so that appropriate data is obtained for further analysis. Primary data collection techniques including surveys as well as experiments will be explained in detail. Exploratory Data Analysis or Descriptive Analytics will be explained with focus on all the ‘4’ moments of business moments as well as graphical representations, which also includes univariate, bivariate and multivariate plots. Box plots, Histograms, Scatter plots and Q-Q plots will be explained. Prime focus will be in understanding the data preprocessing techniques using Python. This will ensure that appropriate data is given as input for model building. Data preprocessing techniques including outlier analysis, imputation techniques, scaling techniques, etc., will be discussed using practical oriented datasets.