Logistic Regression Practical Case Study

Breast Cancer detection using Logistic Regression
Rating: 4.6 out of 5 (2,422 ratings)
20,358 students
English [Auto]
How to build a Logistic Regression model for a Real-World Case Study
Work on Google Colab


  • Basic theory of Logistic Regression


Did you know that approximately 70% of data science problems involve classification and logistic regression is a common solution for binary problems?

Logistic regression has many applications in data science, but in the world of healthcare, it can really drive life-changing action.

In this SuperDataScience case study course, learn how to detect breast cancer by applying a logistic regression model on a real-world dataset and predict whether a tumor is benign (not breast cancer) or malignant (breast cancer) based off its characteristics.

By the end of the course, you will be able to build a logistic regression model to identify correlations between the following 9 independent variables and the class of the tumor (benign or malignant).

  • Clump thickness

  • Uniformity of cell size

  • Uniformity of cell shape

  • Marginal adhesion

  • Single epithelial cell

  • Bare Nuclei

  • Bland chromatin

  • Normal nucleoli

  • Mitoses

Logistic regression can identify important predictors of breast cancer using odds ratios and generate confidence intervals that provide additional information for decision-making. Model performance depends on the ability of the radiologists to accurately identify findings on mammograms.

Join AI expert Hadelin de Ponteves as you code the solution along with him in this 1-hour, 3-part case study:

Part 1: Data Preprocessing

  • Importing the dataset

  • Splitting the dataset into a training set and test set

Part 2: Training and Inference

  • Training the logistic regression model on the training set

  • Predicting the test set results

Part 3: Evaluating the Model

  • Making the confusion matrix

  • Computing the accuracy with k-Fold cross-validation

Testing your skills with practical courses is one of the best and most enjoyable ways to learn data science…and now we’re giving you that chance for FREE.

Plus, you’ll do it all using Google’s Colab free, browser-based notebook environment that runs completely in the cloud. It’s a game-changing interface that will save you time and supercharge your data science toolkit.

Click the ‘Enroll Now’ button to join Hadelin’s class today!

More about logistic regression:

Logistic regression is a method of statistical analysis used to predict a data value based on prior observations of a dataset. A logistic regression model predicts the value of a dependent variable by analyzing the relationship between one or more existing independent variables.

In data science, logistic regression is a Machine Learning algorithm used for classification problems and predictive analysis.

More real-world applications of logistical regression include:

  • Bankruptcy predictions

  • Credit scoring

  • Consumer behavior

  • Customer retention

  • Spam detection

Who this course is for:

  • Anyone interested in Machine Learning, AI or Data Science
  • Anyone who wants to learn how to make accurate predictions

Course content

5 sections10 lectures1h 5m total length
  • Getting started
  • Dataset + Code + Colab Link
  • Importing the libraries


AI Entrepreneur
Hadelin de Ponteves
  • 4.5 Instructor Rating
  • 259,009 Reviews
  • 1,342,092 Students
  • 29 Courses

Hadelin is the co-founder and CEO at BlueLife AI, which leverages the power of cutting edge Artificial Intelligence to empower businesses to make massive profits by innovating, automating processes and maximizing efficiency. Hadelin is also an online entrepreneur who has created top-rated educational e-courses to the world on topics such as Machine Learning, Artificial Intelligence and Blockchain.

Summary in numbers:

2M+ online courses sold.

1.3M students.

29 online courses created.

4.5/5 instructor average rating.

2 books written.

Instagram: @hadelin2p

Helping Data Scientists Succeed
Ligency Team
  • 4.5 Instructor Rating
  • 511,534 Reviews
  • 1,814,168 Students
  • 119 Courses

Hi there,

We are the Ligency PR and Marketing team. You will be hearing from us when new courses are released, when we publish new podcasts, blogs, share cheatsheets and more!

We are here to help you stay on the cutting edge of Data Science and Technology.

See you in class,


The Real People at Ligency