Data Science Career Guide - Interview Preparation

Prepare for your Data Science Interview with this full guide on a career in Data Science including practice questions!

Highest Rated

Created by Jose Portilla, Pierian Training

Last updated 9/2019

English

What you'll learn

Create a great data science resume!
Understand various positions and titles available in the data science ecosystem.
Get practice with probability and statistics interview questions.
Build an understanding of good experiment design.
Get practice with SQL interview questions.

Course content

12 sections • 110 lectures • 4h 0m total length

Course Overview Lecture (9:18)
Curriculum Overview (4:00)
Frequently Asked Questions (0:09)

Why choose a career in Data Science? (2:40)
Discover why data science offers fulfilling, high-demand work across industries by applying math and coding to extract insights, with strong pay and growing job opportunities.
Data Science is Interdisciplinary (2:34)
Data Science Position and Titles (4:03)
Explore the spectrum of data science roles beyond data scientist, including product analyst, business intelligence engineer, machine learning engineer, and data engineer, and how job postings define required skills.
Thoughts on Higher Education (5:00)

Introduction to Interview Preparation (0:45)
Technical Tools of the Trade (6:31)
Delve into programming languages, frameworks, and software tools in data science, and learn to pick one language (often Python) plus SQL and key libraries like pandas and scikit-learn.
Theory Knowledge (1:52)
Machine Learning Knowledge (2:13)
Explore core machine learning concepts for data science roles, including supervised and unsupervised learning, key algorithms, model validation, regularization, and basics of natural language processing, with practical resources.
Software Knowledge (2:54)
How do I know when I'm ready? (4:55)

Introduction to Probability Interview Questions (1:05)
Explore common probability theory interview questions and practice with coins and dice to prepare for data science technical screenings.
Probability Question 1 (0:43)
Explore the expected number of flips required for a fair coin to yield two consecutive identical results, whether two heads or two tails, in this probability theory interview question.
Solution for Probability Question 1 (4:50)
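One way to check an answer to this question is to compare a closed-form argument with a simulation. The sketch below is not taken from the lecture; the function names are illustrative. It relies on the fact that, after the first flip, each new flip matches the previous one with probability 1/2.

```python
import random


def flips_until_repeat(rng: random.Random) -> int:
    """Flip a fair coin until two consecutive flips match; return the flip count."""
    previous = rng.random() < 0.5
    flips = 1
    while True:
        current = rng.random() < 0.5
        flips += 1
        if current == previous:
            return flips
        previous = current


def exact_expected_flips() -> float:
    # One initial flip, then a geometric wait (success probability 1/2)
    # for a flip that matches its predecessor: 1 + 1 / (1/2).
    return 1 + 1 / 0.5


rng = random.Random(0)  # fixed seed so the estimate is reproducible
trials = 100_000
estimate = sum(flips_until_repeat(rng) for _ in range(trials)) / trials
print(f"exact: {exact_expected_flips()}, simulated: {estimate:.3f}")
```

The simulated average should land close to the exact value, which is a useful sanity check to mention when whiteboarding.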
Probability Question 2 (0:26)
Explore the probability of obtaining a total of four when rolling two dice by adding the outcomes and calculating the likelihood.
Solution for Probability Question 2 (1:00)
Compute the probability of rolling a sum of four with two dice by counting three favorable outcomes—(1,3), (2,2), (3,1)—out of 36 total outcomes, giving 1/12.
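The counting argument above can be reproduced by enumerating all 36 outcomes. This is a minimal sketch with illustrative variable names.

```python
from fractions import Fraction
from itertools import product

# Every ordered outcome of rolling two six-sided dice.
outcomes = list(product(range(1, 7), repeat=2))

# Outcomes whose faces add up to four.
favorable = [roll for roll in outcomes if sum(roll) == 4]

probability = Fraction(len(favorable), len(outcomes))
print(favorable, probability)
```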
Probability Question 3 (0:35)
Solution for Probability Question 3 (0:53)
Probability Question 4 (1:49)
Solution for Probability Question 4 (2:15)
Probability Question 5 (0:47)
Learn to compute the probability of seeing a car in ten minutes when the chance in thirty minutes is 0.95, assuming a constant probability.
Solution for Probability Question 5 (2:32)
Assume the probability of seeing a car is constant over time. Let P be the probability of seeing no car in a ten-minute window. Thirty minutes is three such windows, so solve 1−P^3=0.95 for P; the ten-minute car probability is then 1−P.
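The arithmetic can be sketched in a few lines. The variable names are illustrative, and the three ten-minute windows are treated as independent, as the constant-probability assumption implies.

```python
p_car_30 = 0.95  # chance of seeing at least one car in thirty minutes

# No car in thirty minutes means no car in each of three ten-minute windows.
p_no_car_10 = (1 - p_car_30) ** (1 / 3)
p_car_10 = 1 - p_no_car_10

print(f"P(no car in 10 min) = {p_no_car_10:.4f}")
print(f"P(car in 10 min)    = {p_car_10:.4f}")
```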
Probability Question 6 (0:39)
Solution for Probability Question 6 (2:22)
Note about Probability Interview Question 7 (0:15)
Probability Question 7 (0:58)
Solution for Probability Question 7 (5:06)
Probability Question 8 (0:39)
Solution for Probability Question 8 (2:26)
Probability Question 9 (0:35)
Solution for Probability Question 9 (1:15)

Introduction to Statistics Interview Questions (0:44)
Statistics Interview Question 1 (1:27)
Assess the probability that it is raining in Seattle based on three independent friends who may lie or tell the truth, starting from a 25 percent prior.
Solution for Statistics Interview Question 1 (4:31)
Use a probability tree to determine the likelihood it is raining in Seattle given three independent reports. Leverage truth and lie probabilities to compute the conditional rain chance.
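The probability tree collapses to a single application of Bayes' rule. The sketch below uses the 25 percent prior from the question; the truth-telling probability of 2/3 and the scenario in which all three friends say "yes, it is raining" are assumptions borrowed from the common version of this puzzle, not values stated here.

```python
from fractions import Fraction


def posterior_rain(prior, p_truth, n_yes):
    """P(rain | n_yes independent friends all say it is raining)."""
    likelihood_rain = p_truth ** n_yes        # every friend told the truth
    likelihood_dry = (1 - p_truth) ** n_yes   # every friend lied
    numerator = prior * likelihood_rain
    return numerator / (numerator + (1 - prior) * likelihood_dry)


# Assumed inputs: prior from the question, p_truth = 2/3 is hypothetical.
posterior = posterior_rain(Fraction(1, 4), Fraction(2, 3), 3)
print(posterior)
```

Swapping in the truth and lie probabilities given in the actual question changes only the arguments, not the structure.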
Statistics Interview Question 2 (0:52)
Solution for Statistics Interview Question 2 (3:20)
Statistics Interview Question 3 (0:36)
Explore the difference between a type one error and a type two error in statistics, and practice explaining it clearly by whiteboarding your answer.
Solution for Statistics Interview Question 3 (4:11)
Explore type one and type two errors in hypothesis testing—false positives and false negatives—when the null hypothesis is true but rejected, or false yet not rejected, with toothpaste examples.
Statistics Interview Question 4 (1:20)
Apply Bayes' theorem to a medical test: with 1% base rate, 99% sensitivity, and 99% specificity, a positive result indicates a 50% chance of infection.
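The 50% figure follows directly from Bayes' theorem. A minimal sketch, with an illustrative function name:

```python
def positive_predictive_value(base_rate, sensitivity, specificity):
    """P(infected | positive test) via Bayes' theorem."""
    true_positive = sensitivity * base_rate
    false_positive = (1 - specificity) * (1 - base_rate)
    return true_positive / (true_positive + false_positive)


ppv = positive_predictive_value(base_rate=0.01, sensitivity=0.99, specificity=0.99)
print(f"P(infected | positive) = {ppv:.3f}")
```

The true positives (99% of the 1% who are infected) are exactly balanced by the false positives (1% of the 99% who are healthy), which is why the posterior is one half.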
Solution for Statistics Interview Question 4 (3:03)
Statistics Interview Question 5 (1:37)
Explore how to determine a motor guarantee using a normal distribution with mean 10 years and standard deviation 2 years, targeting the 3% failure tail via a z-table.
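Instead of a printed z-table, the same lookup can be done with Python's standard library. This sketch assumes the goal is a guarantee period within which only 3% of motors are expected to fail; the variable names are illustrative.

```python
from statistics import NormalDist

# Motor lifetime model from the question: mean 10 years, std dev 2 years.
lifetime = NormalDist(mu=10, sigma=2)

# Lifetime below which 3% of motors fall (the lower 3% tail).
guarantee_years = lifetime.inv_cdf(0.03)

# The matching z-score, i.e. what a z-table lookup would give.
z_score = (guarantee_years - lifetime.mean) / lifetime.stdev

print(f"z = {z_score:.2f}, guarantee = {guarantee_years:.2f} years")
```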
Solution for Statistics Interview Question 5 (6:00)

Introduction to Product Design and Metrics (0:45)
Learn to tackle open-ended product design and metrics questions in data science interviews by thinking aloud, framing answers as a collaborative conversation, and analyzing features tied to company metrics.
Product Design and Metrics - Interview Question 1 (0:50)
Product Design and Metrics - Interview Question 1 - Solution (4:24)
Product Design and Metrics - Interview Question 2 (1:36)
Product Design and Metrics - Interview Question 2 - Solution (2:31)
Product Design and Metrics - Interview Question 3 (0:47)
Explore how A/B testing with a new search algorithm can yield higher advertising revenue despite less relevant results, and examine potential causes within product design and metrics.
Product Design and Metrics - Interview Question 3 - Solution (2:08)
Explain why a new search algorithm can raise advertising revenue despite less relevant results by increasing searches and potentially more relevant ads served by a separate ads algorithm.
Product Design and Metrics - Interview Question 4 (0:54)
Product Design and Metrics - Interview Question 4 - Solution (1:25)
Product Design and Metrics - Interview Question 5 (2:30)
Evaluate product design and metrics by comparing mpg upgrades for Technology A on car X and Technology B on car Y, with a 50/50 country split, to maximize gasoline savings.
Product Design and Metrics - Interview Question 5 - Solution (3:27)
Analyze two fuel-efficiency policies in a product design and metrics interview, showing policy B saves more gasoline countrywide by comparing mpg improvements with an average distance D.

Introduction to SQL Questions (1:20)
Data with SQL - Interview Question 1 (0:26)
Pause to review a sample SQL query and identify what's wrong with it. The solution follows in the next lecture.
Data with SQL - Interview Question 1 - Solution (1:11)
Data with SQL - Interview Question 2 (0:26)
Identify the error in the SQL query select ID trial date from payments group by ID, as presented in the data with SQL interview question that highlights grouping and selection issues.
Data with SQL - Interview Question 2 - Solution (1:03)
Learn how to fix a SQL group by error by applying an aggregate to non-grouped columns like trial date. Group by date when dealing with time-stamped values to clarify results.
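The fix can be tried out with Python's built-in sqlite3 module. The table layout, column names (id, trial_date), and rows below are hypothetical. Note that SQLite itself tolerates a bare non-aggregated column in a grouped query, whereas stricter engines such as PostgreSQL reject it, so the sketch shows only the corrected form with an aggregate.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE payments (id INTEGER, trial_date TEXT);
INSERT INTO payments VALUES
    (1, '2019-01-05'), (1, '2019-02-10'), (2, '2019-03-01');
""")

# Every selected column is either in GROUP BY or wrapped in an aggregate.
rows = conn.execute("""
    SELECT id, MAX(trial_date) AS latest_trial
    FROM payments
    GROUP BY id
    ORDER BY id
""").fetchall()

print(rows)
```

MAX is one reasonable aggregate here; MIN or COUNT would be equally valid depending on what the result is meant to show.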
Data with SQL - Interview Question 3 (0:32)
Analyze a flawed SQL query that selects user ID and avg(total) as average order total from invoices, using a HAVING clause with a count on order ID >= 1.
Data with SQL - Interview Question 3 - Solution (0:50)
Data with SQL - Interview Question 4 (1:15)
Write an SQL query to join the employees and managers tables on the managed_by foreign key. Retrieve all employees who are managed by Sandy Kim.
Data with SQL - Interview Question 4 - Solution (1:56)
Master SQL joins to solve an interview question: find employees managed by Sandy Kim. Build a join on employees.managed_by and managers.id, then filter where managers.name like 'Sandy Kim'.
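A runnable sketch of that join, using Python's built-in sqlite3 module. The table schemas and every row except the names Sandy Kim and Jane Doe are invented for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE managers (id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE employees (
    id INTEGER PRIMARY KEY,
    name TEXT,
    managed_by INTEGER REFERENCES managers(id)
);
INSERT INTO managers VALUES (1, 'Sandy Kim'), (2, 'Alex Roe');
INSERT INTO employees VALUES
    (1, 'Jane Doe', NULL), (2, 'Sam Lee', 1),
    (3, 'Pat Cruz', 1), (4, 'Lou Park', 2);
""")

# Join on the foreign key, then filter by the manager's name.
query = """
    SELECT employees.name
    FROM employees
    JOIN managers ON employees.managed_by = managers.id
    WHERE managers.name LIKE 'Sandy Kim'
    ORDER BY employees.name
"""
team = [row[0] for row in conn.execute(query)]
print(team)
```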
Data with SQL - Interview Question 5 (0:35)
Write and practice a query that retrieves all employees with no manager using the employees and managers tables, illustrated by Jane Doe's null manager.
Data with SQL - Interview Question 5 - Solution (1:11)
Retrieve all employees who have no manager with a SQL query that checks for NULL in the managed_by column, with no join required, noting variations across MySQL, PostgreSQL, Oracle, and SQL Server.
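A minimal sketch of the null check, run through Python's built-in sqlite3 module. The schema and rows are hypothetical apart from Jane Doe's null manager; `IS NULL` is standard SQL, so the query itself should carry over to the other engines.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE employees (id INTEGER PRIMARY KEY, name TEXT, managed_by INTEGER);
INSERT INTO employees VALUES
    (1, 'Jane Doe', NULL), (2, 'Sam Lee', 1), (3, 'Pat Cruz', 1);
""")

# NULL never equals anything (not even NULL), so "= NULL" matches no rows;
# IS NULL is the correct test.
unmanaged = [
    row[0]
    for row in conn.execute("SELECT name FROM employees WHERE managed_by IS NULL")
]
equals_null = conn.execute(
    "SELECT name FROM employees WHERE managed_by = NULL"
).fetchall()

print(unmanaged, equals_null)
```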

Introduction to Machine Learning Interview Questions (1:04)
Machine Learning Interview Question 1 (0:30)
Machine Learning Interview Question 1 - Solution (1:51)
Explains linear regression and its core assumptions—linearity between y and x and normally distributed residuals—and reviews common types like ordinary least squares and ridge and lasso.
Machine Learning Interview Question 2 (0:21)
Describe the logistic regression formula and how it enables binary classification.
Machine Learning Interview Question 2 - Solution (2:33)
Describe the logistic regression formula and how to use the logistic function to turn linear regression outputs into probabilities for binary classification, with a 0.5 cutoff.
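The idea can be sketched without any libraries: a linear score is passed through the logistic function, and the resulting probability is compared with the cutoff. The weights and inputs below are made up for illustration, not fitted to data.

```python
import math


def sigmoid(z: float) -> float:
    """Logistic function: maps any real number into (0, 1)."""
    return 1 / (1 + math.exp(-z))


def predict(features, weights, bias, cutoff=0.5):
    """Return (probability, class label) for one observation."""
    linear_score = bias + sum(w * x for w, x in zip(weights, features))
    probability = sigmoid(linear_score)
    return probability, int(probability >= cutoff)


# Hypothetical one-feature model: score = -1.0 + 1.0 * x
for x in (0.0, 1.0, 2.0):
    prob, label = predict([x], weights=[1.0], bias=-1.0)
    print(f"x={x}: p={prob:.3f}, class={label}")
```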
Machine Learning Interview Question 3 (0:21)
Machine Learning Interview Question 3 - Solution (2:41)
Explore how decision trees choose splits by maximizing information gain using entropy in a top-down approach from the root node, with alternatives like the Gini index.
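Entropy and information gain are short enough to compute by hand or in a few lines of code. The labels and candidate splits below are toy data chosen for illustration.

```python
import math
from collections import Counter


def entropy(labels) -> float:
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum(
        (count / total) * math.log2(count / total)
        for count in Counter(labels).values()
    )


def information_gain(parent, children) -> float:
    """Parent entropy minus the size-weighted entropy of the child nodes."""
    total = len(parent)
    weighted = sum(len(child) / total * entropy(child) for child in children)
    return entropy(parent) - weighted


parent = ["yes"] * 4 + ["no"] * 4
perfect_split = [["yes"] * 4, ["no"] * 4]
useless_split = [["yes", "yes", "no", "no"], ["yes", "yes", "no", "no"]]

print(information_gain(parent, perfect_split))
print(information_gain(parent, useless_split))
```

A tree builder would evaluate every candidate split this way and keep the one with the highest gain.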
Machine Learning Interview Question 4 (0:19)
Machine Learning Interview Question 4 - Solution (1:23)
Machine Learning Interview Question 5 (0:20)
Explain the difference between random forest and boosting tree algorithms such as gradient boosting, highlighting their approaches and use cases for machine learning interview preparation.
Machine Learning Interview Question 5 - Solution (1:37)
Machine Learning Interview Question 6 (0:39)
Machine Learning Interview Question 6 - Solution (0:49)
Machine Learning Interview Question 7 (0:24)
Describe how the support vector machine works in a general sense and illustrate the concept with a diagram to explain decision boundaries.
Machine Learning Interview Question 7 - Solution (2:07)
Explore how support vector machines find a hyperplane that maximizes the margin between classes, using support vectors to define the decision boundary, and how the kernel trick enables nonlinear classification.
Machine Learning Interview Question 8 (0:26)
Define overfitting in machine learning, discuss its causes, and outline ways to avoid it in practice.
Machine Learning Interview Question 8 - Solution (2:18)
Machine Learning Interview Question 9 (0:24)
Describe the differences between accuracy, precision, and recall in classification tasks. Understand these common metrics and how their definitions relate to model performance.
Machine Learning Interview Question 9 - Solution (2:19)
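The three metrics are easiest to tell apart when computed from the same confusion counts. This sketch uses the standard definitions and a small made-up set of binary labels; it is not drawn from the lecture.

```python
def classification_metrics(y_true, y_pred):
    """Accuracy, precision, and recall for binary labels (1 = positive)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == 1 and p == 1 for t, p in pairs)
    tn = sum(t == 0 and p == 0 for t, p in pairs)
    fp = sum(t == 0 and p == 1 for t, p in pairs)
    fn = sum(t == 1 and p == 0 for t, p in pairs)
    return {
        # Share of all predictions that were correct.
        "accuracy": (tp + tn) / len(pairs),
        # Of the predicted positives, how many were truly positive.
        "precision": tp / (tp + fp) if tp + fp else 0.0,
        # Of the actual positives, how many were found.
        "recall": tp / (tp + fn) if tp + fn else 0.0,
    }


y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 1, 0, 0, 0, 0]
metrics = classification_metrics(y_true, y_pred)
print(metrics)
```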
Machine Learning Interview Question 10 (0:17)
Machine Learning Interview Question 10 - Solution (2:10)
Learn to evaluate regression with mean absolute error, mean squared error, and root mean squared error. MAE averages absolute errors, MSE squares errors, RMSE preserves units.
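A short sketch of the three regression metrics on made-up values. The function name and data are illustrative.

```python
import math


def regression_errors(y_true, y_pred):
    """Return (MAE, MSE, RMSE) for paired actual and predicted values."""
    errors = [actual - predicted for actual, predicted in zip(y_true, y_pred)]
    mae = sum(abs(e) for e in errors) / len(errors)
    mse = sum(e ** 2 for e in errors) / len(errors)
    rmse = math.sqrt(mse)  # back in the same units as the target
    return mae, mse, rmse


y_true = [3.0, 5.0, 2.0, 7.0]
y_pred = [2.0, 5.0, 4.0, 8.0]
mae, mse, rmse = regression_errors(y_true, y_pred)
print(f"MAE={mae}, MSE={mse}, RMSE={rmse:.4f}")
```

Because MSE squares each error, the single error of size 2 contributes four of the six squared-error units here, which illustrates why MSE and RMSE penalize large misses more heavily than MAE.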

Introduction to Design of Experiments (0:35)
Explore the design of experiments and its statistical foundations, including p-values, statistical tests, and hypothesis tests. Review concepts before attempting the questions and consult the guidebook for resources.
Design of Experiments Interview Question 1 (0:36)
Design of Experiments Interview Question 1 - Solution (5:12)
Master design of experiments with A/B testing by selecting metrics like daily active users, setting up control and variant pages, randomizing samples, and evaluating hypotheses using alpha and p-values.
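The final step, comparing a p-value with alpha, can be sketched with a two-proportion z-test. This is one common choice of test, not necessarily the one used in the lecture; the conversion-style metric and all counts below are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def two_proportion_z_test(success_a, n_a, success_b, n_b):
    """Two-sided z-test for a difference between two proportions."""
    rate_a = success_a / n_a
    rate_b = success_b / n_b
    # Pooled rate under the null hypothesis of no difference.
    pooled = (success_a + success_b) / (n_a + n_b)
    std_error = sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (rate_b - rate_a) / std_error
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value


alpha = 0.05  # significance level chosen before the experiment

# Hypothetical results: control page (A) versus variant page (B).
z, p_value = two_proportion_z_test(success_a=200, n_a=2000, success_b=260, n_b=2000)

decision = "reject" if p_value < alpha else "fail to reject"
print(f"z={z:.2f}, p={p_value:.4f}: {decision} the null hypothesis")
```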
Design of Experiments Interview Question 2 (0:26)
Design of Experiments Interview Question 2 - Solution (3:54)
Design of Experiments Interview Question 3 (0:41)
Design of Experiments Interview Question 3 - Solution (3:10)
Design of Experiments Interview Question 4 (0:23)
Design of Experiments Interview Question 4 - Solution (2:32)

Requirements

An understanding of Probability and Statistics
Programming Experience in either Python or R
Experience in SQL
An understanding of Machine Learning Algorithms

Description

According to Glassdoor, a career as a Data Scientist is the best job in America! With an average base salary of over $120,000, not only do Data Scientists earn fantastic compensation, but they also get to work on some of the world's most interesting problems! Data Scientist positions are also rated as having some of the best work-life balances by Glassdoor. Companies are in dire need of filling out this unique role, and you can use this course to help you rock your Data Scientist Interview!

This course is designed to be the ultimate resource for getting a career as a Data Scientist. We'll start off with a general overview of the field and discuss multiple career paths, including Product Analyst, Data Engineering, Data Scientist, and many more. You'll understand the various opportunities available and the best way to pursue each of them. The course touches upon a wide variety of topics, including questions on probability, statistics, machine learning, product metrics, example data sets, A/B testing, market analysis, and much more!

The course will be full of real questions sourced from employees working at some of the world's top technology companies, including Amazon, Square, Facebook, Google, Microsoft, AirBnb and more!

The course contains real questions with fully detailed explanations and solutions. Not only is the course designed for candidates to achieve a full understanding of possible interview questions, but also for recruiters to learn about what to look for in each question response. For questions requiring coded solutions, fully commented code examples will be shown for both Python and R. This way you can focus on understanding the code in a programming language you're already familiar with, instead of worrying about syntax!

Who this course is for:

Anyone who wants to prepare for a Data Science Interview
Anyone interested in a career in Data Science

Course content by section

Course Overview: 3 lectures • 13min
Data Science Career Overview: 4 lectures • 14min
Data Science Interview Preparation: 6 lectures • 19min
The Data Science Interview Process: 4 lectures • 31min
Probability Theory Interview Questions: 20 lectures • 31min
Statistics Interview Questions: 11 lectures • 28min
Product Analysis and Business Metrics Interview Questions: 11 lectures • 21min
Working with Data Interview Questions: 11 lectures • 11min
Machine Learning Interview Questions: 21 lectures • 25min
Design of Experiments Interview Questions: 9 lectures • 17min
