Introduction To Data Science

Use the R Programming Language to execute data science projects and become a data scientist.
4.3 (94 ratings)
2,561 students enrolled
$50
  • Lectures 28
  • Length 6 hours
  • Skill Level Intermediate Level
  • Languages English
  • Includes Lifetime access
    30 day money back guarantee!
    Available on iOS and Android
    Certificate of Completion

About This Course

Published 2/2015 English

Course Description

Use the R Programming Language to execute data science projects and become a data scientist. Implement business solutions, using machine learning and predictive analytics.

The R language provides a way to tackle day-to-day data science tasks, and this course will teach you how to apply the R programming language and useful statistical techniques to everyday business situations.

With this course, you'll be able to use the visualizations, statistical models, and data manipulation tools that modern data scientists rely upon daily to recognize trends and suggest courses of action.

Understand Data Science to Be a More Effective Data Analyst

● Use R and RStudio

● Master Modeling and Machine Learning

● Load, Visualize, and Interpret Data

Use R to Analyze Data and Come Up with Valuable Business Solutions

This course is designed for those who are analytically minded and are familiar with basic statistics and programming or scripting. Some familiarity with R is strongly recommended; otherwise, you can learn R as you go.

You'll learn applied predictive modeling methods, as well as how to explore and visualize data, how to use and understand common machine learning algorithms in R, and how to relate machine learning methods to business problems.

All of these skills will combine to give you the ability to explore data, ask the right questions, execute predictive models, and communicate your informed recommendations and solutions to company leaders.

Contents and Overview

This course begins with a walk-through of a template data science project before diving into the R statistical programming language.

You will be guided through modeling and machine learning. You'll use machine learning methods to create algorithms for a business, and you'll validate and evaluate models.

You'll learn how to load data into R and learn how to interpret and visualize the data while dealing with variables and missing values. You’ll be taught how to come to sound conclusions about your data, despite some real-world challenges.

By the end of this course, you'll be a better data analyst because you'll have an understanding of applied predictive modeling methods, and you'll know how to use existing machine learning methods in R. This will allow you to work with team members in a data science project, find problems, and come up with solutions.

You’ll complete this course with the confidence to correctly analyze data from a variety of sources, while sharing conclusions that will make a business more competitive and successful.

The course will teach students how to use existing machine learning methods in R, but will not teach them how to implement these algorithms from scratch. Students should be familiar with basic statistics and basic scripting/programming.

What are the requirements?

  • You should be familiar with basic scripting or programming, and basic statistics.
  • Familiarity with R and RStudio is a plus. We will teach you how to start with R and RStudio, but you should install them on your computer before starting this course.

What am I going to get from this course?

  • Start and execute the steps of a data science project, from project definition to model evaluation.
  • Use machine learning techniques to build effective predictive models.
  • Learn how to find and correct common problems found in real world data.

What is the target audience?

  • The course is for analytically minded students who are looking for an introduction to applied predictive modeling methods, and who want to learn about what goes into successful data science projects. The course will teach students how to use existing machine learning methods in R, but will not teach them how to implement these algorithms from scratch. Students should be familiar with basic statistics and basic scripting/programming. Some familiarity with R is helpful; otherwise, students should be willing to learn R as they go. We will direct you to ready-to-go implementations and additional references throughout the course.

Curriculum

Section 1: Course Overview
04:20

The course introduction describes what to expect from Introduction to Data Science and helps you decide if the course is for you. The examples (available here: http://winvector.github.io/IntroductionToDataScience/ ) are mostly worked using R and RStudio, which are freely available from http://cran.r-project.org and http://www.rstudio.com . We do require some familiarity with R and statistics (though a later lesson will discuss starting with R and RStudio).

Walk-through of a data science project
11:38
Starting with R and data
16:12
Section 2: Modeling and Machine Learning
Mapping Business to Machine Learning Tasks
06:22
Validating Models
17:01
1 page

Your feedback is valuable, both for us developing courses and for helping other students pick courses.

Before we move on to the machine learning parts of data science, we ask that you consider adding a fair review of the course. To do this, use the "back to course" link (should be on the top left when viewing the course) and click the "write review" link (should be in the top right corner).

This is just an ask. If you would prefer not to review the course until the end (or at all), we understand.

Naive Bayes: background
13:12
Naive Bayes: practice
14:14
Linear Regression: background
19:20
Linear Regression: practice
18:37
Logistic Regression: background
07:17
Logistic Regression: practice
16:52
Decision Trees and Random Forest: background
06:55
Random Forest: practice
10:23
Generalized Additive Models
07:55
Support Vector Machines
19:39
Gradient Boosting
13:36
Regularization for Linear and Logistic Regression
08:32
Evaluating Models
19:16
Section 3: Data
Loading Data in R
19:28
Visualizing Data
15:17
Missing Values
12:09
The Shape of Data
19:37
Dealing with Categorical Variables
19:56
Useful Data Transformations
14:13
Section 4: Moving On
Recommended Books
05:22
Further Topics
12:38
Next Steps
01:48

Instructor Biography

Nina Zumel, Data Scientist, Win-Vector LLC

Nina Zumel, PhD, has over 10 years of experience in research, machine learning, and data science. She is a co-author of the popular book Practical Data Science with R, co-author of the EMC data scientist certification program, and blogs often on statistics, data science, and data visualization.

Instructor Biography

John Mount, Data Scientist, Win-Vector LLC

I am a principal at the data science consulting firm Win-Vector LLC. Win-Vector LLC specializes in data science research, implementation, and training. I have over 10 years of experience in research, teaching, machine learning, and data science.

I am co-author of the popular book Practical Data Science with R, and I blog often on mathematics, programming, machine learning, and optimization on the Win-Vector blog.

My professional experience includes managing a data science group for Shopping dot com (an eBay company), working in price optimization for Rapt (acquired by Microsoft), and applying machine learning at web scale for Kosmix (acquired by Walmart online). My original fields of study were mathematics (AB UC Berkeley) and computer science (Ph.D. Carnegie Mellon) with a heavy emphasis on probability theory.
