Introduction To Data Science
4.6 (103 ratings)
2,585 students enrolled
Use the R Programming Language to execute data science projects and become a data scientist.
Last updated 3/2015
English
Price: $50
30-Day Money-Back Guarantee
Includes:
  • 6 hours on-demand video
  • 1 Supplemental Resource
  • Full lifetime access
  • Access on mobile and TV
  • Certificate of Completion
What Will I Learn?
  • Start and execute the steps of a data science project, from project definition to model evaluation.
  • Use machine learning techniques to build effective predictive models.
  • Learn how to find and correct common problems found in real-world data.
Requirements
  • You should be familiar with basic scripting or programming, and basic statistics.
  • Familiarity with R and RStudio is a plus. We will teach you how to start with R and RStudio, but you should install them on your computer before starting this course.
Description

Use the R Programming Language to execute data science projects and become a data scientist. Implement business solutions, using machine learning and predictive analytics.

The R language provides a way to tackle day-to-day data science tasks, and this course will teach you how to apply the R programming language and useful statistical techniques to everyday business situations.

With this course, you'll be able to use the visualizations, statistical models, and data manipulation tools that modern data scientists rely upon daily to recognize trends and suggest courses of action.

Understand Data Science to Be a More Effective Data Analyst

● Use R and RStudio

● Master Modeling and Machine Learning

● Load, Visualize, and Interpret Data

Use R to Analyze Data and Come Up with Valuable Business Solutions

This course is designed for those who are analytically minded and are familiar with basic statistics and programming or scripting. Some familiarity with R is strongly recommended; otherwise, you can learn R as you go.

You'll learn applied predictive modeling methods, as well as how to explore and visualize data, how to use and understand common machine learning algorithms in R, and how to relate machine learning methods to business problems.

All of these skills will combine to give you the ability to explore data, ask the right questions, execute predictive models, and communicate your informed recommendations and solutions to company leaders.

Contents and Overview

This course begins with a walk-through of a template data science project before diving into the R statistical programming language.

You will be guided through modeling and machine learning. You'll use machine learning methods to create algorithms for a business, and you'll validate and evaluate models.

You'll learn how to load data into R, and how to interpret and visualize it while dealing with categorical variables and missing values. You’ll be taught how to come to sound conclusions about your data, despite some real-world challenges.

By the end of this course, you'll be a better data analyst because you'll have an understanding of applied predictive modeling methods, and you'll know how to use existing machine learning methods in R. This will allow you to work with team members on a data science project, find problems, and come up with solutions.

You’ll complete this course with the confidence to correctly analyze data from a variety of sources, while sharing conclusions that will make a business more competitive and successful.

The course will teach students how to use existing machine learning methods in R, but will not teach them how to implement these algorithms from scratch. Students should be familiar with basic statistics and basic scripting/programming.

Who is the target audience?
  • The course is for analytically minded students who are looking for an introduction to applied predictive modeling methods, and who want to learn about what goes into successful data science projects. The course will teach students how to use existing machine learning methods in R, but will not teach them how to implement these algorithms from scratch. Students should be familiar with basic statistics and basic scripting/programming. Some familiarity with R is helpful; otherwise, students should be willing to learn R as they go. We will direct you to ready-to-go implementations and additional references throughout the course.
Curriculum For This Course
28 Lectures 05:52:49
Course Overview
3 Lectures 32:10

The course introduction describes what to expect from Introduction to Data Science and helps you decide if the course is for you. The examples (available here: http://winvector.github.io/IntroductionToDataScience/ ) are mostly worked using R and RStudio, which are freely available software from http://cran.r-project.org and http://www.rstudio.com . We do require some familiarity with "R" and statistics (though a later lesson will discuss starting with R and RStudio).

Preview 04:20


Starting with R and data
16:12
Modeling and Machine Learning
16 Lectures 03:19:11
Mapping Business to Machine Learning Tasks
06:22

Validating Models
17:01

Your feedback is valuable, both for us developing courses and for helping other students pick courses.

Before we move on to the machine learning parts of data science, we ask that you consider adding a fair review of the course. To do this, use the "back to course" link (should be at the top left when viewing the course) and click the "write review" link (should be in the top right corner).

This is just an ask. If you would prefer not to review the course until the end (or at all), we understand.

Your Feedback is Valuable
1 page

Naive Bayes: background
13:12

Naive Bayes: practice
14:14

Linear Regression: background
19:20

Linear Regression: practice
18:37

Logistic Regression: background
07:17

Logistic Regression: practice
16:52

Decision Trees and Random Forest: background
06:55

Random Forest: practice
10:23

Generalized Additive Models
07:55

Support Vector Machines
19:39


Regularization for Linear and Logistic Regression
08:32

Evaluating Models
19:16
Data
6 Lectures 01:40:40
Loading Data in R
19:28

Visualizing Data
15:17

Missing Values
12:09

The Shape of Data
19:37

Dealing with Categorical Variables
19:56

Useful Data Transformations
14:13
Moving On
3 Lectures 19:48
Recommended Books
05:22

Further Topics
12:38

Next Steps
01:48
About the Instructor
Nina Zumel
4.6 Average rating
103 Reviews
2,585 Students
1 Course
Data Scientist, Win-Vector LLC

Nina Zumel, PhD, has over 10 years of experience in research, machine learning, and data science. She is a co-author of the popular book Practical Data Science with R, co-author of the EMC data scientist certification program, and blogs often on statistics, data science, and data visualization.

John Mount
4.2 Average rating
144 Reviews
6,511 Students
2 Courses
Data Scientist, Win-Vector LLC

I am a principal with the data science consulting firm Win-Vector LLC. Win-Vector LLC specializes in data science research, implementation, and training. I have over 10 years of experience in research, teaching, machine learning, and data science.

I am co-author of the popular book Practical Data Science with R, and I blog often on mathematics, programming, machine learning, and optimization on the Win-Vector blog.

My professional experience includes managing a data science group for Shopping dot com (an eBay company), working in price optimization for Rapt (acquired by Microsoft), and applying machine learning at web scale for Kosmix (acquired by Walmart online). My original fields of study were mathematics (AB UC Berkeley) and computer science (Ph.D. Carnegie Mellon) with a heavy emphasis on probability theory.