Udemy
    •  
    •  
    •  
    •  
    •  
    •  
    •  
    •  
Turn what you know into an opportunity and reach millions around the world.
Learn More
Your cart is empty.
Keep shopping
Baseball Database Queries with SQL and dplyr
Rating: 4.7 out of 5(690 ratings)
19,718 students

Baseball Database Queries with SQL and dplyr

Explaining the relationship between SQL and dplyr with baseball examples.
Created byCharles Redmond
Last updated 6/2015
English

What you'll learn

  • download the Lahman baseball database for Access
  • install dplyr and the Lahman baseball database package in R
  • select, filter, and order with SQL and dplyr
  • group and aggregate with SQL and dplyr
  • perform inner joins with SQL and dplyr

Course content

5 sections30 lectures3h 2m total length
  • Introduction1:05

    This is our course intro.

  • Access Set-up5:55

    After viewing this lecture, you will know how to download the Access version of the Lahman Baseball Database and how to set it up for querying.

  • RStudio Set-up7:10

    After viewing this lecture, you will be able to download R and RStudio and install R packages. You will also gain a basic familiarity with the RStudio interface.

  • The Rcpp Package1:00

    After viewing this lecture, you will be able to install the Rcpp package.

Requirements

  • Students will need to have R and RStudio installed on their own computers. (I will cover how to do this.)
  • It will be best if students also have Microsoft Access, but this is not absolutely necessary.

Description

In this course, we explain the relationship between SQL and the R package dplyr. I will show you how to query a baseball database with SQL in Microsoft Access and then show you how to do exactly the same thing with dplyr in R. We will begin with simple queries, progress to aggregation and grouping, and finish with queries involving joins. By the end of the course, you should be able to use dplyr to explore your own data sets.

At a relaxed pace, it should take about three weeks to complete the course. The course is for beginners in SQL, R, and dplyr. You also do not have to understand very much about baseball. We will be using the Lahman Baseball Database, R, dplyr, and Microsoft Access. I will show you how to install everything.

Who this course is for:

  • This course is for beginners who would like to learn about SQL and/or dplyr.
  • This course is for beginners interested in baseball analytics.
  • This course is NOT for those with extensive knowledge of both SQL and dplyr.