Udemy
    •  
    •  
    •  
    •  
    •  
    •  
    •  
    •  
Turn what you know into an opportunity and reach millions around the world.
Learn More
Your cart is empty.
Keep shopping
Essential Fundamentals of R
Rating: 4.4 out of 5(189 ratings)
1,835 students

Essential Fundamentals of R

Data Types and Structures in R , Inputting & Outputting Data, Writing User-Defined Functions, and Manipulating Data Sets
Last updated 7/2020
English

What you'll learn

  • Install R and RStudio and engage in a basic R session
  • Understand the characteristics of different data types and structures in R
  • Be able to read in data and write out data files from various sources
  • Sort, select, filter, subset, and manipulate tables of data in R
  • Create and execute their own user-defined functions in an R session
  • Understand how to use the apply() family of functions to execute various actions against different R data structures
  • Know how to use reshaping and recoding "short cuts" for changing data types and for rearranging data structures.

Course content

7 sections46 lectures10h 32m total length
  • Introduction to R Software14:56

    R is a programming language and software environment for statistical computing and graphics. The R language is widely used among statisticians and data miners for developing statistical software and data analysis. Polls, surveys of data miners, and studies of scholarly literature databases show that R's popularity has increased substantially in recent years.

    R is an implementation of the S programming language combined with lexical scoping semantics inspired by Scheme. S was created by John Chambers while at Bell Labs. There are some important differences, but much of the code written for S runs unaltered.

    R was created by Ross Ihaka and Robert Gentleman at the University of Auckland, New Zealand, and is currently developed by the R Development Core Team, of which Chambers is a member. R is named partly after the first names of the first two R authors and partly as a play on the name of S.

    R is a GNU project. The source code for the R software environment is written primarily in C, Fortran, and R. R is freely available under the GNU General Public License, and pre-compiled binary versions are provided for various operating systems. R uses a command line interface; there are also several graphical front-ends for it.

  • What is R?14:24

    R is a programming language and software environment for statistical computing and graphics. The R language is widely used among statisticians and data miners for developing statistical software and data analysis. Polls, surveys of data miners, and studies of scholarly literature databases show that R's popularity has increased substantially in recent years.

    R is an implementation of the S programming language combined with lexical scoping semantics inspired by Scheme. S was created by John Chambers while at Bell Labs. There are some important differences, but much of the code written for S runs unaltered.

    R was created by Ross Ihaka and Robert Gentleman at the University of Auckland, New Zealand, and is currently developed by the R Development Core Team, of which Chambers is a member. R is named partly after the first names of the first two R authors and partly as a play on the name of S.

    R is a GNU project. The source code for the R software environment is written primarily in C, Fortran, and R. R is freely available under the GNU General Public License, and pre-compiled binary versions are provided for various operating systems. R uses a command line interface; there are also several graphical front-ends for it.

  • Workspace Management Controls15:37

    The R Environment consists of all the files necessary for running the R Program as well as data sets and other objects that you have created or loaded into your Workspace. These files can be broken down into three basic types:

    1. The base packages that run all the standard analyses that we use in this course. These files are installed automatically when you first download and install the R program.

    2. Additional packages you can install on your own and which allow for more advanced statistical analysis or additional commands.

    3. The data sets that you download and other objects (data sets and variables) that you create.

  • Workspace Management R Manuals12:58
  • Hands-On Tutorial of R Basics (part 1)14:35
  • Hands-On Tutorial of R Basics (part 2)14:53
  • Tutorial with R Functions13:54
  • Distributional Functions and Plotting19:25

    R Functions for Probability Distributions

    Every distribution that R handles has four functions. There is a root name, for example, the root name for the normal distribution is norm. This root is prefixed by one of the letters

    • p for "probability", the cumulative distribution function (c. d. f.)
    • q for "quantile", the inverse c. d. f.
    • d for "density", the density function (p. f. or p. d. f.)
    • r for "random", a random variable having the specified distribution
    For the normal distribution, these functions are pnorm, qnorm, dnorm, and rnorm. For the binomial distribution, these functions are pbinom, qbinom, dbinom, and rbinom. And so forth.

    For a continuous distribution (like the normal), the most useful functions for doing problems involving probability calculations are the "p" and "q" functions (c. d. f. and inverse c. d. f.), because the the density (p. d. f.) calculated by the "d" function can only be used to calculate probabilities via integrals and R doesn't do integrals.

    For a discrete distribution (like the binomial), the "d" function calculates the density (p. f.), which in this case is a probability

    f(x) = P(X = x)
    and hence is useful in calculating probabilities.

Requirements

  • Students will need to install both R software and RStudio (instructions are provided)

Description

Essential Fundamentals of R is an integrated program that draws from a variety of introductory topics and courses to provide participants with a solid base of knowledge with which to use R software for any intended purpose. No statistical knowledge, programming knowledge, or experience with R software is necessary. Essential Fundamentals of R (7 sessions) covers those important introductory topics basic to using R functions and data objects for any purpose: installing R and RStudio; interactive versus batch use of R; reading data and datasets into R; essentials of scripting; getting help in R; primitive data types; important data structures; using functions in R; writing user-defined functions; the 'apply' family of functions in R; data set manipulation: and subsetting, and row and column selection. Most sessions present "hands-on" material that demonstrate the execution of R 'scripts' (sets of commands) and utilize many extended examples of R functions, applications, and packages for a variety of common purposes. RStudio, a popular, open source Integrated Development Environment (IDE) for developing and using R applications, is also utilized in the program, supplemented with R-based direct scripts (e.g. 'command-line prompts') when necessary.

Who this course is for:

  • Anyone who is interested in learning to use R software who is relatively new (or 'brand new') to using R
  • People who wish to learn the essential fundamentals of using R including data types and structures, inputting and outputting data and files, writing user-defined functions, and manipulating data sets
  • College undergrads and/or graduate students who are looking for an alternative to using SAS or SPSS software
  • Professionals engaged in quantitative analyses and/or data analyses tasks who seek an alternative to using SAS and/or SPSS software.