Udemy
    •  
    •  
    •  
    •  
    •  
    •  
    •  
    •  
Turn what you know into an opportunity and reach millions around the world.
Learn More
Your cart is empty.
Keep shopping
Understanding New Data - Exploratory Analysis in R
Rating: 4.4 out of 5(20 ratings)
209 students

Understanding New Data - Exploratory Analysis in R

Learn how to use R to quickly understand and analyze new data and start your data analysis projects with ease
Last updated 3/2021
English

What you'll learn

  • Identify suitable R libraries for data exploration
  • Create suitable data visualizations
  • Learn the succession of steps in data exploration
  • Use a combination of hypothesis tests, explorations and models
  • How to prepare data for exploration
  • What to do when problems arise in the initial stages
  • Work with the main variable types
  • Use time series data

Course content

3 sections63 lectures6h 52m total length
  • The Landscape: Data Science and Data Analysis10:30

    Explore how data analysis sits inside the data science landscape—from research questions and data collection to data cleaning, validation, exploratory analysis, modeling, and communicating findings.

  • Data Analysis Stages: IDA, EDA and CDA8:05

    Explore the data analysis pipeline from initial data analysis to exploratory data analysis and confirmatory data analysis, focusing on data quality checks, preprocessing, visualization, and hypothesis testing.

  • Why Do We Work with Statistical Samples? - Population vs. Sample8:36

    Learn why researchers use samples instead of testing whole populations, and how random, representative samples estimate population parameters with sample statistics.

  • The Normal Probability Distribution8:36
  • The Tidyverse5:59

    Explore the tidyverse ecosystem, a suite of data cleaning and preprocessing packages with pipe-based workflows. Master core tools and the reader and table concepts for efficient data preparation.

  • Datasets and R Libraries3:51

    Install and update R and RStudio, run the course scripts to install required packages (base and tidyverse), and explore datasets car parts, diamonds, and flights used as training data.

  • Summary1:39

    Contrast data science with data analysis and the initial and exploratory stages. Grasp population versus sample and the normal distribution while setting up R and Studio.

Requirements

  • Basic R programming skills
  • A general understanding of statistics and data visualization
  • R and RStudio ready on your computer

Description

  • Are you new to R and data analysis?

  • Do you ever struggle starting an analysis with a new dataset?

  • Do you have problems getting the data into shape and selecting the right tools to work with?

  • Have you ever wondered if a dataset had the information you were interested in and if it was worth the effort?

If some of these questions occurred to you, then this program might be a good start to set you up on your data analysis journey. Actually, these were the question I had in mind when I designed the curriculum of this course. As you can see below, the curriculum is divided into three main sections. Although this course doesn't have a focus on the basic concepts of statistics, some of the most important concepts are covered in the first section of the course.

The two other sections have their focus on the initial and the exploratory data analysis phases respectively. Initial data analysis (or IDA for short) is where we clean and shape the data into a form suitable for the planned methods. This is also where we make sure the data makes sense from a statistical point of view. In the IDA section I present tools and methods that will help you figure out if the data was collected properly and if it is worthy of being analyzed.

On the other hand, the exploratory data analysis (EDA) section offers techniques to find out if the data can answer your analytical questions, or in other words, if the data has a relevant story to tell. This will spare you from investing time and effort into a project that will not deliver the results you hoped for. In an ideal case the results of EDA may confirm that the planned analysis is worth it and that there are insights to be gained from that dataset and project.

If you are interested in statistical methods and R tools that help you bridge the gap between data collection and the confirmatory data analysis (CDA), then this program is for you. Take a look at the curriculum and give this course a try!

Who this course is for:

  • Data scientists
  • Analysts of all fields
  • Researchers working and analyzing data
  • Young professionals wanting to switch to data analysis related work
  • Students taking data analysis exams
  • Everyone interested in analyzing data
  • Data exploration is an initial phase of a data analysis project therefore you will need these skills in most of your projects