Learning Data Analysis with R
4.0 (2 ratings)
Instead of using a simple lifetime average, Udemy calculates a course's star rating by considering a number of different factors such as the number of ratings, the age of ratings, and the likelihood of fraudulent ratings.
46 students enrolled
Wishlisted Wishlist

Please confirm that you want to add Learning Data Analysis with R to your Wishlist.

Add to Wishlist

Learning Data Analysis with R

Find, process, analyze, manipulate, and crunch data in R
4.0 (2 ratings)
Instead of using a simple lifetime average, Udemy calculates a course's star rating by considering a number of different factors such as the number of ratings, the age of ratings, and the likelihood of fraudulent ratings.
46 students enrolled
Created by Packt Publishing
Last updated 9/2017
English
Current price: $10 Original price: $125 Discount: 92% off
5 hours left at this price!
30-Day Money-Back Guarantee
Includes:
  • 6 hours on-demand video
  • 1 Supplemental Resource
  • Full lifetime access
  • Access on mobile and TV
  • Certificate of Completion
What Will I Learn?
  • Import and export data in various formats in R
  • Perform advanced statistical data analysis
  • Visualize your data on Google or Open Street maps
  • Enhance your data analysis skills and learn to handle even the most complex datasets
  • Learn how to handle vector and raster data in R
View Curriculum
Requirements
  • Some basic knowledge of R and programming is assumed, along with a background in mathematics.
Description

R is a programming language and software environment for statistical computing and graphics supported by the R Foundation for Statistical Computing. The R language is widely used among statisticians and data miners for developing statistical software and data analysis.

This video delivers viewers the ability to conduct data analysis in practical contexts with R, using core language packages and tools. The end goal is to provide analysts and data scientists a comprehensive learning course on how to manipulate and analyse small and large sets of data with R. It will introduce how CRAN works and will demonstrate why viewers should use them.

You will start with the most basic importing techniques, to downloading compressed data from the web and learn of more advanced ways to handle even the most difficult datasets to import. Next, you will move on to create static plots, while the second will show how to plot spatial data on interactive web platforms such as Google Maps and Open Street maps. Finally, you will learn to implement your learning with real-world examples of data analysis.

This video will lay the foundations for deeper applications of data analysis, and pave the way for advanced data science.

About The Author

Fabio Veronesi obtained a Ph.D. in digital soil mapping from Cranfield University and then moved to ETH Zurich, where he has been working for the past three years as a postdoc. In his career, Dr. Veronesi worked at several topics related to environmental research: digital soil mapping, cartography and shaded relief, renewable energy and transmission line siting. During this time Dr. Veronesi specialized in the application of spatial statistical techniques to environmental data.

Who is the target audience?
  • If you are a statistician, analyst, or a budding data scientist and want to learn how to analyze data with R, then this is the course for you.
Compare to Other R Courses
Curriculum For This Course
75 Lectures
05:59:07
+
Importing Data in Table Format
6 Lectures 21:09
This video provides an overview of the entire course.


Preview 04:16

Accessing and importing open access environmental data is a crucial skill for data scientists. This section teaches you how to download data from the Web, import it in R and check it for consistency.

Importing Data from Tables (read.table)
02:30

Often times, datasets are provided for free, but on FTP, websites and practitioners need to be able to access them. R is perfectly capable of downloading and importing data from FTP sites.

Downloading Open Data from FTP Sites
04:03

Not all text files can be opened easily with read.table. The fixed-width format is still popular but requires a bit more work in R.

Fixed-Width Format
04:24

Some data files are simply too difficult to be imported with simple functions. Luckily R provides the readLines function that allows importing of even the most difficult tables.

Importing with read.lines (The Last Resort)
03:20

Most open data is generated automatically and therefore may contain NA or other values that need to be removed. R has various functions to deal with this problem.

Cleaning Your Data
02:36
+
Handling the Temporal Component
5 Lectures 17:57

To follow the exercises in the book viewers would need to install several important packages. This video will explain how to do and where to find information about them.

Preview 04:09

Vector data are very popular and widespread and require some thoughts before importing. R has dedicated tools to import these data and work with them.

Importing Vector Data (ESRI shp and GeoJSON)
04:02

Often times, spatial data is provided in tables and needs to be transformed before it can be used for analysis. This can be done simply with the sp package.

Transforming from data.frame to SpatialPointsDataFrame
02:50

Geographical projections are very important and need to be handled carefully. R provides robust functions to do so successfully
Understanding Projections
03:06

Many datasets have a temporal component and practitioners need to know how to deal with it. R provides functions to do that in a very easy way.

Basic time/dates formats
03:50
+
Importing Raster Data
4 Lectures 18:10

Raster data is fundamentally different from vector data, since its values refer to specific areas (cells) and no single locations. This video will clearly explain this difference and teach users how to import this data in R.

Preview 04:58

The NetCDF format is becoming very popular, since it allows to store 4D datasets. This requires some technical skills to be accessed and this video will teach viewers to open and import NetCDF files.

Reading Raster Data in NetCDF
06:10

Many raster datasets we download from the web are distributed in tiles, meaning a single raster for each subset of the area. To obtain a full raster for the study area we are interested to cover we can create a mosaic.

Mosaicking
02:52

Mosaicking involves merging rasters based on location. Spatio-temporal datasets include also multiple rasters for the same location but different times. To merge these we need to use the stacking function.

Stacking to Include the Temporal Component
04:10
+
Exporting Data
4 Lectures 10:55

Once we complete our analysis we often need to export our results and share them with colleagues. Popular formats are CSV and TXT files, which we learn how to export in this video.

Preview 03:12

If we work with vector data and we want to share the same format with our co-workers, we need to learn how to export in vector formats. This will be covered here.

Exporting Vector Data (ESRI shp File)
02:21

Many raster datasets we download from the Web are distributed in tiles, meaning a single raster for each subset of the area. To obtain a full raster for the study area we are interested in covering, we can create a mosaic.

Exporting Rasters in Various Formats (GeoTIFF, ASCII Grids)
02:42

Nowadays WebGIS applications are extremely popular. However, to use our data for WebGIS, we first need to export them in the correct format. This video will show how to do that.

Exporting Data for WebGIS Systems (GeoJSON, KML)
02:40
+
Descriptive Statistics
5 Lectures 23:49

In the previous volume we explored the basics R functions and syntaxes to import various types of data. In this video we will put these functions together, and overcome some unexpected challenges, to import a full year of NOAA data.

Preview 07:44

Before we can start analyzing our data we first need to properly understand what we are dealing with. The first step we have to take in this direction is describe our data with simple statistical indexes.

Measuring Spread (Standard Deviation and Standard Distance)
03:23

Numerical summaries are very useful but certainly not ideal to provide us with a direct feeling for the dataset in hands. Plots are much more informative and thus being able to produce them is certainly a crucial skill for data analysts.

Understanding Your Data with Plots
05:50

For multivariate data we are often interested in assessing correlation between variables. This can be done in R very easily, and ggplot2 can also be used to produce more informative plots.

Plotting for Multivariate Data
03:02

Detecting outliers is another basic skill that every data analyst should have and master. R provides a lot of technical tools to help us in finding outliers.

Finding Outliers
03:50
+
Manipulating Vector Data
5 Lectures 16:32

This Section will be dedicated entirely to manipulating vector data. However, viewers first need to familiarize with some basic concepts, otherwise they may not be able to understand the rest of the section.

Preview 03:37

In volume 1 we learned how to set the projection of our spatial data. However, in many cases we have to change this projection to successfully complete our analysis, and this requires some specific knowledge.

Re-Projecting Your Data
02:54

In many cases we may be interested in understanding the relation between spatial objects. One of such relations is the intersection, where we first want to know how two objects intersect, and then also extract only the part of one of these object that is included or outside the first.

Intersection
03:07

Other important GIS operations that users have to master involve creating buffers and calculating distances between objects.

Buffer and Distance
03:22

The last two GIS functions that anybody should master are used to merge different geometries and spatial objects and overlay

Union and Overlay
03:32
+
Manipulating Raster Data
5 Lectures 21:41

Raster objects are imported in R as rectangular matrixes. Users needs to be aware of this to properly work on these data, otherwise it may create some issues during the data analysis.

Preview 04:43

In many cases open data are not distributed directly in raster formats and they need to be converted. This can be easily done with the right functions.

Converting Vector/Table Data into Raster
04:00

Working with raster data often means extracting data for particular locations for further analysis, or crop the data to reduce their size. These are essential skills to master for any data analyst.

Subsetting and Selection
03:16

Sometimes we may need to filter out some values of our raster. It may seem tricky but only because it requires some skills.

Filtering
04:58

Creating new raster by calculating their value is extremely important for spatial data analysis. Doing so is simple but can be difficult to understand at first.

Raster Calculator
04:44
+
Visualizing Spatial Data
5 Lectures 28:19

Syntactically plotting spatial data in R is no different than plotting other types of data. Therefore, users need to know the basics of plotting before they can start making maps.

Preview 05:15

Creating multilayer plot can be difficult because we need to take care of several different aspects at once. However, learning that is very easy.

Adding Layers
05:44

When plotting spatial data we are often interested in using colors to show the values of some variables. This can be done manually but producing the right color scale may be difficult. This issue can be solved employing automatic methods.

Color Scale
04:51

Creating multivariate plots not only means adding layers, but also using legends so that the viewer understands what the plot is showing. Creating legends in R is tricky because it requires a lot of tweaking, which will be explained here.

Creating Multivariate Plots
09:09

Temporal data need to be treated with specific procedures to highlight this additional component. This may be done in different ways depending on the scope of the analysis and R provides the right platform for this.

Handling the Temporal Component
03:20
+
Interactive Maps
5 Lectures 26:20

Being able to plot spatial data on web maps is certainly helpful and a crucial skill to have, but it can be difficult since it requires knowledge of different technologies. R makes this process very easy with dedicated functions that allow us to plot on web GIS services a breeze.

Preview 02:32

Plotting data with the function plotGoogleMaps is not as easy as using the function plot. With a simple step by step guide we can achieve good command of the function, so that users can plot whatever data they choose.

Plotting Vector Data on Google Maps
05:45

An interactive map with just one layer is hardly useful for our purposes. Many times we are faced with the challenge of plotting several data at once. This requires some additional work and understanding, but it is definitely not hard in R.

Adding Layers
04:41

Plotting raster data on Google maps can be tricky. The function plotGoogleMaps does not handle rasters very well and if not done correctly the visualization will fail. This video will show users how to plot rasters successfully.

Plotting Raster Data on Google Maps
04:19

Plotting on Google Maps is easy but Google Maps are commercial products therefore if we want to use the on our commercial website we would need to pay. OpenStreetMaps are free to use, therefore knowing how to use them is certainly an advantage.

Using Leaflet to Plot on Open Street Maps
09:03
+
Creating Global Economic Maps with Open Data
4 Lectures 16:55

Using open data for our analysis requires a deep knowledge of the data provider and the actual data we are using. Without this knowledge we may end up with erroneous results.

Preview 02:21

Downloading data from the World Bank can be difficult since it requires users to know the acronym used to refer to these data. However, with some help this process becomes very easy.

Importing Data from the World Bank
05:08

To create a spatial map of the World Bank data we just have to download and we need to transform them into spatial data. However, in the dataset there are no coordinates of other information that may help us do that. The solution is to use the geocoding information from another dataset for this purpose.

Adding Geocoding Information
05:38

Using the world bank data just to plot a static spatial map is very limitative. There are tons of other uses that researchers can do with these data and this video serves to provide some guidance into these additional avenue of research.

Concluding Remarks
03:48
5 More Sections
About the Instructor
Packt Publishing
3.9 Average rating
8,197 Reviews
58,896 Students
687 Courses
Tech Knowledge in Motion

Packt has been committed to developer learning since 2004. A lot has changed in software since then - but Packt has remained responsive to these changes, continuing to look forward at the trends and tools defining the way we work and live. And how to put them to work.

With an extensive library of content - more than 4000 books and video courses -Packt's mission is to help developers stay relevant in a rapidly changing world. From new web frameworks and programming languages, to cutting edge data analytics, and DevOps, Packt takes software professionals in every field to what's important to them now.

From skills that will help you to develop and future proof your career to immediate solutions to every day tech challenges, Packt is a go-to resource to make you a better, smarter developer.

Packt Udemy courses continue this tradition, bringing you comprehensive yet concise video courses straight from the experts.