Text Mining, Scraping and Sentiment Analysis with R
3.9 (449 ratings)
Course Ratings are calculated from individual students’ ratings and a variety of other signals, like age of rating and reliability, to ensure that they reflect course quality fairly and accurately.
3,654 students enrolled

Text Mining, Scraping and Sentiment Analysis with R

Learn how to use Twitter social media data for your R text mining work.
3.9 (449 ratings)
Course Ratings are calculated from individual students’ ratings and a variety of other signals, like age of rating and reliability, to ensure that they reflect course quality fairly and accurately.
3,654 students enrolled
Last updated 11/2017
English [Auto]
Current price: $61.99 Original price: $94.99 Discount: 35% off
7 hours left at this price!
30-Day Money-Back Guarantee
This course includes
  • 2.5 hours on-demand video
  • 9 articles
  • 6 downloadable resources
  • Full lifetime access
  • Access on mobile and TV
  • Certificate of Completion
Training 5 or more people?

Get your team access to 4,000+ top Udemy courses anytime, anywhere.

Try Udemy for Business
What you'll learn
  • use R for social media mining
  • get data from Twitter to do text analysis
  • perform web scraping tasks using the twitteR package
  • know which packages to use for web scraping
  • get R and Twitter connected
  • know how to perform a sentiment analysis in R
  • plot text data visualizations
Course content
Expand all 38 lectures 03:08:31
+ Introduction
6 lectures 16:09

Those are the packages you can use in R for natural language processing tasks.

Preview 08:11

Packages for web related tasks like scraping.

Preview 04:47
Course Links
Course Script
17 pages

This worksheet contains the exercises and solutions. Note that there is also the yahoo finance screenshot for exercise 2 available as download.

Worksheet - Exercises
10 pages
+ Scraping and Text Mining
16 lectures 01:05:22

You will learn how to get a Twitter developer account and which info you need to get from that account.

Twitter Developer Account
Important: Connection Information
Code Section: Social Media Mining

In this video I will show you the needed R code to get the scraping process going.

Connection of R Studio with Twitter
Alternative Authentication

In case the standard way to connect to Twitter does not work, here is an alternative code.

Alternative Authentication Code

We will use the searchTwitter function to scrape data from Twitter.

Preview 08:50

In this video I will show you how to clean a text corpus (punctuation, space, stopwords) with the "tm" package. We will also do some transformations for easier text mining.

Text Mining with "tm" - Text Cleaning and Transformations

Let`s plot a wordcloud with our data!


In this video I will show you how to get a document term matrix from your corpus. I will also show you how you can extract the most frequent terms from your data.

Document Term Matrix and Frequent Terms

We will create a dendrogram and we will also identify the term groups in the data.

Dendrogram and Term Groups
Exercise - Quiz
7 questions
Exercise: Text Mining
Solution: Text Mining Part 1
Solution: Text Mining Part 2
Exercise Code: Text Mining
Further R Exercises
+ Working with Strings - gsub and the Regular Expression syntax
8 lectures 30:22
Regular Expressions and gsub for sentiment analysis - handling of scraped data
Code Section: Strings
Working with Strings - Introduction
Working with Strings - gsub
Working with Strings - gsub advanced
Regular Expression Overview
2 pages
Working with Strings - Library Stringr
Exercise and Solution: Strings in R
+ Sentiment Analysis
8 lectures 47:37
Section Code: Sentiment Analysis

You will learn the theory behind a sentiment analysis and I will also walk you through a simple example.

Sentiment Analysis Basics

We will take a closer look at our main function for sentiment analysis.

Score Sentiment Function - J. Breen Approach

We will get Tweets to compare the sentiment on 4 different countries.

Tweets for Sentiment Analysis

We will score our Tweets and visualize the data.

Visualizing the Sentiments
Exercise: Sentiment Analysis
Exercise Code: Sentiment Analysis
  • intermediate R knowledge is required (R Level 1 course)
  • R program ready on your computer
  • basic understanding of social media and web technologies

Are you an advanced R user, looking to expand your R toolbox?

Are you interested in social media sentiment analysis?

Do you want to learn how you can get and use Twitter data for your R analysis?

Do you want to learn how you can systematically find related words (keywords) to a search term using Twitter and R?

Are you interested in creating visualizations like wordclouds out of text data?

Do you want to learn which R packages you can use for web scraping and text analysis purposes?

If YES came to your mind to some of those points – this course might be tailored towards your needs!

This course will teach you anything you need to know about how to handle social media data in R. We will use Twitter data as our example dataset.

During this course we will take a walk through the whole text analysis process of Twitter data.

At first you will learn which packages are available for social media analysis.

You will learn how to scrape social media (Twitter) data and get it into your R session.

After that we will filter, clean and structure our text corpus.

The next step is the visualization of the text data via wordclouds and dendrograms.

And in the last section we will do a whole sentiment analysis by using a common word lexicon.

All of those steps are accompanied by exercise sessions so that you can check if you can put the information to work.

According to the teaching principles of R Tutorials every section is enforced with exercises for a better learning experience. You can download the code pdf of every section to try the presented code on your own.

Disclaimer required by Twitter: 'TWITTER, TWEET, RETWEET and the Twitter logo are trademarks of Twitter, Inc or its affiliates.'

Who this course is for:
  • everybody interested in social media analysis
  • everybody interested in using R for web scraping
  • everybody interested in sentiment analysis
  • everybody interested in text analysis
  • everybody interested in enlarging their R toolbox