Web Scraping with Python, Ruby & Import.io

Learn exactly how to scrape web data from many popular websites even if you have very little experience as a programmer!
3.8 (40 ratings)
852 students enrolled
$19
$75
75% off
  • Lectures 46
  • Length 4.5 hours
  • Skill Level All Levels
  • Languages English
  • Includes Lifetime access
    30 day money back guarantee!
    Available on iOS and Android
    Certificate of Completion

About This Course

Published 8/2015 English

Course Description

Latest Update:
May 8, 2016 - Import.io had a major update, moving from downloadable software to a browser-based web scraper. I have added new lessons to show you how to use the new Import.io. I have also left up the lectures for the downloadable software, since it can still be used and is (so far) the more powerful tool. Enjoy!

Also, see the end of this description for all the updates that have been made to the course over the last six months.

_____________________________________________________________

You'll be web scraping in less than an hour!!

This course teaches you three ways to scrape websites:

Import.io is software that you download, and it is a really quick way to start web scraping. It is not without its limitations, but depending on what you plan to do, it could cover everything you need. No programming experience necessary!

If you prefer to learn a little programming along the way, and aren't afraid to invest a little time, you can learn how to do web scraping with Ruby and/or Python. Both have their pluses and minuses but both can do just about anything you could imagine.

Even if you have never used Python or Ruby, you can still learn how to use them as web scrapers. And if you are willing to learn, you can scrape just about any website there is online!

Want to do some web scraping of super popular websites? That's how you will learn!

This course comes complete with code for every lecture for Ruby and Python!

You will learn to web scrape popular websites like:

  • Zillow
  • Craigslist
  • Yelp
  • IMDB
  • Wikipedia
  • Google
  • Macys
  • YouTube
  • NFL
  • Nordstrom
  • Wal-Mart
  • Twitter

You will learn to install Python and Ruby. You'll learn how to download what you web scrape into CSV files and with Python you will learn how to put your data into a database.
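As a rough illustration of the scrape, CSV, and database steps described above, here is a minimal sketch using only the Python standard library. The sample markup, the `ProductParser` class, and the `products` table are all hypothetical, and `sqlite3` stands in for the MySQL database named in the curriculum so the example runs without a server. It is not the code used in the course lectures.

```python
import csv
import io
import sqlite3
from html.parser import HTMLParser

# Hypothetical product listing markup, standing in for a real page.
SAMPLE_HTML = """
<ul>
  <li class="product"><span class="name">Blue Shirt</span><span class="price">19.99</span></li>
  <li class="product"><span class="name">Red Scarf</span><span class="price">7.50</span></li>
</ul>
"""


class ProductParser(HTMLParser):
    """Collect (name, price) pairs from <span class="name"> / <span class="price">."""

    def __init__(self):
        super().__init__()
        self.rows = []       # finished (name, price) tuples
        self._field = None   # which span we are currently inside, if any
        self._current = {}   # fields gathered for the product in progress

    def handle_starttag(self, tag, attrs):
        cls = dict(attrs).get("class")
        if tag == "span" and cls in ("name", "price"):
            self._field = cls

    def handle_data(self, data):
        if self._field:
            self._current[self._field] = data.strip()

    def handle_endtag(self, tag):
        if tag == "span":
            self._field = None
        elif tag == "li" and self._current:
            # One <li> is one product: store it and start fresh.
            self.rows.append((self._current["name"], float(self._current["price"])))
            self._current = {}


def scrape(html):
    """Return a list of (name, price) tuples found in the markup."""
    parser = ProductParser()
    parser.feed(html)
    return parser.rows


def to_csv(rows):
    """Serialize rows to CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["name", "price"])
    writer.writerows(rows)
    return buffer.getvalue()


def to_database(rows, connection):
    """Insert rows into a (hypothetical) products table."""
    connection.execute("CREATE TABLE IF NOT EXISTS products (name TEXT, price REAL)")
    connection.executemany("INSERT INTO products VALUES (?, ?)", rows)
    connection.commit()


rows = scrape(SAMPLE_HTML)
csv_text = to_csv(rows)
db = sqlite3.connect(":memory:")  # in-memory database, nothing written to disk
to_database(rows, db)
```

To write a real file instead of a string, the same `csv.writer` calls work on a file opened with `open("products.csv", "w", newline="")`.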

Other lessons include how to get around obstacles such as websites that use Ajax to load content dynamically. You'll also learn how to crawl multiple pages of websites and get all the data you need!
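Crawling multiple pages usually comes down to following a "next" link until there isn't one. The sketch below shows that loop in Python against a hypothetical in-memory site (`FAKE_SITE`), so it runs without network access. The URL scheme, the `rel="next"` markup, and the `crawl` helper are assumptions for illustration, and the regular expressions are a shortcut that a real scraper would likely replace with an HTML parser.

```python
import re
from urllib.parse import urljoin

# Hypothetical site: each "page" lists one job and may link to a next page.
FAKE_SITE = {
    "http://example.com/jobs?page=1": '<p class="job">Baker</p><a rel="next" href="/jobs?page=2">Next</a>',
    "http://example.com/jobs?page=2": '<p class="job">Driver</p><a rel="next" href="/jobs?page=3">Next</a>',
    "http://example.com/jobs?page=3": '<p class="job">Editor</p>',
}

JOB_PATTERN = re.compile(r'<p class="job">(.*?)</p>')
NEXT_PATTERN = re.compile(r'<a rel="next" href="(.*?)">')


def crawl(start_url, fetch, max_pages=50):
    """Follow rel="next" links from start_url, collecting job titles.

    fetch is any callable that takes a URL and returns its HTML as text.
    """
    results = []
    seen = set()  # guards against pagination loops
    url = start_url
    while url and url not in seen and len(seen) < max_pages:
        seen.add(url)
        html = fetch(url)
        results.extend(JOB_PATTERN.findall(html))
        match = NEXT_PATTERN.search(html)
        # Resolve relative links against the current page's URL.
        url = urljoin(url, match.group(1)) if match else None
    return results


jobs = crawl("http://example.com/jobs?page=1", FAKE_SITE.__getitem__)
```

Passing `fetch` in as a parameter keeps the loop testable: the same `crawl` function could be given a function that downloads pages over HTTP instead of the dictionary lookup used here.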

With Import.io you will learn how to crawl a website that has over 40,000 products on it. Yes, 40,000 products, and you will learn how to scrape all of them.

Stop imagining that you could grab data off websites and use it for your next web project. You can do it now.

Please note: Software installation can be done on both Mac OS X and Windows. I use a Mac, so the course is more focused on OS X; however, instructions are available for Windows.

Update: January 17, 2016 - Added three Scrapy videos: how to set up Scrapy and how to scrape Craigslist
Update: October 17, 2015 - Added a bonus lesson on extracting zip codes from Yelp
Update: September 15, 2015 - Added how to find JSON and scrape it on Walmart
Update: September 1, 2015 - Added three new Twitter API scraper videos: how to scrape user tweets, how to find users who tweet given keywords, and how to follow those users through Ruby and the Twitter API
Update: August 28, 2015 - Added Wikipedia scraper and map
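Several of the updates above involve pulling structured data out of a page rather than reading its visible HTML. As a hedged illustration of the "find JSON and scrape it" idea, the Python sketch below extracts a JSON payload embedded in a `<script>` tag. The page markup, the `product-data` id, and the `items` key are hypothetical and do not describe how Walmart's site is actually structured.

```python
import json
import re

# Hypothetical page: product data embedded as JSON inside a <script> tag.
SAMPLE_PAGE = """
<html><body>
<script id="product-data" type="application/json">
{"items": [{"name": "Kettle", "price": 24.0}, {"name": "Toaster", "price": 31.5}]}
</script>
</body></html>
"""

SCRIPT_PATTERN = re.compile(
    r'<script id="product-data" type="application/json">(.*?)</script>',
    re.DOTALL,  # the JSON payload spans several lines
)


def extract_products(html):
    """Return (name, price) tuples from the embedded JSON, or [] if absent."""
    match = SCRIPT_PATTERN.search(html)
    if match is None:
        return []
    payload = json.loads(match.group(1))
    return [(item["name"], item["price"]) for item in payload["items"]]


products = extract_products(SAMPLE_PAGE)
```

When a site exposes its data this way, parsing the JSON is often more robust than scraping the rendered markup, because the field names tend to change less often than the page layout.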

Enroll now!

What are the requirements?

  • Know how to install software on your computer
  • Be able to understand a reasonable level of programming (much of it will be explained)
  • Be able to follow step-by-step instructions

What am I going to get from this course?

  • Scrape a variety of websites
  • Scrape websites with Ruby
  • Scrape websites with Python
  • Scrape websites with Import.io

What is the target audience?

  • Anyone who has an interest in, or an idea for, using data from other websites

Curriculum

Section 1: Course Introduction
01:30

YES you will learn to scrape websites! And no, you don't need to be a highly skilled developer!

Section 2: Import.io - Free, Easy to use software built for scraping websites
01:38

Learn what is new in the brand new web-based version of Import.io

Setting up the new Import.io
03:12
Wikipedia - Learn to scrape a table from Wikipedia
05:15
Wikipedia - Learn to grab content from a Wikipedia page
04:53
CNN.com - Learn how to grab content from a CNN article page
03:03
IMDB - Scrape the most popular movies from IMDB
02:34
NFL.com - Learn how to Scrape the top QB stats
04:44
07:01

You will learn to scrape the NFL draft picks and how to add URLs to your web scraper. 

Section 3: Import.io - Download tool
Download Import.io
Article
Import.io overview: learn how to install and get up and running
00:33
Oh, it's kind of Magic: Grab products from IKEA
02:12
05:29

Scrape Wikipedia and map the data. Added August 28, 2015

Learn how to Extract data from IMDB
04:09
Extract data from Macys.com
04:51
Yelp: Learn how to scrape information from Yelp!
04:50
Yelp Advanced: Scrape the Top 100+ Pizza places in New York City
08:03
Yelp: Learn how to Search and scrape Yelp results with an automated process!
08:11
Learn how to crawl an entire website and capture over 40k products!
06:26
From Import.io to Google Sheets. Get it into the cloud!
02:19
03:08

Learn how to extract a zip code from Yelp

Section 4: Ruby - Don't know Ruby? It doesn't matter because you will soon!
Learn how to Install and Setup Ruby
01:48
Simple Script: Learn How to Scrape Macys.com products
12:20
Next Step: Macys.com getting a bit more advanced as we build your skills
04:11
CSV: Now you have Macys.com data learn how to put it into a CSV file
04:41
Walmart: How to scrape all pages from a product category on Walmart.com
07:55
Spoof a site and look like you are a web browser!
02:27
Google + Mechanize = How to scrape search results from Google with Mechanize
04:32
BONUS: Scraping YouTube's Most Popular
04:53
BONUS: Scraping Zillow by Zip Code
06:44
BONUS: Using JSON to scrape Walmart's website
08:58
Section 5: Python - Don't know Python? It doesn't matter because you will soon!
Installing and setting up Python
08:02
Simple Script: Learn how to crawl IMDB
15:07
Learn how to submit a web form
06:15
Install Selenium and PhantomJS - Trust me you'll want these!
03:31
Yes, you can crawl JavaScript! Learn to scrape Ajax with Selenium
11:36
Simple: Crawl and export data from Nordstrom.com into a CSV
06:00
Next Step: Crawl NFL data - Grab all the QB data from 2014!
09:20
Advanced: Export NFL QB data to a CSV
05:39
Export Data to MySQL
16:27
Section 6: Scrapy - Use Scrapy built for Python to create your web crawler
Installing and setting up Scrapy
06:13
Craigslist: Use Scrapy for a basic crawl of 100 jobs
19:37
Craigslist: Crawling through paginations for all jobs
02:50
Section 7: Bonus: Ruby - How to Scrape via Twitter API
Scrape Tweets from another User or Users
13:09
Scrape Twitter searches for keywords and anyone who has tweeted using that keyw
06:08
Take the scraped users and follow them or swap out for another username
05:04

Instructor Biography

Charlie Craine, Chief Technology Officer

Over 17 years of digital media experience. Currently CTO for a B2B publishing company with eight magazine websites, as well as lead at its interactive agency, which works with Fortune 500 companies.

Our agency specializes in web development, marketing, SEO, social and email for B2B publishers. Leads the agency's web development, video, SEO and app development.

Previous experience includes Digital General Manager for cool magazines such as Guitar World and Revolver Magazine, and General Manager running digital for the daily newspaper in Salt Lake City, with over 30 million page views per month.

Core competencies include:
(SEO) & (SEM) · Google Analytics · Social Media · Content Management · Lead Generation · Agile Project management · Web Development · Digital Marketing · Data Mining
