Web Scraping with Python, Ruby & Import.io

Learn exactly how to scrape web data from many popular websites even if you have very little experience as a programmer!
4.2 (46 ratings)
896 students enrolled
  • Lectures 46
  • Length 4.5 hours
  • Skill Level All Levels
  • Languages English
  • Includes Lifetime access
    30 day money back guarantee!
    Available on iOS and Android
    Certificate of Completion

About This Course

Published 8/2015 English

Course Description

Latest Update:
May 8, 2016 - Import.io had a major update, moving from downloadable software to a browser-based web scraper. I have added new lessons that show you how to use the new Import.io. I have also left the lectures for the downloadable software in place, since it can still be used and is (so far) the more powerful tool. Enjoy!

Also, see the end of this description for all the updates that have been made to the course over the last six months.


You'll be web scraping in less than an hour!!

This course teaches you three ways to scrape websites:

Import.io is software that you download, and it is a really quick way to start web scraping. It is not without its limitations, but depending on what you plan to do, it could cover everything you need. No programming experience necessary!

If you prefer to learn a little programming along the way, and aren't afraid to invest a little time, you can learn how to do web scraping with Ruby and/or Python. Both have their pluses and minuses but both can do just about anything you could imagine.

Even if you have never used Python or Ruby, you can still learn to use these languages for web scraping. And if you are willing to learn, you can scrape just about any website online!
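To give a feel for what a small Python scraper involves, here is a minimal sketch using only the standard library. It is not taken from the course lectures: the sample markup, the `LinkCollector` class and the `extract_links` helper are all hypothetical, and a real script would first download the page instead of using a hard-coded string.

```python
from html.parser import HTMLParser

# Hypothetical sample markup; a real scraper would download the page first.
SAMPLE_HTML = """
<ul>
  <li class="movie"><a href="/title/1">First Movie</a></li>
  <li class="movie"><a href="/title/2">Second Movie</a></li>
</ul>
"""


class LinkCollector(HTMLParser):
    """Collect (href, text) pairs for every <a> tag that has an href."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None   # href of the <a> tag currently open, if any
        self._text = []     # text pieces seen inside that tag

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def extract_links(html):
    """Parse an HTML string and return the collected links."""
    parser = LinkCollector()
    parser.feed(html)
    return parser.links


if __name__ == "__main__":
    for href, text in extract_links(SAMPLE_HTML):
        print(href, text)
```

The same idea scales up with third-party parsing libraries, but the shape stays the same: get the HTML, walk the tags, keep the pieces you care about.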

Want to do some web scraping of super popular websites? That's how you will learn!

This course comes complete with code for every lecture for Ruby and Python!

You will learn to web scrape popular websites like:

  • Zillow
  • Craigslist
  • Yelp
  • IMDB
  • Wikipedia
  • Google
  • Macys
  • YouTube
  • NFL
  • Nordstrom
  • Wal-Mart
  • Twitter

You will learn to install Python and Ruby. You'll learn how to download what you web scrape into CSV files and with Python you will learn how to put your data into a database.
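As a rough illustration of those two export steps, the sketch below writes a few scraped rows to CSV text and then into a database table. The rows, the `products` table and both helper functions are made up for the example, and SQLite (from the Python standard library) stands in for the MySQL database used in the lectures so the snippet runs without a server.

```python
import csv
import io
import sqlite3

# Hypothetical scraped rows; a real scraper would build these from a page.
ROWS = [
    {"name": "Blue Shirt", "price": "19.99"},
    {"name": "Red Scarf", "price": "7.50"},
]


def rows_to_csv(rows, fieldnames):
    """Return the rows as CSV text, starting with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


def rows_to_database(rows, connection):
    """Create a products table if needed and insert one record per row."""
    connection.execute(
        "CREATE TABLE IF NOT EXISTS products (name TEXT, price REAL)"
    )
    # Convert the scraped price strings to numbers before storing them.
    records = [(row["name"], float(row["price"])) for row in rows]
    connection.executemany(
        "INSERT INTO products (name, price) VALUES (?, ?)", records
    )
    connection.commit()


if __name__ == "__main__":
    print(rows_to_csv(ROWS, ["name", "price"]))
    conn = sqlite3.connect(":memory:")  # in-memory database for the demo
    rows_to_database(ROWS, conn)
    print(conn.execute("SELECT name, price FROM products").fetchall())
```

To save an actual file, write the returned CSV text to disk, or hand `csv.DictWriter` a file opened with `newline=""` instead of the in-memory buffer.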

Other lessons include how to get around obstacles such as websites with ajax that load content dynamically. You'll also learn how to crawl multiple pages of websites and get all the data you need!
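Crawling multiple pages usually comes down to a loop: scrape a page, find the link to the next one, and repeat until there is none. The sketch below shows that loop under some loud assumptions: `FAKE_SITE` is an invented three-page site held in a dictionary, `fetch` is whatever function returns HTML for a URL, and the regular expressions only fit this sample markup. It does not cover Ajax content, which needs a browser-driving tool such as the Selenium setup covered in the Python section.

```python
import re

# Hypothetical three-page site: each URL maps to the HTML it would return.
FAKE_SITE = {
    "/jobs?page=1": '<p class="job">Editor</p><a rel="next" href="/jobs?page=2">Next</a>',
    "/jobs?page=2": '<p class="job">Designer</p><a rel="next" href="/jobs?page=3">Next</a>',
    "/jobs?page=3": '<p class="job">Developer</p>',
}

# Patterns written for the sample markup above, not for any real website.
JOB_PATTERN = re.compile(r'<p class="job">(.*?)</p>')
NEXT_PATTERN = re.compile(r'<a rel="next" href="(.*?)">')


def crawl(start_url, fetch, max_pages=50):
    """Follow "next" links from start_url and collect every job title.

    fetch is any callable that takes a URL and returns that page's HTML.
    max_pages and the seen set guard against endless loops.
    """
    items = []
    seen = set()
    url = start_url
    while url and url not in seen and len(seen) < max_pages:
        seen.add(url)
        html = fetch(url)
        items.extend(JOB_PATTERN.findall(html))
        match = NEXT_PATTERN.search(html)
        url = match.group(1) if match else None
    return items


if __name__ == "__main__":
    print(crawl("/jobs?page=1", FAKE_SITE.__getitem__))
```

Passing `fetch` in as a parameter keeps the loop testable offline; in a real crawler it would be a function that downloads the page over HTTP.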

With Import.io you will learn how to scrape a website that has over 40,000 products on it. Yes, 40,000 products, and you will learn how to scrape all of them.

Stop imagining you can grab data off websites and use it for your next web project; you can do it now.

Please note: Software installation can be done on both Mac OS X and Windows. I use a Mac, so the course is more focused on OS X; however, instructions are available for Windows.

Update: January 17, 2016 - Added three Scrapy videos: how to set up Scrapy and how to scrape Craigslist
Update: October 17, 2015 - Added bonus lesson on extracting zip codes from Yelp
Update: September 15, 2015 - Added how to find JSON and scrape it on Walmart
Update: September 1, 2015 - Three new Twitter API scraper videos: how to scrape user tweets, how to find users tweeting given keywords, and how to follow those users through Ruby and the Twitter API
Update: August 28, 2015 - Added Wikipedia scraper and map
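The September 15, 2015 update is about finding JSON on a page and scraping that instead of the visible HTML. A minimal sketch of the idea follows. The page, the `product-data` script tag and the field names are invented for illustration and are not Walmart's actual markup; on a real site you would look for the JSON in the page source or in the browser's network tab.

```python
import json
import re

# Hypothetical page with product data embedded as JSON in a script tag.
SAMPLE_PAGE = """
<html><body>
<script type="application/json" id="product-data">
{"products": [{"name": "Toaster", "price": 24.88}, {"name": "Kettle", "price": 17.0}]}
</script>
</body></html>
"""

# Matches the invented script tag above; DOTALL lets "." span line breaks.
SCRIPT_PATTERN = re.compile(
    r'<script type="application/json" id="product-data">(.*?)</script>',
    re.DOTALL,
)


def extract_products(html):
    """Return (name, price) tuples parsed from the embedded JSON, if present."""
    match = SCRIPT_PATTERN.search(html)
    if match is None:
        return []
    data = json.loads(match.group(1))
    return [(product["name"], product["price"]) for product in data["products"]]


if __name__ == "__main__":
    for name, price in extract_products(SAMPLE_PAGE):
        print(name, price)
```

When a site ships its data as JSON, parsing it this way tends to be more reliable than picking values out of the rendered HTML, because the structure is already machine-readable.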

Enroll now!

What are the requirements?

  • Know how to install software on your computer
  • Be able to understand a reasonable level of programming (much of it will be explained)
  • Be able to follow step-by-step instructions

What am I going to get from this course?

  • Scrape a variety of websites
  • Scrape websites with Ruby
  • Scrape websites with Python
  • Scrape websites with Import.io

Who is the target audience?

  • Anyone who has an interest in, or an idea for, using data from other websites


Section 1: Course Introduction

YES you will learn to scrape websites! And no, you don't need to be a highly skilled developer!

Section 2: Import.io - Free, Easy to use software built for scraping websites

Learn what is new in the brand new web-based version of Import.io

Setting up the new Import.io
Wikipedia - Learn to scrape a table from Wikipedia
Wikipedia - Learn to grab content from a Wikipedia page
CNN.com - Learn how to grab content from a CNN article page
IMDB - Scrape the most popular movies from IMDB
NFL.com - Learn how to Scrape the top QB stats

You will learn to scrape the NFL draft picks and how to add URLs to your web scraper. 

Section 3: Import.io - Download tool
Download Import.io
Import.io overview: learn how to install and get up and running
Oh, it's kind of Magic: Grab products from IKEA

Scrape Wikipedia and map the data. Added August 28, 2015

Learn how to Extract data from IMDB
Extract data from Macys.com
Yelp: Learn how to scrape information from Yelp!
Yelp Advanced: Scrape the Top 100+ Pizza places in New York City
Yelp: Learn how to Search and scrape Yelp results with an automated process!
Learn how to crawl an entire website and capture over 40k products!
From Import.io to Google Sheets. Get it into the cloud!

Learn how to extract a zip code from Yelp

Section 4: Ruby - Don't know Ruby? It doesn't matter because you will soon!
Learn how to Install and Set Up Ruby
Simple Script: Learn How to Scrape Macys.com products
Next Step: Macys.com getting a bit more advanced as we build your skills
CSV: Now that you have Macys.com data, learn how to put it into a CSV file
Walmart: How to scrape all pages from a product category on Walmart.com
Spoof a site and look like you are a web browser!
Google + Mechanize = How to scrape search results from Google with Mechanize
BONUS: Scraping YouTube's Most Popular
BONUS: Scraping Zillow by Zip Code
BONUS: Using JSON to scrape Walmart's website
Section 5: Python - Don't know Python? It doesn't matter because you will soon!
Installing and setting up Python
Simple Script: Learn how to crawl IMDB
Learn how to submit a web form
Install Selenium and PhantomJS - Trust me you'll want these!
Yes, you can crawl JavaScript! Learn to scrape Ajax with Selenium
Simple: Crawl and export data from Nordstrom.com into a CSV
Next Step: Crawl NFL data - Grab all the QB data from 2014!
Advanced: Export NFL QB data to a CSV
Export Data to MySQL
Section 6: Scrapy - Use Scrapy built for Python to create your web crawler
Installing and setting up Scrapy
Craigslist: Use Scrapy for a basic crawl of 100 jobs
Craigslist: Crawling through paginations for all jobs
Section 7: Bonus: Ruby - How to Scrape via Twitter API
Scrape Tweets from another User or Users
Scrape Twitter searches for keywords and anyone who has tweeted using that keyw
Take the scraped users and follow them or swap out for another username

Instructor Biography

Charlie Craine, Chief Technology Officer

Over 17 years of digital media experience. Currently CTO for a B2B publishing company with eight magazine websites, as well as lead at its Interactive Agency, working with Fortune 500 companies.

Our agency specializes in web development, marketing, SEO, social and email for B2B publishers. I lead the agency's web development, video, SEO and app development.

Previous experience includes Digital General Manager over cool magazines such as Guitar World and Revolver Magazine, and General Manager running digital for the daily newspaper in Salt Lake City, with over 30 million page views per month.

Core competencies include:
SEO & SEM · Google Analytics · Social Media · Content Management · Lead Generation · Agile Project Management · Web Development · Digital Marketing · Data Mining
