Teach on Udemy

Turn what you know into an opportunity and reach millions around the world.

Learn More

Your cart is empty.

Keep shopping

Modern Web Scraping with Python using Scrapy Splash Selenium

Name: Modern Web Scraping with Python using Scrapy Splash Selenium
Rating: 4.4 (3831 reviews)

Become an expert in web scraping and web crawling using Python 3, Scrapy, Splash and Selenium 2nd EDITION (2021)

Created byAhmed Rafik

Last updated 5/2021

English

What you'll learn

Understand the fundamentals of Web Scraping
Scrape websites using Scrapy
Understand Xpath & CSS Selectors
Build a complete Spider from A to Z
Store the extracted Data in MongoDb & SQLite3
Scrape JavaScript websites using Splash & Selenium
Build a CrawlSpider
Understand the Crawling behavior
Build a custom Middleware
Web Scraping best practices
Avoid getting banned while scraping websites
Bypass cloudflare
Scrape APIs
Scrape infinite scroll websites
Working with Cookies
Deploy spiders locally and to the cloud
Run spiders periodically
Prevent storing duplicated data
Build datasets
Login to websites using Scrapy
Download images and files using Scrapy

Course content

18 sections • 128 lectures • 8h 50m total length

Intro to Web Scraping & Scrapy6:47
Setting up Scrapy the Development Environment (Updated)8:05
Add VSCODE to path (Mac users)0:26
Udemy 101 (Please don't skip*)1:21
Asking questions0:27

Downloadable files0:15
Hi everyone,
Due to some technical issues from Udemy side, some of you couldn't download the resources included in lecture 11 & 14. If that's the case for you please use the links I included below to download the resources needed for this section:
For lecture 11:
https://www.dropbox.com/s/bqrcr7a7vln0qwq/css_html_file.zip?dl=0
For lecture 14:
https://www.dropbox.com/s/zk2dbqr2b1talms/xpath_html_file.zip?dl=0
Please note, this article will be removed as soon as Udemy fixes this issue.
Kind regards,
Ahmed.
XPath & CSS Selectors2:53
CSS Selectors fundamentals9:13
CSS selectors in theory2:54
XPath fundamentals8:47
Navigating using XPath(Going UP)5:15
Navigating using XPath(Going DOWN)3:23
XPath in theory3:26

Requirements

Basics of Python
Internet access

Description

Web Scraping nowadays has become one of the hottest topics, there are plenty of paid tools out there in the market that don't show you anything how things are done as you will be always limited to their functionalities as a consumer.

In this course you won't be a consumer anymore, i'll teach you how you can build your own scraping tool ( spider ) using Scrapy.

You will learn:

The fundamentals of Web Scraping
How to build a complete spider
The fundamentals of XPath & CSS Selectors
How to locate content/nodes from the DOM using XPath & CSS
How to store the data in JSON, CSV... and even to an external database(MongoDb & SQLite3)
How to write your own custom Pipeline
Fundamentals of Splash
How to scrape Javascript websites using Scrapy Splash & Selenium
The Crawling behavior
How to build a CrawlSpider
How to avoid getting banned while scraping websites
How to build a custom Middleware
Web Scraping best practices
How to scrape APIs
How to use Request Cookies
How to scrape infinite scroll websites
Host spiders in Heroku for free
Run spiders periodically with a custom script
Prevent storing duplicated data
Deploy Splash to Heroku
Write data to Excel files
Login to websites using Scrapy
Download Files & Images using Scrapy
Use Proxies with Scrapy Spider
Use Crawlera with Scrapy & Splash
Use Proxies with CrawlSpider

What makes this course different from the others, and why you should enroll ?

First, this is the most updated course. You will be using Python 3.7, Scrapy 1.6 and Splash 3.0

You will have an in-depth step by step guide on how to become a professional web scraper.
You will learn how to use Splash & Selenium to scrape JavaScript websites and I can assure you, you won't find any tutorials out there that teaches how to really use Splash like I'll be doing in this course.
You will learn how to host spiders in Heroku as well as Splash(Exclusive).
You will learn how to create a custom script so spiders can run periodically without any intervention from you.

30 days money back guarantee by Udemy

So whether you are a data analyst who wants to add web scraping to his tool set or someone else who wants to learn how to extract unstructured data from unstructured HTML web pages and then store back that data in a structured way to apply some data analysis on it then you are welcome to join this course.

**STUDENTS THOUGHTS ABOUT THIS COURSE **

"I was particularly looking for web scraping using XPATHs and this course is addressing that. It also covers dynamic paging. A proper mix of theory and practical. A must-have for those who wants to do web scraping . GREAT learning experience !!! ". By Hiran Kumar

"90% of what I was searching for!!! Great job!! Clear explanations and great communication with Ahmed". By Raylyson Estanista

"Admed’s Web scraping course is awesome . His approach using Python with scrapy and splash works well with all websites especially those that make heavy use of JavaScript. Ahmed is a gifted educator: expert communicator, passionate, conscientious and accessible to his students. I highly recommend this course and any of Ahmed Rafik’s Udemy courses. ". By Richard Blackmon

"Great course, and a nice introduction to Scrapy (I'm someone with no Python experience whatsoever).". By I S

"Excellent course. Quick and thorough at the same time. Ahmed is incredibly responsive to the students and often replies to questions within minutes! Highest recommendation." By Robert Nolte

"That course is very good and explanation is crystal clear! The instructor is very supportive in case of questions. Highly recommended." By Shubina Ekaterina

"I like the course. Clear explanations and good comunication with Ahmed. All topics is interesting and full of information. I improved my skils in Scrapy. Author update course content by new videos. It's a big bonus) Explained more advance topics I never see in other courses. Thank you, Ahmed. Waiting for new videos)". By Ruslan Romanenko

Who this course is for:

Anyone who wants to scrape data from any website
Anyone who wants to learn Scrapy
Anyone who wants to automate the task of copying contents from websites
Anyone who wants to learn how to scrape Javascript websites using Scrapy-Splash & Selenium

Modern Web Scraping with Python using Scrapy Splash Selenium

What you'll learn

Explore related topics

Course content

Introduction5 lectures • 17min

Scrapy Fundamentals5 lectures • 30min

XPath expressions & CSS Selectors8 lectures • 36min

Project 1 Spiders from A to Z6 lectures • 21min

Building Datasets1 lecture • 4min

Project 2 Dealing with Multiple pages8 lectures • 23min

Debugging spiders3 lectures • 15min

Let's take a break !2 lectures • 4min

Project 3 Build Crawlers using Scrapy7 lectures • 21min

Splash crash course7 lectures • 31min

Requirements

Description

Who this course is for: