Udemy

Modern Web Scraping with Python

Harness the power of Scrapy, BeautifulSoup and Selenium to boost your webscraping game!
Rating: 4.4 out of 5 (298 ratings)
18,110 students
Created by Jordan Sauchuk, Ligency Team
Last updated 3/2021
English
English [Auto]
30-Day Money-Back Guarantee

What you'll learn

  • Understand the most important components for web scraping
  • Build your own web scraping projects
  • Learn core components of two of the most powerful scraping libraries: Scrapy and BeautifulSoup
  • Build multiple hands-on projects
Curated for the Udemy for Business collection

Requirements

  • Internet Connection
  • Windows, Linux or Mac OS along with Anaconda and Spyder

Description

Getting access to the data you need can make or break you.

This is why Fortune 500 companies like Walmart, CNN, Target, and HSBC use web scraping to get ahead and stay ahead with data.

It’s the original growth tool and one of their best-kept secrets


…And it can easily be yours too.


From data spoofing to legalities, crawling libraries, maintenance, monitoring, and more, building a safe and effective web scraper is risky business, but it’s a skill every data scientist needs in their toolkit.


Today, we’re building one from scratch.


Hi, my name is Jordan Sauchuk. I’m an AI & Cybersecurity Engineer and a SuperDataScience instructor. I’m here to guide you step-by-step in building custom web scrapers in Python using Selenium, Scrapy and BeautifulSoup.


Welcome to Modern Web Scraping with Python.


At the end of this course, you will understand the most important components of web scraping and be able to build your own web scrapers to obtain new data, optimize internal processes and more.


Plus, familiarize yourself with some of the most common scraping techniques and sharpen your Python programming skills while you’re at it!


  1. First, learn the essentials of web scraping, explore the framework of a website and get your local environment ready to take on scraping challenges with Scrapy, BeautifulSoup, and Selenium.

  2. Next, set up a Scrapy crawler and cover the core details that can be applied to building datasets or data mining.

  3. Third, cover the basics of BeautifulSoup, use the requests library and the lxml parser, and scale up to deploy a new scraping algorithm that collects top product information from Amazon.

  4. Fourth, set up Selenium, and deploy it to solve a practical, real-world challenge. Plus, submit your solution to obtain useful feedback from me.

  5. Finally, test your newfound skills on a cybersecurity project that involves you finding highly-sensitive data.
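
To make the extraction step in the outline above concrete, here is a minimal sketch of pulling product titles and prices out of a page. It is not the course's code: it uses Python's standard-library `html.parser` as a stand-in for BeautifulSoup so that it runs without extra installs, and the markup, class names, and `ProductParser` helper are hypothetical. With BeautifulSoup, the same extraction would typically be a few `find_all` or `select` calls.

```python
from html.parser import HTMLParser

# Hypothetical product listing markup; a real scraper would fetch the page first.
SAMPLE_HTML = """
<ul id="bestsellers">
  <li class="product"><span class="title">Widget A</span><span class="price">$9.99</span></li>
  <li class="product"><span class="title">Widget B</span><span class="price">$14.50</span></li>
</ul>
"""


class ProductParser(HTMLParser):
    """Collects text from <span class="title"> and <span class="price"> elements."""

    def __init__(self):
        super().__init__()
        self.products = []   # one dict per <li class="product">
        self._field = None   # field held by the <span> currently open, if any

    def handle_starttag(self, tag, attrs):
        attr_map = dict(attrs)
        if tag == "li" and attr_map.get("class") == "product":
            # A new product starts: open an empty record for it.
            self.products.append({})
        elif tag == "span" and attr_map.get("class") in ("title", "price"):
            self._field = attr_map["class"]

    def handle_data(self, data):
        # Only keep text that sits inside a title or price span.
        if self._field and self.products:
            self.products[-1][self._field] = data.strip()

    def handle_endtag(self, tag):
        if tag == "span":
            self._field = None


parser = ProductParser()
parser.feed(SAMPLE_HTML)
products = parser.products

for product in products:
    print(product["title"], product["price"])
```

The parser keeps one small piece of state (which field is currently open), which is the same idea a library selector hides from you: find the element, then read its text.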


We’ll be coding in Python and using the automated testing suite Selenium, the Python framework Scrapy, and the BeautifulSoup library to build web scrapers that can be customized to your specific needs.


But a thorough walk-through isn’t all you’re getting.


Access our student forum where you can interact with me and your fellow students. Ask me questions, receive input from other students and be inspired by the clever scraping solutions from your classmates.


Whether you’re a data scientist, machine learning engineer, or AI engineer who wants to access more data sources, a web developer looking to automate tasks, or a data buff with a general interest in data science and web scraping…


This course delivers an in-depth presentation of web scraping basics, methodologies and approaches that you can easily apply to your own personal projects, or out there in the real world of business.


Join me now and let’s start scraping the web together. Enroll today.


Who this course is for:

  • Anyone interested in harnessing the power of data, web scraping/crawling, and data mining.
  • Data Scientists who want to take their skills to the next level
  • ML/AI engineers who want to put together new sources of information or datasets
  • Web developers looking to obtain new information or automate tasks
  • Anyone interested in programming or computer science
  • Software engineers or programmers looking to expand their skill set

Course content

7 sections • 52 lectures • 3h 14m total length

  • Bonus 1: Using Droplet
    00:01
  • Bonus 2: Using Scrapy vs Using BeautifulSoup
    00:01
  • Bonus 3: Starting your First Scrapy Project
    00:01

  • Preview
    03:22
  • Preview
    01:36
  • Study Tips For Success
    00:37
  • Simple HTTP Request
    01:50
  • What is a website + the DOM
    02:42
  • Sublime
    03:57
  • The Inspector
    03:05
  • Traversing The Tree
    02:00
  • Robots.txt
    04:15
  • Anaconda Update Notice
    01:11
  • Preview
    01:33
  • Setting up your virtual environment
    04:41
  • Changes
    00:42
  • Get The Files Here!
    00:55
  • Your Shortcut To Becoming A Better Data Scientist
    02:04
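
The Robots.txt lecture listed above concerns the file a site publishes to tell crawlers which paths they may fetch. As a rough illustration only (not taken from the course), the sketch below checks URLs against a robots.txt using Python's standard-library `urllib.robotparser`. The rules, the `example-scraper` user agent, and the example.com URLs are all made up, and the file content is inlined so the sketch runs offline.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, inlined so no network request is needed.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
Crawl-delay: 5
"""


def build_parser(robots_text: str) -> RobotFileParser:
    """Return a parser loaded with the given robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
agent = "example-scraper"  # hypothetical user agent name

# Ask whether this agent may fetch each URL under the rules above.
allowed_public = parser.can_fetch(agent, "https://example.com/products/")
allowed_private = parser.can_fetch(agent, "https://example.com/private/data")

# Seconds the site asks crawlers to wait between requests, if stated.
delay = parser.crawl_delay(agent)

print(allowed_public, allowed_private, delay)
```

Against a live site, `set_url()` followed by `read()` would load the real file instead of the inlined text. Note that robots.txt is a convention, not a legal guarantee in either direction.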

  • Preview
    03:07
  • Starting your Scrapy Project
    03:21
  • Building your first Spider
    08:38
  • Scrapy Shell
    03:05
  • Selectors
    03:26
  • Scrapy Items
    07:19
  • Request + Response Quick
    01:13
  • Traversing Options
    07:19
  • Challenge Intro
    01:32
  • Challenge Solution
    15:26
  • Scrapy Cloud
    03:55
  • Scrapy Recap Quiz
    10 questions

  • Preview
    03:10
  • BS Requests
    03:39
  • BS Basics
    05:41
  • BS Basics cont.
    07:27
  • BS Challenge Intro
    03:13
  • BS Challenge Solution
    12:20
  • BS Recap Quiz
    10 questions

  • GitHub
    00:44
  • Section Intro + Selenium Installation
    05:16
  • Resources for Selenium
    00:13
  • Selenium Notes
    00:25
  • Selenium - Click Revised
    06:45
  • Selenium Login Keys
    07:30
  • Selenium Challenge Intro
    02:22
  • Recommended: Build Your Own Approach First
    00:50
  • Selenium Challenge Pt. 1
    08:11
  • Selenium Challenge Solution
    11:21
  • Selenium Project Feedback
    02:29

  • Introduction
    02:24
  • Recommended: Build Your Own Approach First
    00:52
  • Solution Pt. 1
    06:51
  • Solution Pt. 2
    07:19
  • Github Submission
    01:38

  • *** YOUR SPECIAL BONUS ***
    00:57

Instructors

Jordan Sauchuk
Senior AI Advisor & Cybersecurity Engineer
  • 4.4 Instructor Rating
  • 658 Reviews
  • 43,194 Students
  • 3 Courses

I'm a Senior AI Advisor, an AI & Cybersecurity Engineer, and the CEO of Global Pioneers. My goal is to share the experience and expertise I have gained through extensive research, many late nights, and cups of coffee.

I have been fortunate enough to work on a range of challenging projects worldwide, and I have extensive experience with technologies such as Python, R, JavaScript, C++, PHP, TensorFlow, PyTorch, scikit-learn, Keras, ReactJS, NodeJS, SQL, Plotly, Tesseract, Seaborn, and much more.

Recently, I have launched the Threat Intelligence Group as a means of providing cybersecurity awareness, integrating AI into the security domain, and also providing consulting services.

For more information please feel free to reach out to me on LinkedIn and Twitter. 

Ligency Team
Helping Data Scientists Succeed
  • 4.5 Instructor Rating
  • 470,219 Reviews
  • 1,689,164 Students
  • 109 Courses

Hi there,

We are the Ligency PR and Marketing team. You will be hearing from us when new courses are released and when we publish new podcasts and blogs, share cheatsheets, and more!

We are here to help you stay on the cutting edge of Data Science and Technology.

See you in class,

Sincerely,

The Real People at Ligency

© 2021 Udemy, Inc.