Python Regular Expressions - Real World Applied Python
4.5 (98 ratings)
Instead of using a simple lifetime average, Udemy calculates a course's star rating by considering a number of different factors such as the number of ratings, the age of ratings, and the likelihood of fraudulent ratings.
1,228 students enrolled
Wishlisted Wishlist

Please confirm that you want to add Python Regular Expressions - Real World Applied Python to your Wishlist.

Add to Wishlist

Python Regular Expressions - Real World Applied Python

Learn Cutting Edge Pattern Matching Skills for Log Mining, Big Data Parsing, Cleanup and Preparation with Regex
Bestselling
4.5 (98 ratings)
Instead of using a simple lifetime average, Udemy calculates a course's star rating by considering a number of different factors such as the number of ratings, the age of ratings, and the likelihood of fraudulent ratings.
1,228 students enrolled
Created by Chandra Lingam
Last updated 4/2017
English
English
Current price: $10 Original price: $100 Discount: 90% off
30-Day Money-Back Guarantee
Includes:
  • 3.5 hours on-demand video
  • 3 Articles
  • 8 Supplemental Resources
  • Full lifetime access
  • Access on mobile and TV
  • Certificate of Completion
What Will I Learn?
  • Confidently use regular expression as a powerful text processing tool for data parsing, cleanup and preparation
  • Minimize effort spent on custom development for data cleanup
  • Gain practical tips to let Regular Expression do bulk of the work for data preparation
  • Understand potential performance issues and techniques to address them
View Curriculum
Requirements
  • All material and software instructions are covered in house keeping lecture.
  • Familiarity with a Programming Language
Description

*** NEW PREVIEW VIDEOS: All lectures in Section 2 and Section 3 enabled for preview!  Check it out! 

     NEW: Project 4 Network Configuration Parser added to the course***

Python Regular Expressions is a hands-on course that teaches you everything you need to know about Regular Expression using Python Language. Master cutting edge pattern matching, data extraction and data cleanup skills in this course.

Regular Expression is a powerful text processing tool for log mining, data parsing, cleanup and preparation.  Power and elegance of regular expression allows you to do complex data extraction and cleanup with very few lines of code.

Over 60% of the effort in big data projects is spent on data cleanup and preparation. Data can come from variety of sources including internal databases, log files, sensor generated data, Twitter, Facebook and so forth.  Having access to powerful regular expression tool will open up lot of opportunities for you on how you look at your data and what can you do with it.

This course contains over 25+ hands-on exercises, practical tips, quizzes and four projects to apply the new skills you learned in this class.  In the first project, we will be extracting useful information from unstructured text data from Robocopy tool, in the second project we will be on working on large data set generated by Sensors and in the third project we will look in to Health Care Systems that deal with Electronic Medical Records. ***NEW***Fourth project added on parsing Network Interface Configuration.

These exercises will demonstrate that with regular expression you can implement complex parsing with only a few lines of code

As a bonus, you will receive an Python Interactive Tool with Source Code for learning regular expression faster.

This course uses free Anaconda Python distribution tool for development and exercise.  This is an all video lecture with quizzes, full source code, downloadable list of data and patterns used.

Who is the target audience?
  • Data Scientists, Software Engineers and Developers
Curriculum For This Course
Expand All 53 Lectures Collapse All 53 Lectures 03:37:52
+
Introduction
4 Lectures 14:54

Where can you use regular expressions?  Class structure overview and teaching methodology

Preview 03:25

**UPDATE: Please Read Lecture 3 article for how to run interactive tool on newer Anaconda distributions with PyQt5**

Setup development environment from scratch:

Anaconda Python Environment, Course Notebook files and Learn Regex GUI Tool setup, Data Setup and Overview of Python Regular Expression Quick Reference Guide

Setup Python Development Environment and Solution
07:45

Learn Regex Interactive Tool - Additional Details
00:20

iPython Notebook Overview and run course notebook files

Preview 03:24
+
Regular Expression Usage and Demo
7 Lectures 25:01

Regular Expression Terminology used in this course

Coding tips to correctly handle patterns using raw strings

Preview 03:08

Shows different ways in which you can find a match for pattern in text

Preview 04:48

How do get all matches for a pattern.  Different ways of retrieving the matches

Preview 03:31

How to break a pattern into sub-patterns using Groups.  How are groups beneficial

Preview 03:59

Shows different ways of doing a find-replace capability using patterns and custom replacement logic

Preview 05:37

Shows how to split a text using patterns

Preview 01:34

Shows differences between python string and raw string.  Why raw strings are better for patterns

Preview 02:24

Regular Expression Usage Quiz
6 questions
+
Regular Expression Interactive Tool
2 Lectures 09:55

Shows how to open, launch and use interactive regular expression tool that comes with this course.  This tool is written in Python and uses Qt.

Preview 09:32

Setup Help - Interactive Tool
00:23
+
Regular Expression Language Overview
9 Lectures 36:44

Let's start with simplest building block: Single Character Patterns and how to use Set based match

Preview 04:44

Singe Character Pattern Quiz
3 questions

How to find matches using NOT operation

How to specify a range of characters using shortcut available in regular expression language

Set Negation, Range Characters
02:33

How to specify different ranges of characters, use of wildcard, escape special characters and use of control characters

Multi-Range, Wildcards, Escape, Controls Characters
03:40

How to provide instructions to regular expression engine to fine tune criteria for matching

Anchors
06:50

Anchor Quiz
2 questions

How to write concise patterns using ready-made shortcuts known as character classes

Character Classes
05:19

How to specify repetition, upper and lower bounds, optional experssions

Quantifiers - Optional, Frequency of Occurence
05:57

Quantifiers Quiz
1 question

How to insert comments inside a regular expression pattern

Comments
03:15

If..Then...else style branching in regular expression

Conditional Expression
03:29

Summary of what we learned so far

Summary and Wrapup
00:57
+
Regular Expression Engine - Five Key Points
16 Lectures 01:04:19

Regular Expression Engine Overview

Regular Expression Engine Introduction
02:48

How Engine works internally to find a match

1. One Character at time
03:37

What order does Engine use to find a match

2. Left to Right
03:43

Left to Right Quiz
2 questions

What order does Engine use to find a match

2.1 Left to Right - One more example
05:38

Demo

Demo - Left to Right
02:34

How does engine handle quantifiers?  What does Greedy, Lazy and Back Tracking mean?

Greedy Example

3. Greedy, Lazy and Backtracking
05:52

Greedy Quiz
1 question

Lazy concept, example and demo

3.1 Lazy
04:45

How does engine perform exhaustive search to find the right match?

3.2 Exhaustive
04:45

Greedy hands-on demo

Demo - Greedy
02:38

Groups - What is it? How can you use it? Different types of groups

4. Groups - Indexed, Named, Non Capturing
06:53

Group Quiz
3 questions

Non capture groups to turn-off specific capability

Demo - Non Capture
00:31

How to refer back to a match inside the same pattern?  How to replace a matching text?

4.1 Groups - Back-reference, Substitution
03:24

What are look ahead and look behind? Where are they used and what capability does it provide

5. Look-Ahead and Look-Behind
03:12

Go forward only if certain pre-condition is met

5.1 Positive and Negative Look-Ahead
06:14

Go forward only if certain pre-condition is met before the current character

5.2 Positive and Negative Look-Behind
05:54

Demo - Negative Look-Ahead
01:51
+
Project 1 - Robocopy Log File Parsing with Regular Expression
2 Lectures 18:03

Overview of the project, how robocopy log is structured, what information is the solution going to extract

Overview and Scope
03:56

Strategy for solving the problem with regular expression, review of the solution and demo

Solution and Demo
14:07
+
Regular Expression Performance
7 Lectures 24:42

What can cause performance issues in patterns?

Introduction
00:34

What type of patterns to watch for performance issues

Patterns Exhibiting Exponential Run Time - Under Partial Match Scenarios
02:43

Why is the performance degrading?  What is happening behind the scenes?

Root Cause
04:30

Various options for addressing the performance issue

How to fix performance issues?
03:42

Hands-on overview of all the concepts, solution with demo

Demo - Performance issue and fix
04:44

What is the difference between compiled regular expression objects and module methods?  When to use compiled objects?

Compiled Versus Module Regular Expression Methods
03:36

Demo of comparison between module and compiled object methods

Demo - Module Methods and Compiled Methods Performance
04:53
+
Project 2 - Intelligent Sensor Data Handling with Regular Expression
3 Lectures 16:29

Introduction to sensors and how they collect and store data

Introduction
03:41

How to use regular expression to extract the data and convert to JSON. Solution walk-through and demo

Solution and Demo
12:14

Summary of how we use regular expressions

Summary and Wrapup
00:34
+
Project 3 - Health Care Electronic Medical Record
1 Lecture 07:30

In this project, we will use regular expression to extract information from hospital medical report files sent as HTML payload

Introduction, Solution and Demo
07:30
+
Project 4 - Network Configuration Parser
1 Lecture 00:03
Network Configuration Parser Problem
00:03
1 More Section
About the Instructor
Chandra Lingam
4.5 Average rating
307 Reviews
7,204 Students
5 Courses
Data Scientist and Solutions Architect

Chandra Lingam spent 15 years at Intel, developing and managing systems that handled hundreds of terabytes of worldwide factory data.  Chandra is an expert on Amazon Web Services, mission critical systems and machine learning.  He has a Master's degree in Computer Science from ASU and Bachelor's degree in Computer Science from Thiagarajar College of Engineering, Madurai.  

Chandra is the author of popular iOS educational apps Geometry Test, Math Stripes and Arithmetic Test.