50 Hrs Big Data Mastery: PySpark, AWS, Scala & Data Scraping

Name: 50 Hrs Big Data Mastery: PySpark, AWS, Scala & Data Scraping
Rating: 4.1 (228 reviews)

Comprehensive Big Data Mastery: Scala, Spark, PySpark, AWS, Data Scraping & Data Mining with Python, and MongoDB

Created by AI Sciences, AI Sciences Team

Last updated 12/2025

English

What you'll learn

Introduction and importance of this course in this day and age
Approach all essential concepts from the beginning
Clear unfolding of concepts with examples in Python, Scrapy, Scala, PySpark and MongoDB
All theoretical explanations followed by practical implementations
Data Scraping & Data Mining for Beginners to Pro with Python
Master Big Data with Scala and Spark
Master Big Data With PySpark and AWS
Mastering MongoDB for Beginners
Building your own AI applications

Course content

4 sections • 623 lectures • 54h 39m total length

Introduction: Why Data Scraping2:42
Explore how data scraping extracts internet data for research, analysis, and machine learning, and why it’s a high-demand, high-pay skill with freelance and professional opportunities.
Introduction: Applications of Data Scraping7:09
Introduction: Introduction of Instructor0:40
Introduction: Introduction to Course, Scraping, Tools1:39
Introduction: Projects Overview3:42
Introduction: Request for Your Honest Review1:18
Explore remaining sections to judge how concepts are presented and whether the content merits five-star ratings in the Udemy review system, then we update the course to ensure your satisfaction.
Requests: Introduction to Python Requests3:57
Requests: Hands-on with Requests8:28
Use the Python requests module to fetch a web page, inspect the HTML response, and use status codes to validate the request for effective web scraping and data extraction.
Requests: Extracting Quotes Manually10:05
Practice using the requests module to fetch a server response, extract text and emails, parse HTML to pull quotes, and save the results to a file.
Requests: Quiz(Extracting Authors)0:40
Participate in a quiz that requires extracting author names from a code diff and saving them in a file, noting the two codes and their order.
Requests: Solution(Extracting Authors)6:11
Requests: Pagination9:46
Requests: Quiz(Extracting Author and Quotes)0:58
Requests: Solution 01(Extracting Author and Quotes)6:27
Requests: Solution 02(Extracting Author and Quotes)5:52
Extract quotes and author names from a structured response by iterating lines, saving quotes, and pairing each quote with its following author name, then write to a file.
Requests: Ajax Requests6:36
Requests: Ajax Requests for Cricinfo8:25
Learn to fetch Cricinfo data with the requests module, parse JSON with json.loads, and extract authors and news summaries from a list of dictionaries, including pagination across pages.
Requests: Ajax Requests Pagination3:53
Requests: Quiz(Extracting Top Stats from Cricinfo)1:22
Requests: Solution 01(Extracting Top Stats from Cricinfo)7:16
Inspect ajax requests with the browser network panel to identify the API endpoints that fetch data as you scroll, then replicate these requests locally to extract top stats from Cricinfo.
Requests: Solution 02(Extracting Top Stats from Cricinfo)9:17
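
The Ajax lectures above describe parsing a JSON response with json.loads and pulling authors and summaries out of a list of dictionaries across several pages. A minimal sketch of that idea follows. The payload and the field names ("author", "summary") are hypothetical stand-ins, not the real Cricinfo API. In practice each raw string would come from the text of a requests response.

```python
import json

# Hypothetical payloads shaped like two pages of a paginated news API.
# The field names "author" and "summary" are assumptions for illustration.
raw_pages = [
    '[{"author": "A. Writer", "summary": "Match report one"},'
    ' {"author": "B. Reporter", "summary": "Match report two"}]',
    '[{"author": "C. Analyst", "summary": "Season preview"}]',
]


def extract_stories(raw_json):
    """Parse one JSON page and return a list of (author, summary) pairs."""
    stories = json.loads(raw_json)  # a list of dictionaries
    return [(s.get("author", ""), s.get("summary", "")) for s in stories]


def collect_all(pages):
    """Walk every page in order and gather the extracted pairs."""
    results = []
    for page in pages:
        results.extend(extract_stories(page))
    return results


all_stories = collect_all(raw_pages)
for author, summary in all_stories:
    print(f"{author}: {summary}")
```

Using dict.get with a default keeps the loop from failing when a story lacks one of the assumed fields.
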
Beautiful Soup 4 (BS4): Introduction to BS4 3:02
Beautiful Soup 4 (BS4): Quiz(Difference between Requests and BS4)0:25
Beautiful Soup 4 (BS4): Solution(Difference between Requests and BS4)1:04
Beautiful Soup 4 (BS4): Hands on with BS4 5:54
Beautiful Soup 4 (BS4): Extracting Data from Tree8:50
Beautiful Soup 4 (BS4): Extracting Quotes from the Website7:33
Beautiful Soup 4 (BS4): Quiz(Extracting Author Names)0:38
Beautiful Soup 4 (BS4): Solution(Extracting Author Names)5:28
Use Python with requests and BeautifulSoup to extract author names from HTML by targeting small tags with class author, then write the names to a CSV file.
Beautiful Soup 4 (BS4): Attributes of Tags in BS4 9:10
Beautiful Soup 4 (BS4): Multi Valued Attributes of Tags in BS4 3:55
Beautiful Soup 4 (BS4): Scraping Movie Names from IMDB19:31
Beautiful Soup 4 (BS4): Quiz(Getting the Ratings, Year, Name of the Movie)0:55
Beautiful Soup 4 (BS4): Solution 01(Getting the Ratings, Year, Name of the Movie)7:00
Fetch movie name, year, and IMDb rating by scraping HTML with requests and BeautifulSoup, then parse the table body and rows to extract data safely.
Beautiful Soup 4 (BS4): Solution 02(Getting the Ratings, Year, Name of the Movie)7:08
Beautiful Soup 4 (BS4): Scraping Time, Genre and Releasing Date from IMDB 01 6:56
Beautiful Soup 4 (BS4): Scraping Time, Genre and Releasing Date from IMDB 02 17:21
Beautiful Soup 4 (BS4): Combining Two Requests Data for IMDB6:50
Beautiful Soup 4 (BS4): Movies Recommender System (Creating Movie Url)6:26
Beautiful Soup 4 (BS4): Movies Recommender System (Creating Director Url)6:10
Beautiful Soup 4 (BS4): Movies Recommender System using BS4(Getting Top 4 Movies)8:55
Beautiful Soup 4 (BS4): Movies Recommender System using BS4(Merge All Requests Together)4:02
CSS Selectors: Introduction to CSS Selectors2:49
Explore the basics of CSS selectors, how they target specific DOM elements, and how to inspect, highlight, and extract text or attributes from targeted regions.
CSS Selectors: CSS Selectors Handson(Tags)5:17
CSS Selectors: Quiz(Tags)1:08
Explore CSS selectors through a quiz that focuses on extracting specific tags like span and paragraphs from a sample page. Apply real-world tag patterns to precise selectors.
CSS Selectors: Solution(Tags)2:15
CSS Selectors: CSS Selectors Handson(Descendants, Id, Class)7:04
CSS Selectors: Quiz(Descendants)0:49
Practice writing a CSS selector to target the two nested span elements inside a div, using descendant selectors, in this quick quiz.
CSS Selectors: Solution(Descendants)1:50
CSS Selectors: Quiz(ID)0:44
CSS Selectors: Solution(ID)1:46
CSS Selectors: Solution(Class)1:00
CSS Selectors: Solution(Class)3:16
CSS Selectors: CSS Selectors Handson(Nested Tags, ID Tags, Class Tags)4:32
CSS Selectors: Quiz(Class with Tag)0:40
CSS Selectors: Solution(Class with Tag)2:26
Explore CSS selectors to target a specific element by combining its tag name with its class. Use tag and class combinations to limit selections to the desired element.
CSS Selectors: CSS Selectors Handson(Comma Separator, Universal Selectors)6:31
CSS Selectors: Quiz(Combining Two Selectors)0:46
CSS Selectors: Solution(Combining Two Selectors)2:48
CSS Selectors: CSS Selectors Handson(Sibling Notations and Direct Child)7:24
Learn adjacent and general sibling selectors in CSS, using plus and tilde, and master direct child selectors for precise, immediate element targeting.
CSS Selectors: Quiz(Adjacent Sibling)0:45
CSS Selectors: Solution(Adjacent Sibling)2:38
Explore how to correctly apply the adjacent sibling selector by first identifying a unique element, then selecting its adjacent sibling to avoid unintended matches in CSS.
CSS Selectors: Quiz(General Sibling)0:57
CSS Selectors: Solution(General Sibling)2:59
CSS Selectors: CSS Selectors Handson(Child Selectors)7:19
CSS Selectors: Quiz(First Child)0:40
Practice writing a CSS selector to target only this div in a nested structure, focusing on the first-child concept, and review the solution in the next video.
CSS Selectors: Solution(First Child)3:49
Master CSS selectors for reliably selecting the first child, explain why first-child can select multiple elements, and show how to use an id-based path to target a specific first child.
CSS Selectors: Quiz(Only Child)0:40
CSS Selectors: Solution(Only Child)2:58
Master CSS selectors by applying first-child and only-child approaches to locate a specific element in a DOM, as demonstrated with practical browser inspection.
CSS Selectors: Quiz(Last Child)0:44
CSS Selectors: Solution(Last Child)3:10
CSS Selectors: CSS Selectors Handson (Negations, Attributes)6:36
CSS Selectors: Quiz(Negation)0:41
Explore negation in CSS selectors by identifying a selector that targets all child divs except the first one inside a container; practice with a quick quiz.
CSS Selectors: Solution(Negation)2:06
CSS Selectors: CSS Selectors Handson (Attributes, Attributes Values)3:51
CSS Selectors: Quiz(Attributes Values)0:39
CSS Selectors: Solution(Attributes Values)3:26
Explore how to use CSS selectors to filter elements by attribute values, focusing on random attributes and narrowing with span to select specific elements.
CSS Selectors: CSS Selectors Handson (Attributes Wild Cards Values)6:25
Discover how to use CSS selectors to match attribute values with starts with, ends with, contains, and wildcards, including case sensitivity, for precise element selection.
CSS Selectors: Quiz(Attributes Wild Card)0:50
CSS Selectors: Solution(Attributes Wild Card)2:49
Scrapy: Introduction to Scrapy4:10
Explore Scrapy as a fast, powerful Python framework for crawling websites, extracting structured data, and enabling asynchronous data pipelines with easy extensibility and cross-platform support.
Scrapy: Comparison of Scrapy and Requests3:40
Scrapy: Scrapy at a Glance Documentation8:31
Learn how to use the Scrapy framework to crawl websites, extract structured data, and build spiders with Python, requests, callbacks, and css selectors.
Scrapy: Getting Started with Scrapy11:04
Scrapy: Running Documentation Spider 1 3:25
Scrapy: Running Documentation Spider 2 12:00
Scrapy: Writing Spider from the Scratch7:23
Create a new Scrapy project from scratch, organize it in a dedicated folder, and define a class inheriting from the Scrapy spider to use start URLs and handle responses.
Scrapy: Understanding the Response(url, Status)7:09
Scrapy: Understanding the Response(headers)4:12
Scrapy: Understanding the Response(values in headers)6:51
Scrapy: Understanding the Response(body)6:04
Scrapy: Understanding the Response(request)4:41
Scrapy: Understanding the Response(meta)8:29
Learn how Scrapy uses the response meta to transfer data between callbacks, by passing a dictionary through requests across redirects to combine extracted information.
Scrapy: Understanding the Response(flags, certificate, ip_address, copy)5:16
Learn how Scrapy exposes flags, certificate information, server IP address, and the ability to copy a response for testing, logging, and debugging, with emphasis on response status such as 200.
Scrapy: Understanding the Response(replace, urljoin, follow, follow_all)8:07
Learn how to manipulate Scrapy responses with replace and urljoin, and use response.follow and response.follow_all to follow links, handle relative URLs, and chain requests with callbacks.
Scrapy: Response CSS and Scrapy Shell9:26
Scrapy: Extracting quotes5:47
Scrapy: Understanding Nested selectors10:02
Scrapy: Extracting the Author and Quotes10:05
Scrapy: Checking for Next Page7:36
Scrapy: Checking for Next Page in Spider5:36
Scrapy: Checking for Next Page URL8:16
Scrapy: Scraping Quotes from Next Pages11:07
Scrapy: Exporting Extracted Data3:26
Learn how to export scraped data to a csv file with Scrapy crawl, specifying the output file, ensuring the spider name matches the file, and cleaning the file before export.
Scrapy: Quiz(Get The Tags)0:58
Write a Scrapy spider to extract the quote, author, and associated tags from a page, then output the author and comma-separated tag values.
Scrapy: Solution(Get The Tags)7:30
Scrapy: Next Website1:57
Scrapy: CSS Selectors for Movie Names and URLs12:09
Learn to build a Scrapy project, create a spider, and use CSS selectors to extract movie names and URLs from IMDb pages, including anchor text and href attributes.
Scrapy: Combined CSS Selectors for Movie Names and URLs3:22
Scrapy: Sending Request to the Film Info Page4:35
Scrapy: Merge Data from Two Callbacks8:44
Scrapy: Extracting Movie Duration and Genres6:59
Scrapy: Exporting the Extracted Data5:34
Export scraped IMDb data with Scrapy by building and yielding dictionaries of movie name, duration, and genres, and save output with -o while tuning concurrency for parallel requests.
Scrapy: Quiz(Extracting the Year)1:18
Scrapy: Solution(Extracting the Year)10:08
Learn to build a Scrapy spider that scrapes IMDb to extract movie names and release dates, using anchor tags and CSS selectors to navigate pages and export data.
Scrapy: Getting Director Name and Url8:21
Scrapy: Getting Top Four Movies of Directors9:28
Scrapy: Extracting Data Anomaly (dont_filter Flag)9:50
Scrapy Project: Hugoboss website for scraping2:30
Scrapy Project: Understanding Site Structure7:11
Scrapy Project: Writing CSS Selectors for Listings7:43
Learn to craft a CSS selector to extract listings from a website, selecting the relevant anchor tags and handling mobile and desktop variants with unique classes.
Scrapy Project: Listings in Scrapy Shell4:20
Scrapy Project: Sending Request to Listings Urls7:23
Master sending requests to listing URLs with Scrapy by switching from response.follow to Scrapy's Request, iterating over category pages, and printing product listings for each category.
Scrapy Project: Extracting Products Url from the Listings11:02
Scrapy Project: Sending Requests to Products of the Listings5:02
Scrapy Project: Writing CSS for getting the Product Info16:55
Scrapy Project: Getting the bigger Images of the Product7:54
Learn to fetch bigger product images in a Scrapy project by swapping URL parameters and using Python to split on the question mark and assemble bigger image URLs.
Scrapy Project: Checking Next Page Url13:57
Scrapy Project: Adding Pagination to Spider and Running it9:40
Master Scrapy pagination by teaching a spider to detect next page buttons, issue requests to subsequent pages, and reuse the same callback to extract products across categories.
Scrapy Project: Output of the Spider4:36
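
The image lecture above describes building a larger image URL by splitting the original URL on the question mark and attaching different parameters. A minimal sketch of that string manipulation follows. The example URL and the size parameters ("wid", "hei") are hypothetical and are not taken from the course or the target website.

```python
def bigger_image_url(url, size_query="wid=1200&hei=1200"):
    """Keep everything before the first '?' and attach a new query string.

    The default size_query is a hypothetical example. The real parameter
    names depend on the site being scraped.
    """
    base = url.split("?", 1)[0]  # the part of the URL before the query
    return f"{base}?{size_query}"


# Hypothetical thumbnail URL used only for illustration.
thumb = "https://example.com/images/product_001.jpg?wid=200&hei=200"
big = bigger_image_url(thumb)
print(big)
```

Splitting with a maximum of one split also handles a URL that has no query string, because the whole URL is then returned as the base.
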
Selenium: Introduction To Selenium2:12
Selenium: Getting Started with Selenium3:36
Selenium: Configuring the Webdriver3:40
Selenium: Extracting Quotes10:16
Selenium: Extracting Quotes and Author Names7:17
Selenium: Quiz(Extracting Quotes)0:41
Selenium: Solution(Extracting Quotes)7:22
Selenium: Clicking on Button5:01
Selenium: Pagination and Extracting Data8:06
Selenium: Exception Handling for Unavailable Element5:41
Learn how to use Selenium to handle unavailable elements with try-except blocks during pagination, preventing script termination while extracting quotes and authors from successive pages.
Selenium: Navigating the Website for Login9:37
Selenium: Quiz(Log in and Extract Quote)0:43
Selenium: Solution(Log in and Extract Quote)7:03
Project Selenium: Overview of Project1:28
Project Selenium: Closing the Cookie Button3:26
Project Selenium: Setting the Language for Translation5:50
Project Selenium: Sending the Text for Translation3:46
Project Selenium: Downloading the Translation3:55
Automate a translation workflow with Selenium by entering text, waiting for translation, and triggering a file download using element selectors and a deliberate delay.
Project Selenium: Reading Data from File for Translation3:44
Read text from a local file and automate a Selenium-based translation workflow, sending text to a website, waiting for translation, and downloading the translation to the local machine.
Project Selenium: THANK YOU Bonus Video1:20
Link for the Course's Materials and Codes0:09

Link for the Course's Materials and Codes0:09
Introduction: Why Scala4:56
Introduction: Scala Applications3:27
Introduction: About the Instructor0:50
Introduction: Introduction to Course3:30
Introduction: Projects Overview2:55
Introduction: Request for Your Honest Review1:18
Overview: What is Scala2:02
Explore Scala, a concise high-level language that blends object-oriented and functional programming. Learn its compatibility with the JVM, access to Java libraries, and how Scala integrates these paradigms.
Overview: Scala Setup (Local Machine)9:46
Overview: Scala Setup (Online)5:05
Overview: Variables in Scala9:18
Overview: Arithmetic Operations on Variables-1 5:55
Explore basic arithmetic with variables by declaring integers, performing addition, subtraction, multiplication, and division, and printing results to the console, including integer division behavior.
Overview: Arithmetic Operations on Variables-2 9:39
Overview: Quiz (Arithmetic Operations)0:55
Practice a quick Scala quiz by declaring three integer variables A, B, and C and implementing an arithmetic equation to check your understanding.
Overview: Solution (Arithmetic Operations)8:17
Overview: Quiz (Strings)0:42
Overview: Solution (Strings)7:06
Overview: Type Casting11:23
Overview: Taking input from User5:47
Take user input in Scala using the readLine function, treat input as a string by default, and convert to integers to sum two numbers, avoiding string concatenation.
Overview: Quiz (User Input and Type Casting)0:29
Overview: Solution (User Input and Type Casting)3:50
Flow Control: Overview of Control Statements3:49
Explore flow statements in Scala, including if-else statements and loops, which control conditional execution and repeat code blocks, with examples like birthday wishes and continue statements.
Flow Control: If else statements6:29
Learn flow control with if else statements and how conditions decide which code runs. See examples that compare numbers and print outcomes.
Flow Control: Conditions in If6:10
Flow Control: Quiz (if statement)1:27
Master flow control with an if statement by building a quiz that checks the playland entrance age, allows entry only if older than 13, and prints welcome or underage messages.
Flow Control: Solution (if statement)4:17
Flow Control: Nested if else7:37
Master nested if statements to control flow with conditions, inputs, and else branches. Practice prompting for two numbers, testing greater than 10, and computing sums when conditions hold.
Flow Control: Quiz (nested if else)1:06
Flow Control: Solution (nested if else)5:15
Flow Control: Logical operators10:27
Flow Control: Quiz (Logical operators)0:43
Flow Control: Solution (Logical operators)6:27
Demonstrates flow control using logical operators by prompting age and height, applying an and condition to allow entry if age is over 30 and height is at least five feet.
Flow Control: If else if7:01
Master flow control with if, else, and else if in Scala, learning how to evaluate conditions and nest checks to guide program execution.
Flow Control: Quiz (if else if)1:00
Flow Control: Solution(if else if)7:54
Flow Control: Overview of Loops1:52
Flow Control: Overview of While Loop5:34
Flow Control: While Loop8:17
Flow Control: Quiz (while loop)1:18
Flow Control: Solution 1 (while loop)12:00
Flow Control: Solution 2 (while loop)4:31
Flow Control: Do While Loop5:23
Flow Control: For Loop9:21
Flow Control: Quiz (For Loop)1:13
Flow Control: Solution (For Loop)6:53
Flow Control: Quiz(For Loop)1:20
Flow Control: Solution(For Loop)12:26
Flow Control: Break7:30
Explore how the break statement stops loops in for and while constructs by evaluating a condition inside the loop, including a zero exit and a running sum.
Flow Control: Break Fix3:45
Flow Control: Project Overview4:19
Explore flow control with a fortune game project, using loops and if statements to provide hints, track five guesses, and handle win, loss, and random number generation.
Flow Control: Project Solution Design5:58
Design a Scala flow-control project building a number-guessing game with a 0-100 range, using for or while loops and if-else logic, plus random number generation.
Flow Control: Project Solution Code 1 8:07
Flow Control: Project Solution Code 2 5:11
Flow Control: Project Solution Code 3 8:18
This lecture demonstrates flow control in the project, using a for loop and a game status variable to determine win or loss, including future random number generation.
Flow Control: Project Solution Code 4 7:34
Functions: Overview of Functions5:57
Explore how a function is a reusable block of code with parameters, a return type, and a body, and learn how to declare it in Scala to avoid repetition.
Functions: Writing addition function8:10
Functions: Quiz (Basic Function)0:52
Functions: Solution (Basic Function)5:18
Learn to write a basic function that takes two integers, compares them with an if statement, and returns the greater value, including input handling and function calls.
Functions: Functions common issues5:24
Functions: Named Arguments5:11
Learn how named arguments let you pass function parameters in any order by mapping values to parameter names. This technique helps manage functions with many parameters and avoids type mismatch.
Functions: Quiz (String Concatenation Function)0:46
Functions: Solution (String Concatenation Function)3:08
Functions: Quiz (Dividing Code in Functions)1:12
Functions: Solution (Dividing Code in Functions)9:55
Functions: Default Arguments7:08
Functions: Quiz(Default Arguments)1:44
Functions: Solution(Default Arguments)10:00
Learn to implement Scala functions for bill calculation: get bill amount, get discount, apply discount with a default of $10 when zero, and print both discounted and actual bills.
Functions: Anonymous Functions5:15
Functions: Quiz(Anonymous Functions)1:11
Functions: Solution(Anonymous Functions)4:55
Implement four anonymous two-parameter functions for add, subtract, multiply, and divide, then integrate them into a complete equation and print the result.
Functions: Scopes10:44
Explore scope in programming by showing how variables declared inside braces are accessible within those blocks and how global variables differ from local ones across functions.
Functions: Project Overview4:04
Functions: Checking Credentials7:12
Functions: Prompting the menu7:50
Design a simple menu-driven program that prompts the user to check balance, withdraw, deposit, or quit. Start with a main function and refactor into balance, withdraw, and deposit functions.
Functions: Basic Functions8:12
Explore the use of a global balance variable inside functions to check balance, withdraw, and deposit funds, and update the balance accordingly.
Functions: Breaking code in more functions12:28
Break the loop when credentials are valid and extract code into modular functions for taking credentials, showing the menu, and making transactions, improving readability and maintainability.
Functions: Final Run5:11
Classes: Introduction to Classes1:59
Classes: Creating Class6:46
Classes: Class Constructor4:29
Classes: Functions and Classes13:24
Create classes that host variables and functions, access them via methods, and instantiate objects with constructors. Define function to print data and another to return the name with greater semester.
Classes: Project Overview2:32
Practice building a class number that stores a value and exposes comparison methods with another class object. Return true if the parameter’s value is greater than the calling instance.
Classes: Basic Structure6:20
Classes: Final Run3:30
Data Structures: Introduction of Data Structures2:46
Data Structures: Lists introduction4:59
Data Structures: Lists Create and Delete Elements6:08
Learn how Scala lists are immutable and require creating new lists to add elements, using start and end appends, and nesting lists to build complex structures.
Data Structures: Lists Take3:38
Learn to extract elements from a list by slicing, taking the first few elements into a new list up to a given index without changing the original.
Data Structures: ListBuffer Introduction4:38
Data Structures: Add data in ListBuffer3:43
Data Structures: Remove data from ListBuffer3:13
Data Structures: Take data from ListBuffer2:32
Data Structures: Project Overview3:35
Data Structures: Project Architecture Discussion4:52
Data Structures: Project Architecture Implementation10:17
Data Structures: User Input for Objects5:34
Prompt users for item, price, and count, cast types, create objects, and append them to a list buffer, turning input into structured data for processing.
Data Structures: Implementing the control flow6:11
Data Structures: Creating Required Functions inside Class7:25
Learn to implement required functions inside a class and manage data with a list buffer to print items, prices, counts, and compute the total grocery bill.
Data Structures: Overview of Maps2:53
Data Structures: Creating Maps4:00
Data Structures: Check Key in Map3:15
Data Structures: Update Value in Map3:04
Data Structures: Add and Remove items from Maps5:04
Data Structures: Iterating on Maps3:12
Data Structures: Project Overview1:57
Data Structures: Project Architecture5:54
Data Structures: Project Structure Code3:26
Data Structures: Using Maps for word count7:05
Data Structures: Final Run7:46
Data Structures: Sets Overview5:01
Data Structures: Add and Remove Item from the Set3:23
Learn how to add and remove items in a set, preserving only unique elements using the plus-equals and minus-equals notation, with practical examples.
Data Structures: Set Operations3:41
Data Structures: Overview of Stack1:59
Explore the stack data structure, a last-in, first-out structure where you add and remove elements from the top, with a focus on Scala's mutable stack and its basic syntax.
Data Structures: Push and Pop in Stack3:58
Explore push and pop operations in stacks, including top of the stack access, as you build and manipulate a stack with integers, printing results to understand data structure behavior.
Data Structures: Stack Attributes5:23
Data Structures: Project Overview2:41
Develop a mini project to understand the stack data structure by building an equation bracket validator that checks valid opening and closing brackets using a stack.
Data Structures: Project Architecture10:47
Data Structures: Extra Closing Bracket Use Case10:26
Data Structures: Extra Starting Bracket Use Case8:29
Validate a bracketed expression with a stack in Scala. Address extra opening or closing brackets using a validation flag and empty stack checks.
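
The bracket validator described above relies on last-in, first-out behavior: push each opening bracket, pop on each closing bracket, and treat an empty stack at the wrong moment as an error. The course builds this in Scala. The sketch below shows the same idea in Python, using a list as the stack. The function name and the supported bracket set are assumptions made for illustration.

```python
# Maps each closing bracket to the opening bracket it must match.
PAIRS = {")": "(", "]": "[", "}": "{"}


def is_balanced(expression):
    """Return True when every bracket in the expression is properly matched."""
    stack = []
    for ch in expression:
        if ch in PAIRS.values():
            stack.append(ch)  # opening bracket: push
        elif ch in PAIRS:
            # Closing bracket: an empty stack means an extra closing bracket,
            # and a mismatched top means the wrong bracket type.
            if not stack or stack.pop() != PAIRS[ch]:
                return False
    # Anything left on the stack is an extra opening bracket.
    return not stack


for sample in ["(a+b)*[c-d]", "(a+b))", "((a+b)"]:
    print(sample, is_balanced(sample))
```

The two failure paths correspond to the two use cases named in the lectures: an extra closing bracket is caught inside the loop, and an extra opening bracket is caught by the final empty-stack check.
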
Project: Project Introduction0:56
Project: Why Spark5:05
Project: Hadoop EcoSystem5:22
Explore the Hadoop ecosystem, core concepts HDFS, YARN, and MapReduce, and how Spark distributes data across multiple machines with flow analysis that speeds up processing compared to MapReduce.
Project: Spark Architecture2:29
Explore Spark architecture by understanding the driver (master) node, cluster manager, and worker nodes, how tasks and transformations are distributed, executed, and returned as final output.
Project: Spark EcoSystem3:07
Explore the Spark ecosystem, from Spark SQL for transforming data and querying as tables, to Spark Streaming for real-time inputs, MLlib for machine learning, and GraphX for graph processing.
Project: DataBricks Account3:16
Create a Databricks account using the community edition, verify your email, and sign in to explore notebooks and begin writing Spark code.
Project: Setting up DataBricks Cluster4:14
Set up a Databricks cluster, attach it to a notebook, and write a hello world in Scala to verify the environment.
Project: Spark Local Setup4:23
Project: Spark Hadoop Setup4:02
Set up Spark and Hadoop on Windows by installing required utilities, configuring Hadoop home, Spark home, and Java home, and launching the Spark shell to verify a clean environment.
Project: Spark RDDs1:55
Project: Spark RDDs (textFile, collect)15:36
Project: Spark Local Run2:37
Project: Understanding Map6:06
Project: Understanding Flat Map9:54
Project: Understanding Reduce By Key5:25
Project: Word Count Example14:52
Project: Spark DFs3:13
Project: Spark DF Read Data6:24
Create a Spark session and read a CSV into a data frame with header true, then compare dataframe reading with Spark context.
Project: Spark Print Schema, Select3:30
Project: Spark GroupBy4:22
Project: Spark DF Write11:27
Demonstrates writing a Spark DataFrame to a file or folder with df.write, setting header and format options, using overwrite mode, and reading back from file or folder with Spark.
Project: Creating S3 bucket4:12
Project: Creating Database in RDS4:15
Project: Performing ETL19:38

Links for the Course's Materials and Codes0:09
Introduction: Why Big Data3:23
Introduction: Applications of PySpark3:12
Introduction: Introduction to Instructor0:46
Introduction: Introduction to Course1:49
Introduction: Projects Overview3:25
Introduction: Request for Your Honest Review1:18
Introduction to Hadoop, Spark EcoSystems and Architectures: Why Spark3:53
Introduction to Hadoop, Spark EcoSystems and Architectures: Hadoop EcoSystem4:49
Introduction to Hadoop, Spark EcoSystems and Architectures: Spark Architecture and EcoSystem8:08
Explore Hadoop and Spark ecosystems, detailing Spark architecture with driver and cluster managers, and how workers execute tasks across languages and libraries like Spark SQL, Spark Streaming, MLlib, and GraphX.
Introduction to Hadoop, Spark EcoSystems and Architectures: DataBricks SignUp3:41
Introduction to Hadoop, Spark EcoSystems and Architectures: Create DataBricks Notebook4:52
Introduction to Hadoop, Spark EcoSystems and Architectures: Download Spark and Dependencies3:16
Introduction to Hadoop, Spark EcoSystems and Architectures: Java Setup on Windows4:16
Introduction to Hadoop, Spark EcoSystems and Architectures: Python Setup on Windows1:31
Introduction to Hadoop, Spark EcoSystems and Architectures: Spark Setup on Windows2:58
Introduction to Hadoop, Spark EcoSystems and Architectures: Hadoop Setup on Windows2:40
Introduction to Hadoop, Spark EcoSystems and Architectures: Running Spark on Windows2:49
Validate the Spark installation on Windows by launching Spark Shell and PySpark, verify Spark version 3.1.x, and prepare to write PySpark code, with the Databricks workflow upcoming.
Introduction to Hadoop, Spark EcoSystems and Architectures: Java Download on MAC1:52
Introduction to Hadoop, Spark EcoSystems and Architectures: Installing JDK on MAC1:01
Install the JDK on macOS using the installation wizard, click continue, enter your password, and confirm a successful installation.
Introduction to Hadoop, Spark EcoSystems and Architectures: Setting Java Home on MAC3:02
Introduction to Hadoop, Spark EcoSystems and Architectures: Java check on MAC1:18
Introduction to Hadoop, Spark EcoSystems and Architectures: Installing Python on MAC1:13
Install Python on Mac to continue setting up the big data environment; download Python 3.9.6 from the Mac download page and run the installer.
Introduction to Hadoop, Spark EcoSystems and Architectures: Setup Spark on MAC4:18
Introduction to Hadoop, Spark EcoSystems and Architectures: Which of the following statement is True
Introduction to Hadoop, Spark EcoSystems and Architectures: Which of the following is not a part of spark ecosystem?
Spark RDDs: Spark RDDs8:28
Spark RDDs: Creating Spark RDD11:00
Spark RDDs: Running Spark Code Locally10:16
Spark RDDs: RDD stands for:
Spark RDDs: RDD is created by using:
Spark RDDs: RDD Map (Lambda)11:08
Master Spark RDD map by applying a lambda to each element to produce a new RDD, with examples like splitting strings by spaces and appending text.
Spark RDDs: RDD Map (Simple Function)9:37
Discover how to replace a lambda with a regular function in Spark RDDs map, split strings, convert to integers, and build robust map workflows.
Spark RDDs: Quiz (Map)1:23
Spark RDDs: Solution 1 (Map)6:37
Spark RDDs: Solution 2 (Map)4:01
Learn how to replicate an RDD map operation using a lambda function in Spark, building a split and length-based transformation with list comprehension for readable, concise code.
Spark RDDs: RDD FlatMap10:13
Spark RDDs: RDD Filter8:02
Spark RDDs: Quiz (Filter)1:37
Spark RDDs: Solution (Filter)16:19
Spark RDDs: RDD Distinct6:24
Spark RDDs: RDD GroupByKey17:02
Explore the groupByKey transformation on Spark RDDs by converting data to key–value notation, using map or flatMap to create (key, value) pairs, and collecting values into grouped lists per key.
Spark RDDs: RDD ReduceByKey13:46
Spark RDDs: Quiz (Word Count)1:02
Spark RDDs: Solution (Word Count)15:07
Spark RDDs: RDD (Count and CountByValue)7:11
Spark RDDs: RDD (saveAsTextFile)15:30
Spark RDDs: RDD (Partition)18:06
Learn how to manage Spark RDD partitions by repartitioning and coalescing, understand when to increase or decrease partitions, and see how partitioning affects read and write performance.
Spark RDDs: Finding Average-115:03
Spark RDDs: Finding Average-27:09
Spark RDDs: Quiz (Average)1:29
Spark RDDs: Solution (Average)11:25
Spark RDDs: Finding Min and Max10:18
Compute the minimum and maximum ratings per movie using Spark RDDs with map and reduceByKey, converting strings to key-value pairs and applying a lambda for min and max.
Spark RDDs: Quiz (Min and Max)0:57
Spark RDDs: Solution (Min and Max)6:13
Spark RDDs: Project Overview2:26
Spark RDDs: Total Students3:40
Spark RDDs: Total Marks by Male and Female Student6:51
Spark RDDs: Total Passed and Failed Students4:49
Spark RDDs: Total Enrollments per Course5:06
Spark RDDs: Total Marks per Course3:13
Spark RDDs: Average marks per Course12:45
Spark RDDs: Finding Minimum and Maximum marks3:50
Spark RDDs: Average Age of Male and Female Students5:48
Spark DFs: Introduction to Spark DFs8:08
Spark DFs: Creating Spark DFs10:34
Spark DFs: DF stands for:
Spark DFs: DF is created by using:
Spark DFs: Spark Infer Schema7:48
Spark DFs: Spark Provide Schema8:28
Spark DFs: Create DF from Rdd8:21
Spark DFs: Rectifying the Error5:17
Spark DFs: Select DF Columns11:49
Spark DFs: Spark DF withColumn19:46
Spark DFs: Spark DF withColumnRenamed and Alias6:12
Spark DFs: Spark DF Filter rows16:05
Learn to filter Spark DataFrames by rows using filter and where, apply single and multiple conditions, and use isin, startswith, endswith, contains, and like with column expressions.
Spark DFs: Quiz (select, withColumn, filter)1:26
Spark DFs: Solution (select, withColumn, filter)10:19
Spark DFs: Spark DF (Count, Distinct, Duplicate)10:56
Spark DFs: Quiz (Distinct, Duplicate)0:45
Spark DFs: Solution (Distinct, Duplicate)5:19
Spark DFs: Spark DF (sort, orderBy)6:24
Explore Spark df sorting with sort and orderBy, applying ascending or descending orders on single or multiple columns. Understand interchangeable notations and the integer data requirement for accurate sorting.
Spark DFs: Quiz (sort, orderBy)1:55
Spark DFs: Solution (sort, orderBy)9:14
Spark DFs: Spark DF (Group By)12:30
Spark DFs: Spark DF (Group By - Multiple Columns and Aggregations)10:37
Spark DFs: Spark DF (Group By - Visualization)13:25
Spark DFs: Spark DF (Group By - Filtering)11:08
Spark DFs: Quiz (Group By)0:52
Read a file into a Spark DataFrame, then use group by to display counts, male and female splits, marks by gender, and min, max, and average marks by course and age group.
Spark DFs: Solution (Group By)7:50
Spark DFs: Quiz (Word Count)0:54
Spark DFs: Solution (Word Count)4:39
Spark DFs: Spark DF (UDFs)8:34
Spark DFs: Quiz (UDFs)1:30
Spark DFs: Solution (UDFs)8:09
Spark DFs: Solution (Cache and Persist)7:30
Learn how Spark dataframes use caching and persist to store intermediate results in memory. See how actions trigger evaluation and subsequent transformations read from the cache, speeding up workflows.
Spark DFs: Spark DF (DF to RDD)7:24
Learn to refer to the underlying rdd instead of the dataframe and perform operations on it, including converting between dataframe and rdd and grouping by multiple columns.
Spark DFs: Spark DF (Spark SQL)6:16
Spark DFs: Spark DF (Write DF)10:45
Learn how to write a Spark DataFrame back to memory or an output directory, control the output through write options and save modes (overwrite, append, ignore, error), and read the data back.
Spark DFs: Project Overview2:11
Spark DFs: Project (Count and Select)4:11
Spark DFs project reads a file into a dataframe, counts employees, derives unique departments using group by or select with dropDuplicates, and prints department names.
Spark DFs: Project (Group By)4:26
Spark DFs: Project (Group By, Aggregations and Order By)5:03
Spark DFs: Project (Filtering)8:20
Spark DFs: Project (UDF and WithColumn)6:11
Spark DFs: Project (Write)3:17
Collaborative filtering: Collaborative filtering2:31
Collaborative filtering: Utility Matrix4:04
Collaborative filtering: Explicit and Implicit Ratings4:15
Collaborative filtering: Expected Results3:09
Collaborative filtering: Dataset6:38
Explore a collaborative filtering dataset by loading movie ratings into Databricks, configuring Spark read options, and inspecting the resulting data frame to start collaborative filtering workflows.
Collaborative filtering: Joining Dataframes6:42
Collaborative filtering: Train and Test Data6:26
Split the ratings dataframe into training and test sets using an 80/20 random split to train a collaborative filtering model and evaluate the recommender system.
Collaborative filtering: ALS model5:56
Collaborative filtering: Hyperparameter tuning and cross validation8:24
Explore collaborative filtering with hyperparameter tuning and cross-validation by building multiple models, evaluating with root mean squared error, and using a grid builder and cross validator to find parameters.
Collaborative filtering: Best model and evaluate predictions4:13
Collaborative filtering: Recommendations10:43
Spark Streaming: Introduction to Spark Streaming4:46
Spark Streaming: Spark Streaming with RDD4:25
Spark Streaming: Spark streaming is used to:
Spark Streaming: Spark Streaming Context5:09
Spark Streaming: Spark Streaming Reading Data5:18
Spark Streaming: Spark Streaming Cluster Restart4:00
Spark Streaming: Spark Streaming RDD Transformations7:41
Spark Streaming: Which statement is true about SparkContext and StreamingContext
Spark Streaming: Spark Streaming DF8:22
Spark Streaming: Spark Streaming Display5:14
Spark Streaming: Spark Streaming DF Aggregations5:35
Explore Spark Streaming df aggregations by performing a group by and count on a dataframe, observing how new files update the word counts in real time in Databricks.
ETL Pipeline: Introduction to ETL4:58
Explore the ETL pipeline with Spark as the driver that extracts data from diverse sources, optionally transforms it, and loads it to a chosen output format or destination.
ETL Pipeline: We can perform ETL using PySpark:
ETL Pipeline: ETL stands for:
ETL Pipeline: ETL pipeline Flow2:20
ETL Pipeline: Data set2:34
ETL Pipeline: Extracting Data3:20
This video demonstrates extracting data in the ETL pipeline by reading a text file into a data frame, displaying the results, and outlining a subsequent word count transformation before loading.
ETL Pipeline: Transforming Data14:15
Transform data in the ETL pipeline by converting lines into word lists, exploding them into individual words, and counting occurrences to produce a word frequency result.
ETL Pipeline: Loading data (Creating RDS-I)9:07
ETL Pipeline: Load data (Creating RDS-II)2:49
ETL Pipeline: RDS Networking5:30
ETL Pipeline: Downloading Postgres1:16
ETL Pipeline: Installing Postgres1:53
ETL Pipeline: Connect to RDS through PgAdmin2:35
ETL Pipeline: Loading Data15:40
Project - Change Data Capture / Replication On Going: Introduction to Project1:48
Introduce change data capture (CDC) and a pipeline to capture and replicate all changes from a database into storage, outlining the architecture for the end-to-end CDC project.
Project - Change Data Capture / Replication On Going: Project Architecture15:43
Project - Change Data Capture / Replication On Going: In this project we are going to implement:
Project - Change Data Capture / Replication On Going: The cloud service DMS will be used to:
Project - Change Data Capture / Replication On Going: Creating RDS MySql instance9:27
Project - Change Data Capture / Replication On Going: Creating S3 Bucket3:32
Project - Change Data Capture / Replication On Going: Creating DMS Source Endpoint5:37
Execute change data capture by creating and testing a data migration service source endpoint for a MySQL database, then name and verify the endpoint before configuring the destination.
Project - Change Data Capture / Replication On Going: Creating DMS Destination Endpoint5:35
Project - Change Data Capture / Replication On Going: Creating DMS Instance2:42
Create a DMS replication instance and an endpoint to enable change data capture and data migration, using the minimal available instance size in the VPC for the DMS task.
Project - Change Data Capture / Replication On Going: MySql WorkBench1:16
Project - Change Data Capture / Replication On Going: Connecting with RDS and Dumping Data6:02
Establish a MySQL Workbench connection to the target database, create the schema and a primary key table, and run the dump to enable change data capture and ongoing replication.
Project - Change Data Capture / Replication On Going: Querying RDS1:57
Project - Change Data Capture / Replication On Going: DMS Full Load8:30
Project - Change Data Capture / Replication On Going: DMS Replication Ongoing6:03
Project - Change Data Capture / Replication On Going: Stopping Instances1:45
Stop ongoing change data capture replication and related instances to prevent costs, then create a Spark job to read data from S3 and write to the S3 buckets.
Project - Change Data Capture / Replication On Going: Glue Job (Full Load)8:28
Create a Glue job in Databricks to perform a full load and updates for change data capture, reading full and updated data, renaming columns, and writing final output with overwrite.
Project - Change Data Capture / Replication On Going: Glue Job (Change Capture)3:50
Project - Change Data Capture / Replication On Going: Glue Job (CDC)15:26
Project - Change Data Capture / Replication On Going: Creating Lambda Function and Adding Trigger6:46
Project - Change Data Capture / Replication On Going: Checking Trigger5:21
Project - Change Data Capture / Replication On Going: Getting S3 file name in Lambda4:28
Extract the bucket name and file name from the Lambda event to identify which S3 object triggers the function, and verify by uploading a file.
Project - Change Data Capture / Replication On Going: Creating Glue Job5:24
Project - Change Data Capture / Replication On Going: Adding Invoke for Glue Job4:49
Project - Change Data Capture / Replication On Going: Testing Invoke4:59
Project - Change Data Capture / Replication On Going: Writing Glue Shell Job5:51
Project - Change Data Capture / Replication On Going: Full Load Pipeline6:41
Spin up the data migration task, perform the full load into the S3 bucket, and enable change data capture with replication ongoing via a Lambda-triggered Glue job.
Project - Change Data Capture / Replication On Going: Change Data Capture Pipeline7:12
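The "Getting S3 file name in Lambda" lecture above describes pulling the bucket name and file name out of the Lambda event. A minimal sketch of that step might look like the following. It assumes the standard S3 event notification layout (Records, s3, bucket, object); the helper name extract_s3_objects, the sample bucket, and the sample file name are hypothetical and not taken from the course.

```python
from urllib.parse import unquote_plus


def extract_s3_objects(event):
    """Return (bucket, key) pairs for every S3 record in a Lambda event."""
    pairs = []
    for record in event.get("Records", []):
        s3_info = record["s3"]
        bucket = s3_info["bucket"]["name"]
        # Object keys arrive URL-encoded (a space becomes '+'), so decode them.
        key = unquote_plus(s3_info["object"]["key"])
        pairs.append((bucket, key))
    return pairs


def lambda_handler(event, context):
    # Hypothetical handler: it only reports which objects triggered the call.
    objects = extract_s3_objects(event)
    return {"triggered_by": [f"s3://{bucket}/{key}" for bucket, key in objects]}


# Hand-written sample event with made-up bucket and file names.
sample_event = {
    "Records": [
        {
            "s3": {
                "bucket": {"name": "example-cdc-bucket"},
                "object": {"key": "full_load/LOAD00000001+copy.csv"},
            }
        }
    ]
}
result = lambda_handler(sample_event, None)
```

In a real pipeline the handler would likely pass the bucket and key on to the Glue job rather than just returning them; that part is left out here because the course's exact invocation is not shown on this page.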

Links for the Course's Materials and Codes0:09
Introduction: Why MongoDB3:03
Explore why MongoDB, a NoSQL database, is in high demand for web and mobile apps, with freelancing opportunities and strong industry relevance.
Introduction: Applications of MongoDB4:03
Introduction: Instructor Introduction0:57
Meet instructor Muhammad Ahmed, a cloud and big data engineer who brings experience with databases, data migration, cloud deployment, and DevOps to help you master big data skills, including MongoDB.
Introduction: What's inside2:13
Introduction: Methodology1:10
Introduction: Project1:29
Introduction: Request for Your Honest Review1:18
Explore how the Udemy review system works, encouraging you to assess the remaining sections and real-world concepts covered, and rate honestly if you think the content is five-star material.
Overview: SQL Schema10:16
Overview: NoSQL Schema12:13
Learn how NoSQL databases use flexible key-value pairs instead of fixed columns to store employee data and dependents. See how this approach reduces joins and enables document-based storage in MongoDB.
Overview: What's the major difference between SQL and NoSQL?
Overview: Installing MongoDB4:54
Overview: Setting Environment Variable3:20
Overview: Analogies4:26
Overview: Which of the statements below is not true?
Basic Mongo Operations: Basic Database commands6:31
Basic Mongo Operations: Basic Database commands6:14
Basic Mongo Operations: Basic Collection Commands10:18
Basic Mongo Operations: Which is the correct Command to see all collections?
Basic Mongo Operations: Is it mandatory to create collection in database for viewing it?
Basic Create Operation: Introduction to module1:53
Basic Create Operation: Create operation is used to
Basic Create Operation: Create Document (Single)9:10
Basic Create Operation: Create Documents (Many)8:04
Learn to insert multiple documents into a MongoDB collection in one go using insertMany, compare with single inserts, and see how IDs are returned for bulk operations.
Basic Create Operation: Which command is used to see all the documents in the collection?
Basic Create Operation: Quiz (Create Documents)1:56
Basic Create Operation: Solution (Create Documents)7:59
Basic Create Operation: Quiz (Create Document)1:09
Basic Create Operation: Solution (Create Document)5:46
Basic Create Operation: Outro1:23
Basic Update Operation: Introduction2:53
Basic Update Operation: Update Documents (Single Filter)13:02
Basic Update Operation: Update operation is used to
Basic Update Operation: Update Documents4:32
Basic Update Operation: Which command will update document where name is John?
Basic Update Operation: Quiz (Update Operation)0:53
Basic Update Operation: Solution (Update Operation)4:23
Basic Update Operation: Quiz (Update Operation)0:46
Execute basic update operations on documents by updating quantity to 204 for abc 1 2 3 and setting the order to 50 where quantity is 4.5, using the sample data.
Basic Update Operation: Solution (Update Operation)4:23
Basic Update Operation: Solution (Update Operation)4:53
Learn how to perform update operations in MongoDB by updating quantities and nested metrics ratings, while navigating document structure and common pitfalls of updating multiple documents.
Basic Update Operation: Outro1:08
Basic Read Operation: Introduction1:52
Basic Read Operation: Read operation is used to
Basic Read Operation: Read Documents3:06
Use filters to read documents in MongoDB by criteria, such as name Emmett or age, returning only matching records; empty criteria returns all records.
Basic Read Operation: Which command is used to see all the documents in the collection?
Basic Read Operation: Quiz (Read Documents)1:12
Basic Read Operation: Solution (Read Documents)6:09
Perform basic read operations to retrieve documents from a collection using find and pretty for formatted output, and filter by fields such as French, science, and history marks.
Basic Read Operation: Quiz (Read Documents)0:48
Basic Read Operation: Solution (Read Documents)2:26
Learn to read documents from a collection using a read operation: print all documents, format with pretty, and filter by quantity and metrics.ratings equals 3.5 using a find condition.
Basic Read Operation: Outro1:14
Basic Delete Operation: Introduction0:36
Basic Delete Operation: Delete operation is used to
Basic Delete Operation: Delete Document6:25
Learn how to perform delete operations in a database using empty criteria or specific filters, and delete by id by supplying the full object.
Basic Delete Operation: Which command is used to delete all the documents in the collection where quantity is 10?
Basic Delete Operation: Quiz (Delete Operation)0:47
Practice basic delete operations by writing queries to remove documents with nine marks in French and those named John, then delete all remaining documents.
Basic Delete Operation: Solution (Delete Operation)4:18
Demonstrate deleting documents by criteria in MongoDB, such as nine marks in French or specific student names, and clearing a collection by using an empty query parameter.
Basic Delete Operation: Quiz (Delete Operation)0:43
Basic Delete Operation: Solution (Delete Operation)1:23
Basic Delete Operation: Outro1:00
Master the basics of delete operations and preview MongoDB operators, as the course prepares you for the upcoming MongoDB module.
Query and projection operators: Module Introduction1:57
Query and projection operators: $eq Operator15:23
Query and projection operators: $gt Operator3:35
Query and projection operators: $lt Operator3:26
Query and projection operators: $in Operator3:09
Discover the MongoDB $in operator for query and projection, using a single condition to find documents with quantity matching values like 15, 20, or 25.
Query and projection operators: $ne Operator2:07
Query and projection operators: $nin Operator2:25
Explore the $nin operator, the inverse of $in, and learn to filter documents by not in a specified list, applying multiple not-in criteria to inventory data.
Query and projection operators: $and Operator6:39
Query and projection operators: $or Operator3:24
Query and projection operators: $not Operator5:11
Query and projection operators: $exists Operator7:18
Query and projection operators: $type Operator3:09
Query and projection operators: $expr Operator6:19
Explore evaluation query operators with the expression operator to compare spent and budget. Use the $expr operator in db.collection.find, referencing $spent and $budget with $gt, $lt, $gte, $lte, or $eq.
Query and projection operators: $mod Operator4:26
Query and projection operators: $text Operator10:53
Learn to use MongoDB's text operator on indexed fields, including phrase searches, case sensitivity, and negation, with text indexes on the article collection.
Query and projection operators: $all Operator5:47
Explore the MongoDB $all operator to filter documents by array contents. Learn how to require all specified elements in fields like tags and quantity.
Query and projection operators: $elemMatch Operator10:04
Query and projection operators: $size Operator4:10
Explore the $size operator in MongoDB within the realm of query and projection operators, filtering documents by array length using inventory data to query tags and quantity.
Query and projection operators: $ Operator7:30
Query and projection operators: $slice Operator3:39
Query and projection operators: Select the correct statement:
Query and projection operators: Select the correct statement:
Query and projection operators: Quiz ($eq)0:51
Query and projection operators: Solution ($eq)7:07
Create a MongoDB collection, insert records, and use the $eq operator to filter documents by quantity and nested item codes in tags arrays.
Query and projection operators: Quiz ($gt)1:02
Query and projection operators: Solution ($gt)3:01
Query and projection operators: Quiz ($gte)0:54
Query and projection operators: Solution ($gte)6:14
Create a collection, load data, and query documents using find and pretty. Learn to use the $gte operator to filter by quantity, including nested item fields accessed via dot.
Query and projection operators: Quiz ($in)0:54
Query and projection operators: Solution ($in)8:02
Explore MongoDB query and projection operators with the $in operator, filtering documents by quantity, nested item fields, and codes. Learn practical examples involving arrays and tags to refine results.
Query and projection operators: Quiz ($lt)1:17
Master the less than ($lt) operator in a quiz that filters documents by quantity under 15 and size under 12, reinforcing query and projection concepts.
Query and projection operators: Solution ($lt)2:32
Query and projection operators: Quiz ($lte)0:51
Query and projection operators: Solution ($lte)4:47
Query and projection operators: Solution ($lte)1:48
Query and projection operators: Quiz ($ne)0:45
Explore query and projection operators through a practical quiz, filtering documents by quantity not equal to 20, equal to 5, and not containing the value five.
Query and projection operators: Solution ($ne)4:22
Explore querying a MongoDB collection with the $ne operator to filter documents by quantity not equal to 20 and by nested values not equal to 10, with projection.
Query and projection operators: Quiz ($nin)0:56
Practice using the not in operator in a quiz on query and projection with $nin, filtering documents by quantity values, tags, and names.
Query and projection operators: Solution ($nin)2:13
Query and projection operators: Solution ($nin)3:03
Query and projection operators: Solution ($nin)1:39
Explore using the not in ($nin) operator in MongoDB to filter documents by name not in a given set, using the Mongo Shell find.
Query and projection operators: Quiz ($and)0:55
Query and projection operators: Solution ($and)7:44
Query and projection operators: Quiz ($or)1:36
Query and projection operators: Solution ($or)8:40
Query and projection operators: Solution ($or)5:18
Query and projection operators: Quiz ($not)1:47
Query and projection operators: Solution ($not)4:58
Query and projection operators: Solution ($not)4:35
Query and projection operators: Solution ($not)6:10
Use the not operator with query and projection operators to combine conditions, such as not one to three, quantity greater than 15, and tags do not contain a or b.
Query and projection operators: Quiz ($exists)0:51
Explore query and projection operators through an exists-based quiz, filtering documents by quantity and field presence to practice building precise database queries.
Query and projection operators: Solution ($exists)4:56
Explore query and projection operators, especially $exists, to filter documents by field presence and values, using the and operator. Lecture walks through creating a collection, inserting data, and applying queries.
Query and projection operators: Quiz ($expr)0:57
Query and projection operators: Solution ($expr)6:22
Query and projection operators: Quiz ($mod)0:41
Query and projection operators: Solution ($mod)4:12
Use the mod operator in find queries to filter documents by quantity, showing how even and odd values are identified and selected in a database collection.
Query and projection operators: Quiz ($text)1:22
Query and projection operators: Solution ($text)13:19
Create a MongoDB collection with a text index on subject, then use $text to search for shop or coffee. Combine text queries with and/or operators, negation, and views filters.
Query and projection operators: Quiz ($all)0:59
Apply the all operator to query documents by tags such as school and book, filter by colors brown and orange, and select quantities greater than six.
Query and projection operators: Solution ($all)4:32
Query and projection operators: Solution ($all)4:23
Query and projection operators: Quiz ($elemMatch)1:11
Query and projection operators: Solution ($elemMatch)6:00
Explore MongoDB queries with $elemMatch by creating a collection, importing data, and retrieving documents where an array field's numbers satisfy >80, <10, or 30–80, and blue or green.
Query and projection operators: Solution ($elemMatch)5:33
Query and projection operators: Quiz ($size)0:42
Query and projection operators: Solution ($size)3:20
Update Operators: $currentDate operator9:05
Update Operators: $inc operator7:25
Update Operators: $inc operator2:54
Explore the MongoDB $inc operator to increment and decrement fields, updating documents with update or updateMany, using minus values to reduce quantities.
Update Operators: $min operator2:57
Apply the $min update operator to set a field to the lesser of its current and new values, and use updateMany with an id-based filter to update multiple documents.
Update Operators: $max operator4:28
Explore the $max update operator, comparing a provided value with the current value and updating fields like high score when the new value is greater, with examples of update many.
Update Operators: $mul operator3:06
Learn how the mul update operator multiplies field values, applying to price and quantity, with examples updating all documents or a specific ID, doubling or halving values.
Update Operators: $rename operator5:26
Update Operators: $inc operator is used to:
Update Operators: $set operator7:17
Update Operators: $set operator3:50
Explore update operations with the $set operator to modify documents, including updating all records, specific array elements, and nested fields like tags and ratings.
Update Operators: $unset operator2:58
Update Operators: $addToSet operator3:52
Update Operators: $pop operator3:49
Learn how the MongoDB update $pop operator removes the first or last array element, using minus one for the first and plus one for the last, with no random removals.
Update Operators: $pull operator9:29
Update Operators: $push operator2:13
Update Operators: $each operator3:44
Update Operators: $position operator2:51
Update Operators: $sort operator2:52
Update Operators: $push operator is used to:
Update Operators: Quiz (Update Operators)1:02
Update Operators: Solution (Update Operators)3:26
Learn to load data into a MongoDB collection, then use update many with a condition of orders greater than 20 to increment quantity by two.
Update Operators: Solution (Update Operators)2:56
This lecture demonstrates using update many with the mul operator to double the quantity field for documents where metrics.ratings are greater than 4.2.
Update Operators: Solution (Update Operators)1:32
Discover how to use update operators to modify document quantities, using find to locate matches and update many to set the quantity to zero for selected records.
Update Operators: Solution (Update Operators)3:22
Update Operators: Quiz (Update Operators)0:47
Explore update operators through a quiz, writing a query to add schoolbag to missing document tags, then add texture and update ID3 entries for Bottle Gable and Mike.
Update Operators: Solution (Update Operators)2:11
Use the addToSet update operator to add the schoolbag tag to every document’s tags array if missing, showing updates from book bag and appliance to include school.
Update Operators: Solution (Update Operators)1:41
Update Operators: Solution (Update Operators)2:45
Mongo with Node: Installing Node on local machine2:43
Install MongoDB on your Windows local machine, then set up environment variables and update the PATH so Node can access MongoDB features.
Mongo with Node: Installing VS code4:33
Install node.js, verify the version with node -v, and set up Visual Studio Code. Open a folder, create a demo file, run a hello world program, and observe the output.
Mongo with Node: Mongo atlas1:37
Mongo with Node: Create Cluster on Mongo atlas4:21
Mongo with Node: Creating User in Atlas6:21
Mongo with Node: Network Access3:23
Mongo with Node: Is it a good practice to make your mongodb cluster publicly accessible from all IP addresses:
Mongo with Node: Database and Collections6:32
Mongo with Node: Connect Node with Mongo9:10
Connect a Node.js app to a MongoDB Atlas cluster by installing the MongoDB driver, creating a MongoClient, and reading databases. Handle errors and close the connection.
Mongo with Node: Get databases4:26
Mongo with Node: Insert in Mongo using Node8:27
Mongo with Node: Read from Mongo using Node5:21
Mongo with Node: Update in Mongo using Node7:25
Mongo with Node: Delete from Mongo using Node3:32
Execute delete many in Node to remove documents from a MongoDB collection by specific conditions, verify acknowledged deletions, and practice multi-document deletion scenarios.
Mongo with Node: Which of the statements is true:
Mongo with Python: PyCharm4:21
Mongo with Python: Creating Connection5:25
Learn to connect a Python script to MongoDB Atlas using the MongoDB driver, install the Python driver, and access database and collection for cloud-based data operations.
Mongo with Python: Insert in Mongo using Python6:36
Mongo with Python: Read from Mongo using Python4:28
Mongo with Python: Update in Mongo using Python6:24
Mongo with Python: Delete in Mongo using Python3:38
Delete documents in a MongoDB collection using Python by defining a condition and applying delete_many or delete_one, demonstrating how to reference the client, database, and collection.
Django with Mongo: Django Installation3:33
Django with Mongo: Which of the statements is true:
Django with Mongo: Creating App4:10
Django with Mongo: Setting up Django with Mongo6:04
Django with Mongo: Django Migrations2:18
Django with Mongo: Django Urls and Views4:22
Django with Mongo: Django with Postman4:35
Django with Mongo: Django get Data from Postman5:29
Django with Mongo: Insert in Mongo using Django2:44
Django with Mongo: Read from Mongo using Django6:03
Django with Mongo: Update in Mongo using Django4:57
Update a MongoDB document using Django by retrieving the document ID and new title from Postman, sending an update request, and saving the changes to the database.
Django with Mongo: Delete in Mongo using Django2:53
Spark With Mongo: Databricks for Spark3:06
Spark With Mongo: Installing Libraries2:05
Spark With Mongo: Data Overview2:46
Explore Spark with MongoDB by loading a simple employee dataset in Databricks, reading data with a notebook, and loading it into MongoDB.
Spark With Mongo: ETL13:05
Create a Spark session, configure the Spark MongoDB connector for ETL, read data from a file, and write it to a MongoDB collection with overwrite or append options.

Requirements

Basic understanding of HTML tags, Python, SQL, and Node.js.
No prior knowledge of data scraping and Scala is needed. You start right from the basics and then gradually build your knowledge of the subject.
Basic understanding of programming.
A willingness to learn and practice.
Because we teach through practical implementations, practice is a must.

Description

Welcome to the comprehensive Big Data and Data Science bundle, where you'll embark on an educational journey covering a wide range of essential skills and technologies. This course equips you with expertise in Scala, PySpark, AWS, Data Scraping, Data Mining, and MongoDB. Whether you're an absolute beginner or possess some programming knowledge, this course provides in-depth coverage of these critical topics.

I. Scala:

Scala may not be the most popular coding language, but it's undeniably one of the most sought-after skills for data scientists and data engineers. This course is meticulously designed to make Scala simple to grasp and implement. You'll engage with quizzes and mini-projects to reinforce your learning, making your Scala experience seamless.

Key Highlights:

High Demand Skill: Scala is in high demand in the industry, and this course ensures you acquire essential skills
Practical Learning: Quizzes and mini-projects serve as building blocks for a comprehensive understanding of Scala
Hands-on Experience: Gain practical experience by working on a Scala Spark project
Versatility: Scala is a powerful language suitable for a wide range of applications, from web development to machine learning

Learning Materials:

Comprehensive Scala tutorials
Scala quizzes and assessments
Hands-on Scala Spark project
Scala code examples and exercises

II. PySpark and AWS:

Python and Apache Spark are at the forefront of Big Data analytics, and PySpark bridges the gap between them. In this section, you'll start with the basics and progress to advanced data analysis. You'll work with PySpark for data analysis, explore Spark RDDs, Dataframes, and Spark SQL queries, and delve into Spark and Hadoop ecosystems. Additionally, you'll discover how to leverage AWS cloud services with Spark.

Key Highlights:

Python and Spark Integration: Master the art of using Python and Spark together for effective Big Data analysis
Comprehensive Coverage: Explore Spark RDDs, Dataframes, Spark SQL queries, and seamlessly integrate with AWS
Hands-on Practice: Apply your knowledge through practical exercises and projects

Learning Materials:

In-depth PySpark and AWS tutorials
PySpark quizzes and assessments
AWS integration guides and examples
PySpark code samples and hands-on projects

III. Data Scraping and Data Mining:

Data scraping involves extracting data from websites and APIs, making it a valuable skill for data professionals. This section is tailored for beginners, starting with foundational concepts and gradually delving into advanced techniques through practical implementations. Hands-on projects are a pivotal part of this segment, allowing you to learn through experimentation and real-world applications.

Key Highlights:

Beginner-Friendly: Perfect for individuals new to data scraping and mining
Practical Implementation: Gain deep insights through hands-on projects and real-world examples
Lucrative Career: Data scraping offers rewarding career prospects and competitive salaries

Learning Materials:

Comprehensive Data Scraping and Mining tutorials
Hands-on data extraction projects
Data scraping and mining quizzes and assessments
Data scraping code samples and automation scripts

IV. MongoDB:

This section introduces you to MongoDB, a popular NoSQL database. You'll learn the fundamentals of MongoDB, including Create, Read, Update, and Delete (CRUD) operations. You'll dive deep into MongoDB query and projection operators, enhancing your understanding of NoSQL databases. Two comprehensive projects will give you practical experience using MongoDB in Django and implementing an ETL (Extract, Transform, Load) pipeline with PySpark.

Key Highlights:

NoSQL Proficiency: Develop expertise in MongoDB, a highly sought-after NoSQL database
Hands-on Projects: Apply your knowledge to real-world scenarios and gain practical skills
Versatile Skills: MongoDB is invaluable for data management and analytics

Learning Materials:

MongoDB fundamentals and advanced tutorials
Hands-on MongoDB projects, including Django integration and ETL pipeline development
MongoDB quizzes and assessments
MongoDB code examples and best practices

Course Benefits:

Upon successfully completing this comprehensive course, you will be able to implement projects from scratch that require expertise in Data Scraping, Data Mining, Scala, PySpark, AWS, and MongoDB. You'll be adept at connecting theoretical concepts to real-world problem-solving and at efficiently extracting data from websites, and you'll be well-prepared for various data-related roles.

Learning Materials:

Video lectures and tutorials.
Quizzes, assessments, and solutions.
Hands-on projects with step-by-step guidance.
Code examples and templates.
Reference materials and best practices.

Enroll now to embark on your journey toward mastering Big Data and Data Science comprehensively!

Who Should Enroll:

Ideal for beginners or those looking to apply theoretical knowledge in practical scenarios
Aspiring data scientists and machine learning experts
Individuals aiming to excel in the realm of Big Data and Data Science

What You'll Learn:

Proficiency in implementing projects requiring expertise in Data Scraping, Data Mining, Scala, PySpark, AWS, and MongoDB
Efficient data extraction from websites
Skills applicable to various data-related roles

Why This Course:

High demand for Scala skills in the industry
Comprehensive coverage of PySpark, AWS, Data Scraping, Data Mining, and MongoDB
Hands-on experience through projects and practical exercises
Versatile skills for a wide range of applications

List of Keywords:

Big Data
Data Science
Scala
PySpark
AWS
Data Scraping
Data Mining
MongoDB
NoSQL Database
Data Extraction
Data Analysis

Who this course is for:

People who are absolute beginners.
People who want to make smart solutions.
People who want to learn with real data.
People who love to learn theory and then implement it practically.
Data scientists, machine learning experts, and drop shippers.

Course content

Data Scraping & Data Mining for Beginners to Pro with Python • 151 lectures • 13hr 27min

Scala & Spark-Master Big Data with Scala and Spark • 144 lectures • 12hr 58min

PySpark & AWS: Master Big Data With PySpark and AWS • 157 lectures • 16hr 27min

MongoDB-Mastering MongoDB for Beginners (Theory & Projects) • 171 lectures • 11hr 47min

Who this course is for: