Udemy
  •  
  •  
  •  
  •  
  •  
  •  
  •  
  •  
  •  
  •  
  •  
  •  
  •  
Development
Web Development Data Science Mobile Development Programming Languages Game Development Database Design & Development Software Testing Software Engineering Development Tools No-Code Development
Business
Entrepreneurship Communications Management Sales Business Strategy Operations Project Management Business Law Business Analytics & Intelligence Human Resources Industry E-Commerce Media Real Estate Other Business
Finance & Accounting
Accounting & Bookkeeping Compliance Cryptocurrency & Blockchain Economics Finance Finance Cert & Exam Prep Financial Modeling & Analysis Investing & Trading Money Management Tools Taxes Other Finance & Accounting
IT & Software
IT Certification Network & Security Hardware Operating Systems Other IT & Software
Office Productivity
Microsoft Apple Google SAP Oracle Other Office Productivity
Personal Development
Personal Transformation Personal Productivity Leadership Career Development Parenting & Relationships Happiness Esoteric Practices Religion & Spirituality Personal Brand Building Creativity Influence Self Esteem & Confidence Stress Management Memory & Study Skills Motivation Other Personal Development
Design
Web Design Graphic Design & Illustration Design Tools User Experience Design Game Design Design Thinking 3D & Animation Fashion Design Architectural Design Interior Design Other Design
Marketing
Digital Marketing Search Engine Optimization Social Media Marketing Branding Marketing Fundamentals Marketing Analytics & Automation Public Relations Advertising Video & Mobile Marketing Content Marketing Growth Hacking Affiliate Marketing Product Marketing Other Marketing
Lifestyle
Arts & Crafts Beauty & Makeup Esoteric Practices Food & Beverage Gaming Home Improvement Pet Care & Training Travel Other Lifestyle
Photography & Video
Digital Photography Photography Portrait Photography Photography Tools Commercial Photography Video Design Other Photography & Video
Health & Fitness
Fitness General Health Sports Nutrition Yoga Mental Health Dieting Self Defense Safety & First Aid Dance Meditation Other Health & Fitness
Music
Instruments Music Production Music Fundamentals Vocal Music Techniques Music Software Other Music
Teaching & Academics
Engineering Humanities Math Science Online Education Social Science Language Teacher Training Test Prep Other Teaching & Academics
AWS Certification Microsoft Certification AWS Certified Solutions Architect - Associate AWS Certified Cloud Practitioner CompTIA A+ Cisco CCNA Amazon AWS CompTIA Security+ Microsoft AZ-900
Photoshop Graphic Design Adobe Illustrator Drawing Digital Painting InDesign Character Design Canva Figure Drawing
Life Coach Training Neuro-Linguistic Programming Mindfulness Personal Development Personal Transformation Meditation Life Purpose Coaching Neuroscience
Web Development JavaScript React CSS Angular PHP WordPress Node.Js Python
Google Flutter Android Development iOS Development Swift React Native Dart Programming Language Mobile Development Kotlin SwiftUI
Digital Marketing Google Ads (Adwords) Social Media Marketing Google Ads (AdWords) Certification Marketing Strategy Internet Marketing YouTube Marketing Email Marketing Retargeting
SQL Microsoft Power BI Tableau Business Analysis Business Intelligence MySQL Data Analysis Data Modeling Big Data
Business Fundamentals Entrepreneurship Fundamentals Business Strategy Online Business Business Plan Startup Freelancing Blogging Home Business
Unity Game Development Fundamentals Unreal Engine C# 3D Game Development C++ 2D Game Development Unreal Engine Blueprints Blender
30-Day Money-Back Guarantee

This course includes:

  • 2 hours on-demand video
  • 23 downloadable resources
  • Full lifetime access
  • Access on mobile and TV
IT & Software IT Certification PySpark

PySpark - Python Spark Hadoop coding framework & testing

Big data Python Spark PySpark coding framework logging error handling unit testing PyCharm PostgreSQL Hive data pipeline
Rating: 4.5 out of 54.5 (21 ratings)
4,108 students
Created by FutureX Skill
Last updated 12/2020
English
English [Auto]
30-Day Money-Back Guarantee

What you'll learn

  • Python Spark PySpark industry standard coding practices - Logging, Error Handling, reading configuration, unit testing
  • Building a data pipeline using Hive, Spark and PostgreSQL
  • Python Spark Hadoop development using PyCharm

Requirements

  • Basic programming skills
  • Basic database skills
  • Hadoop entry level knowledge

Description

This course will bridge the gap between your academic and real world knowledge and prepare you for an entry level Big Data Python Spark developer role. You will learn the following

  • Python Spark coding best practices

  • Logging

  • Error Handling

  • Reading configuration from properties file

  • Doing development work using PyCharm

  • Using your local environment as a Hadoop Hive environment

  • Reading and writing to a Postgres database using Spark

  • Python unit testing framework

  • Building a data pipeline using Hadoop , Spark and Postgres

Prerequisites :

  • Basic programming skills

  • Basic database knowledge

  • Hadoop entry level knowledge

Who this course is for:

  • Students looking at moving from Big Data Spark academic background to a real world developer role

Course content

8 sections • 33 lectures • 2h 3m total length

  • Preview01:22
  • What is Big Data Spark?
    02:12

  • Preview00:39
  • Preview00:46
  • Preview02:28
  • Preview01:34
  • Preview02:26
  • Running PySpark in the Console
    01:18
  • PyCharm PySpark Hello DataFrame
    03:59
  • Preview02:14
  • Python basics
    10:02

  • Structuring code with classes and methods
    06:29
  • How Spark works?
    01:30
  • Creating and reusing SparkSession
    07:54
  • Spark DataFrame
    05:49
  • Separating out Ingestion, Transformation and Persistence code
    06:12

  • Python Logging
    05:05
  • Managing log level through a configuration file
    09:15
  • Having custom logger for each Python class
    04:43
  • Error Handling with try except and raise
    06:12

  • Ingesting data from Hive
    04:55
  • Transforming ingested data
    02:12
  • Installing PostgreSQL
    03:55
  • Spark PostgreSQL interaction with Psycopg2 adapter
    07:06
  • Spark PostgreSQL interaction with JDBC driver
    03:49
  • Preview02:39

  • Organizing code further
    02:25
  • Reading configuration from a property file
    02:27

  • Python unittest framework
    03:30
  • Unit testing PySpark transformation logic
    04:02
  • Preview01:18

  • PySpark spark-submit
    01:47
  • Thank you
    00:51

Instructor

FutureX Skill
Big Data, Cloud and AI Solution Architects
FutureX Skill
  • 4.3 Instructor Rating
  • 743 Reviews
  • 26,762 Students
  • 6 Courses

We are a group of Solution Architects and Developers with expertise in Java, Python, Scala , Big Data , Machine Learning and Cloud.

We have years of experience in building Data and Analytics solutions for global clients.

Our primary goal is to simplify learning for our students.

We take a very practical use case based approach in all our courses.

  • Udemy for Business
  • Teach on Udemy
  • Get the app
  • About us
  • Contact us
  • Careers
  • Blog
  • Help and Support
  • Affiliate
  • Terms
  • Privacy policy
  • Cookie settings
  • Sitemap
  • Featured courses
Udemy
© 2021 Udemy, Inc.