Udemy
    •  
    •  
    •  
    •  
    •  
    •  
    •  
    •  
Turn what you know into an opportunity and reach millions around the world.
Learn More
Your cart is empty.
Keep shopping
Writing Tests for SimEng Python Code conversion concepts 101
Rating: 4.0 out of 5(9 ratings)
967 students

Writing Tests for SimEng Python Code conversion concepts 101

Interview Prep: Write unit test for prevailing code in Python Pyspark SQL and configure tests in Pycharm
Created byShivgan Joshi
Last updated 11/2024
English

What you'll learn

  • How to write unit test for prevailing code in Python Pyspark SQL
  • How to use Excel power queries
  • How to setup Pycharm, venv, unit testing, coverage for testing
  • How to write code for smaller and bigger function class task workflows

Course content

6 sections18 lectures31m total length
  • Introduction1:35
  • What you will learn in the Course2:41

    Learn to write unit tests for sim engines, using Excel power queries for joins (including left joins) and group bys, and set up PyCharm testing with debugging and virtual environments.

  • Learning Points and Summary2:32

    Apply groupby and join on small data to validate results in Python or Spark SQL, and debug Spark sessions while managing Hadoop and Hive views in a local virtual environment.

Requirements

  • No programming exp needed

Description

Interview Prep: Writing Tests for SimEng - Code conversion concepts

Write unit test for prevailing code in Python Pyspark SQL and configure tests in Pycharm


What you will learn:

  1. How to write unit test for prevailing code in Python Pyspark SQL

  2. How to use Excel power queries

  3. How to setup Pycharm, venv, unit testing, coverage for testing

  4. How to write code for smaller and bigger function class task workflows

  5. How to setup correct venv locally

  6. Intro to code release process

  7. Setting up local env variables

Topics:

  1. In Excel power queries we use mostly group by and join on small data to check our results and share with seniors.

  2. We can create a simpler notebook of the same logic of the code

  3. A lot of type the errors are due to Local Spark sessions which can be created in many many ways, but if we want to create spark sql views which uses hive then we have to install a hadoop and spard by downloading them.

  4. Debugging and creating break points and then using step into / step over to generate the output and match or save it with excel

  5. Making sure we mimic spark from older tests because otherwise we can get error in Jenkins build



Who this course is for:

  • Begineers