CCA 175 - Spark and Hadoop Developer - Python (pyspark)
What you'll learn
- Entire curriculum of CCA Spark and Hadoop Developer
- HDFS Commands
- Python Fundamentals
- Core Spark - Transformations and Actions
- Spark SQL and Data Frames
Requirements
- Basic programming skills using any programming language
- Cloudera QuickStart VM, a valid account for ITVersity Big Data labs, or any Hadoop cluster where Hadoop, Hive and Spark are well integrated
- A 64-bit operating system, with minimum memory depending on the environment you are using
- 4 GB RAM with access to a proper cluster, or 16 GB RAM with virtual machines such as Cloudera QuickStart VM
Description
CCA 175 Spark and Hadoop Developer is a well-recognized Big Data certification. This scenario-based certification exam requires basic programming in Python or Scala along with Spark and other Big Data technologies.
This comprehensive course covers all aspects of the certification using Python as the programming language:
- Python Fundamentals
- Spark SQL and Data Frames
- File formats
Please note that the syllabus has recently changed and the exam is now primarily focused on Spark Data Frames and/or Spark SQL.
Exercises are provided to help you prepare before taking the certification exam. The aim of the course is to build your confidence to take it.
All the demos are given on our state-of-the-art Big Data cluster. You can get one week of complimentary lab access by filling in the form provided as part of the welcome message.
Who this course is for:
- Any IT aspirant or professional who wants to learn Big Data and take the CCA 175 certification exam
Course content
- 08:01 CCA 175 Spark and Hadoop Developer - Curriculum
- 08:55 Using labs for preparation
- 02:25 Setup Development Environment (Windows 10) - Introduction
- 04:12 Setup Development Environment - Python and Spark - Pre-requisites
- 03:07 Setup Development Environment - Python Setup on Windows
- 02:31 Setup Development Environment - Configure Environment Variables
- 05:28 Setup Development Environment - Setup PyCharm for developing Python applications
- 02:31 Setup Development Environment - Pass run time arguments or parameters
- 01:38 Setup Development Environment - Download Spark compressed tar ball
- 01:00 Setup Development Environment - Install 7z for uncompress and untar on windows
- 02:26 Setup Development Environment - Setup Spark
- 06:05 Setup Development Environment - Install JDK
- 03:46 Setup Development Environment - Configure environment variables for Spark
- 06:30 Setup Development Environment - Install WinUtils - integrate Windows and HDFS
- 07:06 Setup Development Environment - Integrate PyCharm and Spark on Windows 10
Instructors
13+ years of experience in executing complex projects using a vast array of technologies, including Big Data and Cloud.
ITVersity, Inc. is a US-based organization that provides quality training for IT professionals, with a track record of training hundreds of thousands of professionals globally.
Helping people build IT careers by providing the required tools, such as high-quality material, labs and live support, to upskill and cross-skill is paramount for our organization.
At this time our training offerings are focused on the following areas:
* Application Development using Python and SQL
* Big Data and Business Intelligence
* Cloud
* Data Warehousing, Databases
We have built a team to provide support going forward. If you send messages to this account about our courses, they will be sent to our Helpdesk, from where they will be routed to our team.
3+ years of IT experience in Python (using Django as well as Flask), Spark, Linux, SQL on any RDBMS, JavaScript, Node.js, MongoDB, etc.
I will primarily provide support for Python, SQL and other related courses as co-instructor for ITVersity courses.
ITVersity, Inc. is a US-based organisation that provides quality training for IT professionals, with a track record of training hundreds of thousands of professionals globally.
Helping people build IT careers by providing the required tools, such as high-quality material, labs and live support, to upskill and cross-skill is paramount for our organisation.
At this time our training offerings are focused on the following areas:
* Application Development using Python and SQL
* Big Data and Business Intelligence
* Cloud
* Data Warehousing, Databases
- 4.1 Instructor Rating
- 3,941 Reviews
- 25,017 Students
- 3 Courses
Experienced Data Engineer with a demonstrated history of working in the consumer goods industry. Skilled in Apache Airflow, Apache Kafka, Hive, Apache Spark, and Amazon Web Services (AWS). Strong information technology professional with a Master's degree focused in Analytics from the University of Cincinnati.
ITVersity, Inc. is a US-based organisation providing quality training for IT professionals, with a track record of training hundreds of thousands of professionals globally.
Helping people build IT careers with high-quality content, labs, live support, etc. to upskill and cross-skill is paramount for our organisation.
I will be overseeing the support for ITVersity courses related to Data Engineering and DevOps Engineering.
At this time our training offerings are focused on the following areas:
* Application Development using Python and SQL
* Big Data and Business Intelligence
* Cloud
* Data Warehousing, Databases