Databricks Certified Data Engineer Associate Exam Guide
What you'll learn
- Databricks Clusters, Notebooks, data storage
- Databricks Lakehouse Platform (architecture, descriptions, benefits)
- Delta Lake
- ELT with Spark SQL and Python
- Relational entities (databases, tables, views)
- Accessing Data from Azure Data Lake Storage (ADLS)
- Structured Streaming, Auto Loader
- Delta Live Tables, Multi-hop Architecture
- Databricks Jobs
- Databricks Dashboards
- Data Governance
Requirements
- Basic knowledge of SQL and Python
- Basic understanding of cloud fundamentals
Description
Welcome to our comprehensive course on the Databricks Certified Data Engineer Associate certification. This course is designed to help you master the skills required to become a Databricks Certified Data Engineer Associate.
Databricks is a cloud-based data analytics platform that offers a unified approach to data processing, machine learning, and analytics. With the growing demand for data engineers, Databricks has become one of the most sought-after skills in the industry.
In this course, you'll learn the core concepts of Databricks, including the Databricks Lakehouse Platform, ELT with Spark SQL and Python, Incremental Data Processing, Production Pipelines, and Data Governance.
This course is designed by industry experts with years of experience in Databricks and data engineering. It combines theoretical concepts with hands-on labs to help you apply what you learn.
Upon completion of the course, you'll be able to take the Databricks Certified Data Engineer Associate exam with confidence and succeed in your career as a data engineer.
At the end of this course you should be able to:
- Understand how to use the Databricks Lakehouse Platform and its tools, and the benefits of doing so, including:
  - Data Lakehouse (architecture, descriptions, benefits)
  - Data Science and Engineering workspace (clusters, notebooks, data storage)
  - Delta Lake (general concepts, table management, manipulation, optimizations)
- Build ETL pipelines using Apache Spark SQL and Python, including:
  - Relational entities (databases, tables, views)
  - ELT (creating tables, writing data to tables, cleaning data, combining and reshaping tables, SQL UDFs)
  - Python (facilitating Spark SQL with string manipulation and control flow, passing data between PySpark and Spark SQL)
- Incrementally process data, including:
  - Structured Streaming (general concepts, triggers, watermarks)
  - Auto Loader (streaming reads)
  - Multi-hop Architecture (bronze-silver-gold, streaming applications)
  - Delta Live Tables (benefits and features)
- Build production pipelines for data engineering applications and Databricks SQL queries and dashboards, including:
  - Jobs (scheduling, task orchestration, UI)
  - Dashboards (endpoints, scheduling, alerting, refreshing)
- Understand and follow best security practices, including:
  - Unity Catalog (benefits and features)
  - Entity Permissions (team-based permissions, user-based permissions)
Enroll now and take the first step towards becoming a certified Databricks data engineer associate.
Who this course is for:
- Anyone who wants to prepare for the Databricks Data Engineer Associate certification exam
- Students who want to pursue a career in Data Engineering
- Professionals who want to move from other technologies to Data Engineering
- Anyone who wants to start learning Databricks
Instructors
I am Ankit Mistry. I completed my master's degree at IIT Kharagpur in the area of machine learning and artificial intelligence. I now work as a Software Developer and Big Data Engineer at a leading private investment bank, with 8+ years of experience in the software industry.
Over time, I developed an interest in the data disciplines and learned about data analysis, machine learning model development, and cloud computing.
I have created courses in the areas of Cloud Computing, Google Cloud, Python, Data Science, Data Analysis, and Machine Learning.
I am excited to be on the Udemy online learning platform and want to make a big impact on your software career.
I hope you will like my course offering.
Hello Data Lover,
I am glad that you are reading this!
I am Vijay Gadhave and I have 10+ years of experience in the IT Industry. I am passionate about Cloud Computing and Machine Learning.
I teach in the areas of Cloud Computing, Machine Learning, Python, Data Science, and Data Analysis.
I hope you will enjoy my course and that it will help you grow in your career.