An Introduction to Delta Lake

Exploring open source storage layer that brings reliability to data lakes
New
Rating: 4.4 out of 5 (3 ratings)
364 students
English
English [Auto]

Experience Classroom like environment via White-boarding sessions
Challenges with Delta Lake
Key Big Data Architectures
Why Delta Lake?
Delta Architecture
Delta Lake Demo

Requirements

  • Having basic knowledge of Apache Spark and Python

Description

Data Lakes built using Hadoop framework were lacking a very basic functionality i.e. ACID compliance. Hive tried to overcome some of the limitations by providing update functionality but the overall process was messy. Databricks (the company behind Spark) came up with a unique solution i.e. Delta Lake. Delta Lake enables ACID transactions over existing Data Lakes. It can seamlessly integrate with many Big Data Frameworks like Spark, Presto, Athena, Redshift, Snowflake etc. Let’s explore this interesting technology more.

It is an Open source storage layer that provides reliability to the data lakes. Delta Lakes provides ACID(Atomicity, Consistency, Integrity and Durability) properties, scalable metadata handling, and unifies streaming and batch data processing. It runs on the top of data lakes and it is fully compatible with Spark APIs. Data in Delta Lake is stored in Parquet format. It enables Delta Lake to leverage the efficient compression and encoding schemes that are native to Parquet.

Enroll Now to explore about Delta Lake. Here is detailed Agenda for the course:

  • Challenges with Delta Lake

  • Key Big Data Architectures

  • Why Delta Lake?

  • Delta Architecture

  • Delta Lake Demo

We are an official training delivery partner of Confluent Kafka.. We conduct corporate trainings on various topics including Confluent Kafka Developer, Confluent Kafka Administration, Confluent Kafka Real-Time Streaming using KSQL & KStreams and Confluent Kafka Advanced Optimization. Our instructors are well qualified and vetted by Confluent for delivering such courses.


Who this course is for:

  • Software Developer
  • Data Engineer
  • Architects

Instructors

Authorized Instructor for GCP,Snowflake,Cloudera,Confluent
Bhavuk Chawla
  • 4.3 Instructor Rating
  • 809 Reviews
  • 16,495 Students
  • 8 Courses

Bhavuk has over 17 years of experience in IT, more than 8 years of experience implementing Cloud/ML/AI/Big Data Science related projects. He is an official instructor for Google, Confluent, and Cloudera. He has delivered and continues to deliver his training sessions in various companies like Google Singapore, Microsoft Bangalore, Starbucks Coffee Seattle, Adobe India, and EMEA Region, etc.

He was recognized by Cloudera as the Instructor of the Year 2016 (APAC) for his exceptionally high ratings received in various training sessions. Connect with him on LinkedIn for Long term Learning partnership.

Experts in Bleeding Edge Technologies
DataCouch Support Team
  • 4.3 Instructor Rating
  • 809 Reviews
  • 16,495 Students
  • 8 Courses

DataCouch has been an established training, staffing & consultation provider since 2016. We offer consulting, talent development, skills enhancement programs, academic curricular development and staff augmentation in major cutting edge technologies such as Artificial Intelligent, Scalable Machine Learning, Data Science, DevOps, Robotics Processing Automation and many more. As a professional company comprising of a highly competent and experienced facilitators, consultants, architects and data scientist from multi domain knowledge. Our instructors and associates serve our global customer footprint, and combined experience over 100+ years of diverse real world. Serving national and multinational corporations, government agencies, education and non-profits across globe. We provide private, public batches and on-site training using our well-equipped virtual labs with latest technologies and tools. We believe that we will become your value added partner in assisting your organization towards achieving your goals. and below technologies in below domains.


Reach out to us at support@datacouch.io