Hands On Presto Mastery - Learn by doing!
Rating: 4.4 out of 5 (202 ratings)
1,189 students

Create an end-to-end Presto-based pipeline and query data in HDFS, MySQL & Kafka via the Presto CLI, RStudio & Tableau
Created by CurvedArrow LLC
Last updated 1/2021
English

What you'll learn

  • A deep dive into PrestoSQL
  • Learn how to architect Presto-based solutions
  • How to install and configure Presto
  • How to install and set up the Presto CLI
  • How to connect RStudio to Presto
  • How to connect Tableau to Presto (JDBC Connector)
  • How to query data in a MySQL DB via Presto
  • How to query data in Kafka via Presto
  • How to query data in HDFS via Presto
  • Set up and connect a Hive Metastore to Presto

Course content

5 sections • 18 lectures • 1h 57m total length
  • Course Overview (2:52)
  • Real World Scenario (3:43)

    See how Presto queries data in place from HDFS, MySQL, and Kafka, delivering fast analytics with compute–storage separation and in‑memory processing.

  • Product Requirements (3:52)

    Define three end users (data scientist, data engineer, and BI analyst) and design a Presto-based solution that enables Tableau, SQL, and RStudio access to data from HDFS, MySQL, and Kafka.

Requirements

  • Basic familiarity with SQL
  • Basic familiarity with the Linux CLI
  • A Computer (PC, Mac or Linux)
  • An Internet connection

Description

Welcome to my Presto course, Hands On Presto Mastery - Learn by doing! First of all, I want to thank you for considering this course. Without you, the student, I would not be able to create this course.

Big Data is very hot at the moment, and Presto is one of the most exciting projects in the Big Data ecosystem. Presto is a distributed query engine that excels at crunching petabytes of data efficiently and at delivering low-latency analytics.

This Presto course contains everything you need to get started with Presto. We will go from zero to Presto in this single course and back it up with a lot of hands-on work. By the end of this course, you will be full of confidence, skilled up, and ready to take your career to the next level. So let's buckle up and lean in.

In this course, we will wear many hats as we solve an end-to-end Big Data problem:

  1. Product Hat: We will start with a real world scenario and dive into the user segments and pain points.

  2. Architect Hat: Once we understand the pain points, we will architect a Presto-based solution to address them.

  3. Engineer Hat: Once we have the architecture, we will install and set up Presto. We will cover the fundamentals: coordinator, workers, connectors, etc.

  4. Engineer Hat: One nice thing is that Presto forces you to interact with other projects in the Big Data ecosystem.

    1. To query and visualize data, we will set up clients such as the Presto CLI, RStudio, and Tableau.

    2. For storage, we will set up data sources such as MySQL, Kafka, and HDFS. Presto is strictly a compute engine, which means that it does not have its own storage.

  5. Engineer Hat: Once setup is complete, we will demo the final product and show you how to join data from multiple data sources in a single query.

At the end of this journey, we will deliver a solution that solves the pain points identified earlier.

Who this course is for:

  • Beginners