
Explore how Apache Drill delivers schema-free, multi-store SQL queries across Hadoop, RDBMS, NoSQL, cloud storage, and filesystems, with seamless integrations and a scalable three-layer architecture.
Install drill on macOS or Linux, use the drill console, connect to filesystem and data stores like MySQL, MongoDB, Hive, Kafka, and HBase, and learn ANSI SQL, nested data.
Learn to run Apache Drill in non-embedded and cluster mode by installing and starting zookeeper, configuring drillbit with zookeeper hosts, and launching drillbit across a multi-host cluster.
Configure Apache Drill storage plugins to connect to data sources via the web control, including DFS and RDBMS. Define workspaces to simplify file paths and access within plugins.
Open a sample local file, then query it as a table with the Beerfest storage plugin by specifying the local path and limiting results to five rows.
Learn how to connect Apache Drill to a MySQL data source using the RDBMS storage plugin, including configuring the MySQL connector, loading required JAR files, and verifying the connection.
Connect Apache Drill to a local MongoDB instance using the MongoDB storage plugin, access via localhost:27017, and insert documents into a database and collection for querying.
Connect to Hive by configuring the Hive storage plugin in Apache Drill, connecting to a Hive server on localhost or remote, and confirm schema and data access.
Learn to connect Apache Drill to HBase by installing Zookeeper locally, configuring Drillbit, creating test tables, and running queries with convert from and join operations.
Explore how Apache Drill applies ANSI SQL to both tables and files, using familiar select and group by patterns with formats like CSV, DSV, Avro, and JSON.
Explore sql sessions in drill and see how a session retains schema and table context across commands in a stateful drill shell. Compare this with stateless consoles that reset context.
Learn to access and name columns in a query by handling headers and delimiters in a CSP/CSB file, using storage plugin options or the table function with extract header.
Connect to Apache Drill from Python using two libraries, install the Drill library, and run queries with a cursor, demonstrating basic vs. more sophisticated drivers and parameter binding.
Discover Apache Drill's rest interface, viewing version, options, state, and metrics, while managing storage plugins and executing cross-datasource queries in a stateless system with large results limits.
Connect to a local apache drill instance, install and use the Sergeant library, and run simple to complex queries across Mongo and MySQL while exploring drill options.
Install and run Apache Zeppelin, download binary with interpreters, and start the server. Configure a JDBC interpreter with the driver artifact to run queries and visualize bar and pie charts.
1. Agility: Get faster insights without the overhead (data loading, schema creation and maintenance, transformations, etc.)
2. Flexibility: Analyze the multi-structured and nested data in non-relational datastores directly without transforming or restricting the data
3. Familiarity: Leverage your existing SQL skillsets and BI tools including Tableau, Qlikview, MicroStrategy, Spotfire, Excel and more
Apache Drill can scale upto 1000s of servers, and is very high performant because of the way it is architected. This course not only covers how to use Drill to connect and query different data sources, but also explains the functioning of Drill and why it beats query engines like Spark, Presto and Impala.
Drill is a go-to tool for interactive Data Exploration. With its capability to connect seamlessly to any data store, and schema-free data exploration, it has become a go-to tool in the analyst's toolkit. Drill uses ANSI SQL for querying, and treats every data like a table, even if it is not, making it very easy to use for querying purposes.
What's included in the course?
Complete course on Apache Drill concepts and capabilities explained from Scratch.
Each and Every Apache Drill concept is explained with a HANDS-ON.
Includes even those concepts, the explanation to which is not very clear even in Drill official documentation.
Related Commands and Datasets used in lectures are attached in the course for your convenience.