Apache Drill Query Optimization Training Course

Course Code

apachedrillqueryoptim

Duration

7 hours (usually 1 day including breaks)

Requirements

  • An understanding of Hadoop, NoSQL, and other data storage concepts
  • A general understanding of SQL queries
  • Experience with Linux command line

Overview

Apache Drill is a schema-free, distributed, in-memory columnar SQL query engine for Hadoop, NoSQL, and other cloud and file storage systems. Its power lies in its ability to join data from multiple data stores in a single query. Apache Drill supports numerous NoSQL databases and file systems, including HBase, MongoDB, MapR-DB, HDFS, MapR-FS, Amazon S3, Azure Blob Storage, Google Cloud Storage, Swift, NAS, and local files. Apache Drill is the open-source version of Google's Dremel system, which is available as an infrastructure service called Google BigQuery.
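As a rough illustration of joining data from multiple data stores in one query, the sketch below builds a Drill SQL statement that joins a MongoDB collection with a Parquet file and wraps it in the JSON body accepted by Drill's REST query endpoint. The storage plugin names (`mongo`, `dfs`), the database, collection, file path, and column names are all hypothetical, and the URL assumes a Drillbit running locally on the default web port.

```python
import json
import urllib.request

# Assumption: a local Drillbit exposing its REST API on the default web port.
DRILL_URL = "http://localhost:8047/query.json"

# Hypothetical plugins, tables, and columns: `mongo` points at a MongoDB
# instance and `dfs` at a file system that holds a Parquet file.
CROSS_STORE_SQL = (
    "SELECT c.name, SUM(o.amount) AS total "
    "FROM mongo.shop.customers AS c "
    "JOIN dfs.`/data/orders.parquet` AS o ON c.id = o.customer_id "
    "GROUP BY c.name"
)


def build_payload(sql: str) -> bytes:
    """Serialize a SQL statement into the JSON body for Drill's REST API."""
    return json.dumps({"queryType": "SQL", "query": sql}).encode("utf-8")


def run_query(sql: str, url: str = DRILL_URL) -> dict:
    """POST the query to Drill and return the decoded JSON response."""
    request = urllib.request.Request(
        url,
        data=build_payload(sql),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

Calling `run_query(CROSS_STORE_SQL)` requires a running Drill instance with both storage plugins enabled; `build_payload` can be inspected without one.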

In this instructor-led, live training, participants will learn how to optimize and debug Apache Drill to improve the performance of queries on very large data sets. The course begins with an architectural overview and feature comparison between Apache Drill and other interactive data analysis tools. Participants then step through a series of interactive, hands-on practice sessions that include installation, configuration, performance evaluation, query optimization, data partitioning, and debugging of an Apache Drill instance in a live lab environment.

By the end of this training, participants will be able to:

  • Install and configure Apache Drill
  • Understand Apache Drill's architecture and features
  • Understand how Apache Drill receives and executes queries
  • Optimize Drill queries for distributed SQL execution
  • Debug Apache Drill

Audience

  • Developers
  • Systems administrators
  • Data analysts

Format of the course

  • Part lecture, part discussion, exercises and heavy hands-on practice

Notes

  • To request customized training for this course, please contact us.

Course Outline

Introduction to Apache Drill

  • How does Apache Drill compare to Spark SQL, Hive, and Impala?

Overview of Apache Drill Features and Architecture

  • Apache Drill Components

Understanding Apache Drill Queries

  • Query Execution Process

Performing SQL Queries

  • Connecting to the data source
  • Querying the data

Using the Drill Web Console

  • Query, Profiles, Storage, Metrics, Threads, and Options

Performance Optimization Strategy

  • Identifying the source of performance issues
  • Analyzing Query Plans and Profiles

Apache Drill Query Optimization

  • Optimizing a Query

Limiting the Data that Drill Reads

  • Partitioning the data (partition pruning)

Apache Drill Logging and Debugging

  • Analyzing Drill Error Messages
  • Configuring Log File Options

Troubleshooting Apache Drill

Summary and Conclusion

Bookings, Prices and Enquiries

Guaranteed to run even with a single delegate!

Private Classroom

From £1250

Private Remote

From £1100
