Apache Spark Training Course

Course Code

68780

Duration

14 hours (usually 2 days including breaks)

Course Outline

Why Spark?

  • Problems with Traditional Large-Scale Systems
  • Introducing Spark

Spark Basics

  • What is Apache Spark?
  • Using the Spark Shell
  • Resilient Distributed Datasets (RDDs)
  • Functional Programming with Spark

Working with RDDs

  • RDD Operations
  • Key-Value Pair RDDs
  • MapReduce and Pair RDD Operations

The Hadoop Distributed File System

  • Why HDFS?
  • HDFS Architecture
  • Using HDFS

Running Spark on a Cluster

  • Overview
  • A Spark Standalone Cluster
  • The Spark Standalone Web UI

Parallel Programming with Spark

  • RDD Partitions and HDFS Data Locality
  • Working With Partitions
  • Executing Parallel Operations

Caching and Persistence

  • RDD Lineage
  • Caching Overview
  • Distributed Persistence

Writing Spark Applications

  • Spark Applications vs. Spark Shell
  • Creating the SparkContext
  • Configuring Spark Properties
  • Building and Running a Spark Application
  • Logging

Spark, Hadoop, and the Enterprise Data Center

  • Overview
  • Spark and the Hadoop Ecosystem
  • Spark and MapReduce

Spark Streaming

  • Spark Streaming Overview
  • Example: Streaming Word Count
  • Other Streaming Operations
  • Sliding Window Operations
  • Developing Spark Streaming Applications

Common Spark Algorithms

  • Iterative Algorithms
  • Graph Analysis
  • Machine Learning

Improving Spark Performance

  • Shared Variables: Broadcast Variables
  • Shared Variables: Accumulators
  • Common Performance Issues

Testimonials

★★★★★
★★★★★

Bookings, Prices and Enquiries

Guaranteed to run even with a single delegate!

Private Classroom

From £2500

Private Remote

From £2200 (102)

Public Classroom

Cannot find a suitable date? Choose Your Course Date >>Too expensive? Suggest your price

Course Discounts

Course Venue Course Date Course Price [Remote / Classroom]
Excel VBA Introduction Belfast City Centre Mon, 2018-09-03 09:30 £2178 / £2678
Introduction to Selenium York - Priory Street Centre Tue, 2018-09-04 09:30 £1089 / £1239
Minitab for Statistical Data Analysis Cambridge Mon, 2018-09-10 09:30 £2574 / £3024
AWS: A Hands-on Introduction to Cloud Computing Edinburgh Training and Conference Venue Tue, 2018-09-11 09:30 £1287 / £1487
JMeter Fundamentals and JMeter Advanced Birmingham Tue, 2018-09-18 09:30 £2178 / £2828
Test Automation with Selenium St Helier, Jersey, Channel Isles Tue, 2018-09-18 09:30 £2970 / £4395
Jenkins: Continuous Integration for Agile Development Manchester, King Street Thu, 2018-10-18 09:30 £2574 / £3224
CakePHP: Rapid Web Application Development Birmingham Tue, 2018-11-06 09:30 £4356 / £5656

Course Discounts Newsletter

We respect the privacy of your email address. We will not pass on or sell your address to others.
You can always change your preferences or unsubscribe completely.

Some of our clients