Introduction to Graph Computing Training Course

Course CodeCourse Code


Duration Duration

28 hours (usually 4 days including breaks)

Requirements Requirements

  • An undersanding of Java programming and frameworks
  • A general understanding of Python is helpful but not required
  • A general understanding of database concepts

Overview Overview

A large number of real world problems can be described in terms of graphs. For example, the Web graph, the social network graph, the train network graph and the language graph. These graphs tend to be extremely large; processing them requires a specialized set of tools and mindset referred to as graph computing.

In this instructor-led, live training, participants will learn about the various technology offerings and implementations for processing graph data. The aim is to identify real-world objects, their characteristics and relationships, then model these relationships and process them as data using graph computing approaches. We start with a broad overview and narrow in on specific tools as we step through a series of case studies, hands-on exercises and live deployments.

By the end of this training, participants will be able to:

  • Understand how graph data is persisted and traversed
  • Select the best framework for a given task (from graph databases to batch processing frameworks)
  • Implement Hadoop, Spark, GraphX and Pregel to carry out graph computing across many machines in parallel
  • View real-world big data problems in terms of graphs, processes and traversals


  • Developers

Format of the course

  • Part lecture, part discussion, exercises and heavy hands-on practice

Course OutlineCourse Outline

    Graph databases and libraries

Understanding graph data
    The graph as a data structure
    Using vertices (dots) and edges (lines) to model real-world scenarios

Using Graph databases to model, persist and process graph data
    Local graph algorithms/traversals
    neo4j, OrientDB and Titan

Exercise: Modeling Graph Data with neo4j
    Whiteboard data modeling

Beyond Graph databases: Graph computing
    Understanding the property graph
    Graph modeling different scenarios (software graph, discussion graph, concept graph)

Solving Real-World Problems with Traversals
    Algorithmic/directed walk over the graph
    Determining circular cependencies

Case Study: Ranking Discussion Contributors
    Ranking by number and depth of conributed discussions
    A note on sentiment and concept analysis

Graph Computing: Local, in-memory graph toolkits
    Graph analysis and visualization
    JUNG, NetworkX, and iGraph

Exercise: Modeling Graph Data with NetworkX
    Using NetworkX to model a complex s

Graph Computing: Batch Processing Graph Frameworks
    Leveraging leverage Hadoop for storage (HDFS) and processing (MapReduce)
    Overview of iterative algorithms
    Hama, Giraph, and GraphLab

Graph Computing: Graph-parallel Computation
    Unifying ETL, exploratory analysis, and iterative graph computation within a single system

Setup and Installation
    Hadoop and Spark

GraphX Operators
    Property, structural, join, neighborhood aggregation, caching and uncaching

Iterating with Pregel API
    Passing arguments for sending, receiving and computing

Building a Graph
    Using vertices and edges in an RDD or on disk

Designing Scalable Algorithms
    GraphX Optimization

Accessing Additional Algorithms
    PageRank, Connected Components, Triangle Counting

Exercis: Page Rank and Top Users
    Building and processing graph data using text files as input

Deploying to Production

Closing Remarks

Bookings, Prices and EnquiriesBookings, Prices and Enquiries

Guaranteed to run even with a single delegate!
Private Classroom
Private Classroom
Participants are from one organisation only. No external participants are allowed. Usually customised to a specific group, course topics are agreed between the client and the trainer.
Private Remote
From £4400
Private Remote
The instructor and the participants are in two different physical locations and communicate via the Internet. More Information

The more delegates, the greater the savings per delegate. Table reflects price per delegate and is used for illustration purposes only, actual prices may differ.

Number of Delegates Private Remote
1 £4400
2 £3100
3 £2667
4 £2450
Public Classroom
From £5000
Public Classroom
Participants from multiple organisations. Topics usually cannot be customised

The more delegates, the greater the savings per delegate. Table reflects price per delegate and is used for illustration purposes only, actual prices may differ.

Number of Delegates Public Classroom
1 £5000
2 £3425
3 £2900
4 £2638
Cannot find a suitable date? Choose Your Course Date >>
Too expensive? Suggest your price

Related Categories

Related Courses

Upcoming Courses

VenueCourse DateCourse Price [Remote / Classroom]
EdinburghMon, 2018-02-05 09:30£4400 / £6400
PortsmouthMon, 2018-02-05 09:30£4400 / £5000
York - Priory Street Centre Mon, 2018-02-05 09:30£4400 / £5000
Leicester - St. Georges HouseTue, 2018-02-06 09:30£4400 / £5400
Bradford - Carlisle Business CentreMon, 2018-02-12 09:30£4400 / £5400

Course Discounts

Course Discounts Newsletter

We respect the privacy of your email address. We will not pass on or sell your address to others.
You can always change your preferences or unsubscribe completely.

Some of our clients