Hadoop for Project Managers


Duration

14 hours (usually 2 days including breaks)


Requirements

  • A general understanding of programming
  • An understanding of databases
  • Basic knowledge of Linux


Overview

As more software and IT projects migrate from local processing and data management to distributed processing and big data storage, Project Managers need to upgrade their knowledge and skills to grasp the concepts and practices relevant to Big Data projects and opportunities.

This course introduces Project Managers to one of the most widely used Big Data processing frameworks: Hadoop.

In this instructor-led training, participants will learn the core components of the Hadoop ecosystem and how these technologies can be used to solve large-scale problems. By learning these foundations, participants will also improve their ability to communicate with the developers and implementers of these systems, as well as with the data scientists and analysts who take part in many IT projects.


Audience

  • Project Managers who wish to integrate Hadoop into their existing development or IT infrastructure
  • Project Managers who need to communicate with cross-functional teams that include big data engineers, data scientists and business analysts

Format of the course

  • Part lecture, part discussion, exercises and heavy hands-on practice

Course Outline


Introduction

  • Why and how project teams adopt Hadoop
  • How it all started
  • The Project Manager's role in Hadoop projects

Understanding Hadoop's architecture and key concepts

  • HDFS
  • MapReduce
  • Other pieces of the Hadoop ecosystem

What constitutes Big Data?

Different approaches to storing Big Data

HDFS (Hadoop Distributed File System) as the foundation

How Big Data is processed

  • The power of distributed processing

Processing data with MapReduce

  • How data is picked apart step by step

The role of clustering in large-scale distributed processing

  • Architectural overview
  • Clustering approaches

Clustering your data and processes with YARN

The role of non-relational databases in Big Data storage

Working with Hadoop's non-relational database: HBase

Data warehousing architectural overview

Managing your data warehouse with Hive

Running Hadoop from shell scripts

Working with Hadoop Streaming

Other Hadoop tools and utilities

Getting started on a Hadoop project

  • Demystifying complexity

Migrating an existing project to Hadoop

  • Infrastructure considerations
  • Scaling beyond your allocated resources

Hadoop project stakeholders and their toolkits

  • Developers, data scientists, business analysts and project managers

Hadoop as a foundation for new technologies and approaches

Closing remarks

Bookings, Prices and Enquiries

Guaranteed to run even with a single delegate!

Private Classroom

From £2500

Private Remote

From £2200

