Big Data Tools and Techniques - Hands On

5 Day Course
Hands On



Organizations are producing data at an exponentially growing rate. Big Data tools and stores now allow this data to be stored effectively, processed efficiently and mined for business value. Building solutions that use these technologies to store and process such large data sets in a timely manner remains a significant challenge.

This course aims to provide attendees with the knowledge of how to select and use appropriate technology to design effective Big Data solutions.


  • Understand the different types of NoSQL stores available
  • Select a suitable data store for a particular task
  • Architect effective NoSQL solutions
  • Process large data sets with Hadoop
  • Transform and query data using Pig and Hive
  • Consider effective and alternative transaction strategies when working with Big Data
  • Evaluate features and benefits of scale-out storage
  • Benefit from Data Mining





Target Audience

This course is for programmers working on designing a new Big Data solution, or who are joining an existing project as a new team member.




Course Content

Big Data: What’s New (2 topics)

  • What is Big Data
  • The technical challenges posed by Big Data

Overview of NoSQL Data Storage (5 topics)

  • Types of data stores
  • Key Value
  • Document based
  • Table Column
  • Graph

Key Value Data Stores (9 topics)

  • Introducing Redis
  • Comparing Redis to other data stores
  • When to use Redis
  • Redis data structures
  • Sorting and searching
  • Transactions with Redis
  • Locks and semaphores
  • Queues and publish and subscribe
  • Example Redis use cases

Column Family Data Stores (8 topics)

  • Introducing Cassandra
  • The Cassandra data model
  • Cassandra architecture
  • Writing and reading from Cassandra
  • Working with CQL
  • Eventual consistency
  • Lightweight transactions
  • Example Cassandra use cases

Document Data Stores (7 topics)

  • Introducing MongoDB
  • Creating, updating and deleting documents
  • Querying
  • Indexing
  • Aggregation
  • Working around lack of transactions
  • Example MongoDB use cases

Batch Processing Big Data (7 topics)

  • Preparing data for processing
  • Integrating disparate data
  • Hadoop Distributed File System (HDFS)
  • Batch processing with Hadoop
  • Customizing processing
  • Pig, Hive and Impala
  • Hadoop ecosystem: Oozie, Flume, Sqoop, Zookeeper

Scale-out Storage (4 topics)

  • Scale-out Storage defined
  • Scale-out Storage Architecture
  • Scale-out Storage Software
  • Scale-out Storage Benefits

Data Mining (5 topics)

  • Data mining defined
  • Data, Information and Knowledge
  • Functions of Data Mining
  • How Data Mining works
  • Data Mining technological infrastructure


This is a hands-on course; a working knowledge of Java is required.

Scheduled Dates



Different pricing structures are available, including special offers such as early bird, late availability, multi-place, corporate volume and self-funding rates. Please arrange a discussion with a training advisor to find your most cost-effective option.

Location                      Duration   Price
Virtual Classroom (London)    4 Days     $3,470
