Apache Storm Introduction – Architecture and Installation


Overview/Description
Apache Storm is a fast, scalable, open source distributed system for real-time computation. Storm makes it easy to reliably process unbounded streams of data, doing for real-time processing what Hadoop did for batch processing. Storm is simple, can be used with any programming language, and is a lot of fun. This course introduces Storm, taking an in-depth look at its concepts and how it works. Apache Storm integrates with many other products; this course introduces ZooKeeper and the Storm Trident architecture and explains how they are used with the Storm model.

Target Audience
Individuals interested in learning the architecture of Apache Storm and how it can be used.

Prerequisites
None

Expected Duration (hours)
2.8

Lesson Objectives

Apache Storm Introduction – Architecture and Installation

  • start the course
  • describe Apache Storm and its characteristics at a high level
  • describe why Apache Storm is used
  • describe the Apache Storm Architecture
  • identify a tuple and a bolt and their use in Storm
  • identify a spout and its use in Storm
  • identify streams and their use in Storm
  • describe the different operation modes of Storm
  • identify Storm components and their functionality in the source code for an example Storm application
  • describe the setup process for an integrated Storm development environment
  • use Maven to compile and run a Storm application
  • describe the installation and setup process for ZooKeeper as a standalone server
  • install and set up ZooKeeper on a development machine
  • deploy a ZooKeeper server in standalone mode and test it with a ZooKeeper client connection
  • describe the process for setting up and deploying a ZooKeeper cluster
  • demonstrate the process of setting up a production Storm cluster
  • describe the process of configuring the parallelism of a topology
  • configure the parallelism of spout and bolt components in a Storm topology
  • briefly describe stream groupings and their types
  • use stream groupings in a Storm topology
  • describe the Guaranteed Messaging Process
  • describe the fault-tolerant characteristics of Storm
  • briefly describe what Trident is and how it's used
  • describe Trident's data model and its use
  • describe several operations of Trident
  • test your knowledge of Apache Storm and the components of the system

Course Number
df_apsi_a01_it_enus

Expertise Level
Beginner