Hadoop

You are viewing the ZenPack Archive

This page is part of the ZenPack Archive. Archived ZenPacks may not be compatible with your version of Zenoss Service Dynamics or Zenoss Cloud.

Open Source

This ZenPack is developed and supported by Zenoss Inc. Contact Zenoss to request more information regarding this or any other ZenPacks. Click here to view all available Zenoss Open Source ZenPacks.

Organization

Zenoss, Inc.

License

GNU General Public License, Version 2, or later

Name

ZenPacks.zenoss.Hadoop

More Information

GitHub page/HomePage

Git Sources (For Cloning)

Link

Applications Monitored

Apache Hadoop

Hadoop ZenPack

Monitoring for the Apache Hadoop framework.

Support

This is an Open Source ZenPack developed by Zenoss, Inc. Enterprise support for this ZenPack is available to commercial customers with an active subscription.

Releases

Version 1.0.0- Download: Released on 2014/09/05: Requires PythonCollector ZenPack: Compatible with Zenoss Core 4.2.x, Zenoss Resource Manager 4.1.x, Zenoss Resource Manager 4.2.x, Zenoss Resource Manager 5.0.x

Background

The ZenPacks.zenoss.Hadoop ZenPack monitors Hadoop clusters via JMX web interface (JMXJsonServlet) and Python twisted.web.client asynchronous framework.

Features

The features added by this ZenPack can be summarized as follows. They are each detailed further below.

Discovery and periodic remodeling of relevant components.
Performance monitoring.
Event monitoring.
Optional service impact with addition of Zenoss Service Dynamics product.
Optional HBase auto discovery.

Discovery

The following components will be automatically discovered through zProperties you provide:

Hadoop Data Node: Attributes: Name, HBase Device, Last Contacted, Health State

Hadoop Secondary Name Node: Attributes: Name, Health State

Hadoop Job Tracker: Attributes: Name, Health State

Hadoop Task Tracker: Attributes: Name, Health State

Hadoop Resource Manager: Attributes: Name, Health State

Hadoop Node Manager: Attributes: Name, Health State

Hadoop Job History: Attributes: Name, Health State

Note: Yarn components (Resource Manager, Node Manager, Job History) will be modeled for 2.x version of Hadoop and MapReduce components (Job Tracker, Task Tracker) for previous versions.

Performance Monitoring

The following metrics will be collected and graphed every 5 minutes by default:

Device (Hadoop Name Node): Heap/Non Heap Memory: Heap Memory Capacity Bytes, Heap Memory Used Bytes, Non Heap Memory Capacity Bytes, Non Heap Memory Used Bytes: Nodes: Dead Nodes Count, Live Nodes Count: Performance metrics: Threads, Total Files, HDFS: Configured Capacity, DFS Remaining, DFS Used

Hadoop Data Node: Blocks: Blocks Written, Blocks Read, Blocks Removed: Heap/Non Heap Memory: Heap Memory Capacity Bytes, Heap Memory Used Bytes, Non Heap Memory Capacity Bytes, Non Heap Memory Used Bytes

Hadoop Secondary Name Node: Heap/Non Heap Memory: Heap Memory Capacity Bytes, Heap Memory Used Bytes, Non Heap Memory Capacity Bytes, Non Heap Memory Used Bytes

Hadoop Task Tracker: Tasks: Tasks Completed, Tasks Failed Timeout, Tasks Failed Ping: Performance metrics: Reduces Running, Maps Running

Hadoop Job Tracker: Heap/Non Heap Memory: Heap Memory Capacity Bytes, Heap Memory Used Bytes, Non Heap Memory Capacity Bytes, Non Heap Memory Used Bytes: Jobs: Jobs Running, Jobs Failed, Jobs Completed: Performance metrics: Reduces Running, Maps Running

Hadoop Resource Manager: Heap/Non Heap Memory: Heap Memory Capacity Bytes, Heap Memory Used Bytes, Non Heap Memory Capacity Bytes, Non Heap Memory Used Bytes: Nodes: Active Nodes, Decommissioned Nodes, Lost Nodes, Unhealthy Nodes, Rebooted Nodes: Apps: Apps Submitted, Apps Pending, Apps Running, Apps Completed

Hadoop Node Manager: Heap/Non Heap Memory: Heap Memory Capacity Bytes, Heap Memory Used Bytes, Non Heap Memory Capacity Bytes, Non Heap Memory Used Bytes: Containers: Containers Running, Containers Launched, Containers Completed, Containers Failed, Containers Initing, Containers Killed

Hadoop Job History: Heap/Non Heap Memory: Heap Memory Capacity Bytes, Heap Memory Used Bytes, Non Heap Memory Capacity Bytes, Non Heap Memory Used Bytes

Event monitoring

The following events will be triggered:

Error: Any of the components is down or not available
Info: HBase was discovered on one of the Data Nodes or Name Node

Service Impact

When combined with the Zenoss Service Dynamics product, this ZenPack adds built-in service impact capability for Hadoop. The following service impact relationships are automatically added. These will be included in any services containing one or more of the explicitly mentioned entities.

Service Impact Relationships

Secondary Name Node, Job Tracker, Task Tracker, Resource Manager, Node Manager or Job History failure affects associated Data Node.
Secondary Name Node, Job Tracker, Task Tracker, Resource Manager, Node Manager and Job History are affected by hosting Device failure.

External Impact Relationships

Data Node failure affects associated HBase Region Servers.
Secondary Name Node, Job Tracker, Task Tracker, Resource Manager, Node Manager and Job History are affected by ZooKeeper failure.

HBase Autodiscover

With a fully deployed Hadoop environment, the HBase Master server can move around to different Hadoop Nodes. This ZenPack provides optional discovery of HBase Master Device within Hadoop Data Nodes, which may be configured with the help of the following configuration properties:

zHBaseAutodiscover
zHBaseDeviceClass

If you checkmark zHBaseAutodiscover property, every 5 minutes by default each Hadoop Data Node will be probed for hosting HBase Master. In case HBase is found, the HBase device will be created in the Device Class set in zHBaseDeviceClass (usually /Server/Linux or /Server/Microsoft/Windows) with HBase modeler plugins bound. If the HBase device was already discovered, but HBase Master moves to a different Node, the device's IP address will be changed to a new one.

Usage

Use the following steps to start monitoring Hadoop cluster:

Select Infrastructure from the navigation bar.
Click the device name in the device list. The device overview page appears.
Select Configuration Properties from the left panel.
Set zHadoopUsername, zHadoopPassword and select https for zHadoopScheme if you have Basic access authentication configured on your Hadoop UI instances (otherwise leave zHadoopUsername and zHadoopPassword blank).
Set zHadoopNameNodePort if the Name Node UI instance port differs from the default one.
Optionally checkmark zHBaseAutodiscover to setup discovering HBase on Hadoop data nodes.
Optionally set the device class for discovered HBase device in zHBaseDeviceClass if it differs from the default one.
Navigate to the Modeler plugins page of the device containing your Hadoop server, add the HadoopDataNode and HadoopServiceNode modeler plugins.
Select Model device from the gear menu.

Alternatively you can use zenbatchload to add Hadoop monitoring servers from the command line. To do this, you must create a text file with hostname, username and password of all the servers you want to add. The devices should be added under the target machine device class (e.g. '/Devices/Server/Linux' in the example below).

/Devices/Server/Linux <HOST NAME or IP>
zCollectorPlugins=['HadoopDataNode', 'HadoopServiceNode'], zHadoopNameNodePort='50070' zHadoopPassword='', zHadoopUsername='', zHadoopScheme='<http(s)>'

You can then load the Hadoop monitoring servers into Zenoss Core or Resource Manager as devices with the following command.

$ zenbatchload <filename>

Prerequisites

This ZenPack uses the NameNode WebUI port for Hadoop modeling and performance monitoring, therefore please make sure that the UI instance is running. For more information refer to HDFS Ports.

Troubleshooting

Please refer to the Zenoss Service Dynamics documentation if you run into any of the following problems:

ZenPack will not install
Adding a device fails
Don't understand how to add a device
Don't understand how to model a device

If you cannot find the answer in the documentation, then Resource Manager (Service Dynamics) users should contact Zenoss Customer Support. Core users can use the #zenoss IRC channel or the community.zenoss.org forums.

Installed Items

Installing this ZenPack will add the following items to your Zenoss system.

Modeler Plugins

HadoopDataNode
HadoopServiceNode

Monitoring Templates

Hadoop (in /Device)
HadoopDataNode (in /Device)
HadoopSecondaryNameNode (in /Device)
HadoopJobTracker (in /Device)
HadoopTaskTracker (in /Device)
HadoopResourceManager (in /Device)
HadoopNodeManager (in /Device)
HadoopJobHistory (in /Device)
HBaseDiscoverMonitor (in /Device)

Component Types

HadoopDataNode (on related device)
HadoopSecondaryNameNode (on related device)
HadoopJobTracker (on related device)
HadoopTaskTracker (on related device)
HadoopResourceManager (on related device)
HadoopNodeManager (on related device)
HadoopJobHistory (on related device)

Daemons


Type	Name
Performance Collector	zenpython

Changes

1.0.0

Initial release