Skip to content

kzk/elephant-driver

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

About

'elephant-driver' is a Ruby library to communicate with Hadoop daemons.

Notice

Currently, only communicating with JobTracker is supported. Other daemons (TaskTracker, NameNode, DataNode) are not supported yet.

Requirements

  • thfift
  • nokogiri

Setup

This library assumes that you're using CDH3 (Cloudera Distribution for Hadoop, version 3).

You first need to install 'hue-plugins' package at the JobTracker node.

$ apt-get install hue-plugins
$ yum install hue-plugins

Then, the following settings need to be included in your mapred-site.xml.

<!-- Enable Hue plugins -->
<property>
  <name>mapred.jobtracker.plugins</name>
  <value>org.apache.hadoop.thriftfs.ThriftJobTrackerPlugin</value>
  <description>Comma-separated list of jobtracker plug-ins to be activated.</description>
</property>
<property>
  <name>jobtracker.thrift.address</name>
  <value>0.0.0.0:9290</value>
</property>

Usage

See spec/ directory for the example usage.

@cln = ElephantDriver::Client.new($HOST, $PORT)
@cln.jobs.each { |j|
  j.tasks.each { |t|
    t.counters
  }
}

About

Ruby library to talks with Hadoop daemons via Thrift

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages