forked from h2oai/h2o-2
-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathREADME.txt
78 lines (46 loc) · 1.79 KB
/
README.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
RUNNING H2O NODES IN HADOOP
===========================
Note: You may want to do all of this from the machine where you plan
to launch the hadoop jar job from. Otherwise you will end up having
to copy files around.
(If you grabbed a prebuilt h2o-*.zip file, copy it to a hadoop machine
and skip to the PREPARE section below.)
GET H2O TREE FROM GIT
---------------------
$ git clone https://github.com/0xdata/h2o.git
$ cd h2o
BUILD CODE
----------
$ make
COPY BUILD OUTPUT TO HADOOP NODE
--------------------------------
Copy target/h2o-*.zip <to place where you intend to run hadoop command>
PREPARE JOB INPUT ON HADOOP NODE
--------------------------------
$ unzip h2o-*.zip
$ cd h2o-*
$ cd hadoop
RUN JOB
-------
$ hadoop jar h2odriver_cdh4.jar water.hadoop.h2odriver [-jt <jobtracker:port>] -libjars ../h2o.jar -mapperXmx 1g -nodes 1 -output hdfsOutputDirName
(Note: -nodes refers to H2O nodes. This may be less than or equal to
the number of hadoop machines running TaskTrackers where hadoop
mapreduce Tasks may land.)
(Note: Make sure to use the h2odriver flavor for the correct version
of hadoop! We recommend running the hadoop command from a
machine in the hadoop cluster.)
(Note: Port 8021 is the default jobtracker port for Cloudera.
Port 9001 is the default jobtracker port for MapR.)
MONITOR JOB
-----------
Use standard job tracker web UI. (http://<jobtrackerip>:50030)
Different distros sometimes have different job tracker Web UI ports.
The cloudera default is 50030.
SHUT DOWN THE CLUSTER
---------------------
Bring up H2O web UI: http://<h2onode>:54321
Choose Admin->Shutdown
(Note: Alternately use the "hadoop job -kill" command.)
FOR MORE INFORMATION
--------------------
$ hadoop jar hadoop/h2odriver_cdh4.jar water.hadoop.h2odriver -help