Download and Run
Install in R
Install on Hadoop
Get started with H2O in 3 easy steps
1. Download H2O. This is a zip file that contains everything you need to get started.
2. From your terminal, run:
cd ~/Downloads
unzip h2o-2.8.1.1.zip
cd h2o-2.8.1.1
java -jar h2o.jar
3. Point your browser to http://localhost:54321
Use H2O directly from R
Copy and paste these commands into R one line at a time:
# The following two commands remove any previously installed H2O packages for R.
if ("package:h2o" %in% search()) { detach("package:h2o", unload=TRUE) }
if ("h2o" %in% rownames(installed.packages())) { remove.packages("h2o") }
# Next, we download, install and initialize the H2O package for R.
install.packages("h2o", repos=(c("http://h2o-release.s3.amazonaws.com/h2o/rel-markov/1/R", getOption("repos"))))
library(h2o)
localH2O = h2o.init()
# Finally, let's run a demo to see H2O at work.
demo(h2o.glm)
Run H2O on Hadoop in just 3 steps.
1. Download H2O. This is a zip file that contains everything you need to get started.
wget http://h2o-release.s3.amazonaws.com/h2o/rel-markov/1/h2o-2.8.1.1.zip
2. Unpack the zip file and launch a 1g instance of H2O:
unzip h2o-2.8.1.1.zip
cd h2o-2.8.1.1/hadoop
hadoop jar h2odriver_hdp2.1.jar water.hadoop.h2odriver -libjars ../h2o.jar -mapperXmx 1g -nodes 1 -output hdfsOutputDirName
3. Point your browser to any of the H2O nodes launched. The IP addresses of the nodes will be available in the output:
Determining driver host interface for mapper->driver callback...
[Possible callback IP address: 172.16.2.181]
[Possible callback IP address: 127.0.0.1]
...
Waiting for H2O cluster to come up...
H2O node 172.16.2.184:54321 requested flatfile
Sending flatfiles to nodes...
[Sending flatfile to node 172.16.2.184:54321]
H2O node 172.16.2.184:54321 reports H2O cluster size 1
H2O cluster (1 nodes) is up
Blocking until the H2O cluster shuts down...