Download Sparkling Water 1.4.3

Download Run on Hadoop Run on Standalone Cluster Use from Maven R/Python/Spark

Get started with Sparkling Water in a few easy steps

1. Download Spark (if not already installed) from the Spark Downloads Page

Choose Spark release : 1.4.0
Choose a package type: Pre-built for Hadoop 2.4 and later

2. Download Sparkling Water and point it to the existing installation of Spark by setting the SPARK_HOME environment variable:

export SPARK_HOME='/path/to/spark/installation'

3. From your terminal, run:

cd ~/Downloads
unzip sparkling-water-1.4.3.zip
cd sparkling-water-1.4.3
bin/sparkling-shell

4. Create an H₂O cloud inside the Spark cluster:

import org.apache.spark.h2o._
val h2oContext = new H2OContext(sc).start()
// Or if you know the number of Spark workers:
// val h2oContext = new H2OContext(sc).start( <number of Spark workers> )
import h2oContext._

5. Follow this demo, which imports airlines and weather data and runs predictions on delays.

Documentation

Integration info

H2O version: 3.0.0.26 shannon (documentation)
Spark version: 1.4.0 (documentation)

Sparkling Water

Get started with Sparkling Water in a few easy steps

Launch Sparkling Water on Hadoop using Yarn.

Launch H2O on a Standalone Spark Cluster

Gradle-style specification for Maven artifacts

R client

Python client

Documentation

Integration info