Download Sparkling Water 2.1.33

Download Sparkling Water

Integration info

H2O version: 3.20.0.2 wright (documentation)
Spark version: 2.1.2 (documentation)
Sparkling Water: Documentation, Changelog

Get started with Sparkling Water in a few easy steps

1. Download Spark (if not already installed) from the Spark Downloads Page.

Choose Spark release: 2.1.2
Choose a package type: Pre-built for Hadoop 2.7 and later

2. Point SPARK_HOME to the existing installation of Spark and export the MASTER variable:

export SPARK_HOME="/path/to/spark/installation"
# To launch a local Spark cluster.
export MASTER="local[*]"

3. From your terminal, run:

cd ~/Downloads
unzip sparkling-water-2.1.33.zip
cd sparkling-water-2.1.33
bin/sparkling-shell --conf "spark.executor.memory=1g"

4. Create an H₂O cloud inside the Spark cluster:

import org.apache.spark.h2o._
val h2oContext = H2OContext.getOrCreate(spark)
import h2oContext._

5. Follow this demo, which imports airlines and weather data and runs predictions on delays.
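
The environment set up in step 2 can be checked before launching the shell in step 3. The following is a minimal sketch, not part of the Sparkling Water distribution: the function name check_spark_env is hypothetical, and it assumes a standard Spark layout in which bin/spark-shell sits under SPARK_HOME.

```shell
# Hypothetical helper (not shipped with Sparkling Water): verifies that the
# two values exported in step 2 look usable before sparkling-shell is started.
check_spark_env() {
  # $1 = value of SPARK_HOME, $2 = value of MASTER
  if [ -z "$1" ] || [ ! -d "$1" ]; then
    echo "SPARK_HOME is unset or not a directory: '$1'" >&2
    return 1
  fi
  # Assumption: a standard Spark installation ships bin/spark-shell.
  if [ ! -x "$1/bin/spark-shell" ]; then
    echo "No executable bin/spark-shell under $1" >&2
    return 1
  fi
  if [ -z "$2" ]; then
    echo "MASTER is unset" >&2
    return 1
  fi
  echo "OK: SPARK_HOME=$1 MASTER=$2"
}
```

One way to use it is to gate the launch on the check, for example by running check_spark_env "$SPARK_HOME" "$MASTER" and starting bin/sparkling-shell only if it succeeds.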

Launch Sparkling Water on Hadoop using YARN

1. Download Spark (if not already installed) from the Spark Downloads Page.

Choose Spark release: 2.1.2
Choose a package type: Pre-built for Hadoop 2.7 and later

2. Point SPARK_HOME to an existing installation of Spark:

export SPARK_HOME='/path/to/spark/installation'

3. Set the HADOOP_CONF_DIR and Spark MASTER environment variables:

export HADOOP_CONF_DIR=/etc/hadoop/conf
export MASTER="yarn"

4. Download Sparkling Water and use sparkling-shell to launch Sparkling Shell on YARN:

wget /sparkling-water-2.1.33.zip
unzip sparkling-water-2.1.33.zip
cd sparkling-water-2.1.33/
bin/sparkling-shell --num-executors 3 --executor-memory 2g --master yarn --deploy-mode client

5. Create an H₂O cloud inside the Spark cluster:

import org.apache.spark.h2o._
val h2oContext = H2OContext.getOrCreate(spark)
import h2oContext._
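
Steps 3 and 4 above can be made a little more defensive and easier to adjust. The sketch below is illustrative only: both function names are hypothetical, and the check assumes that a Hadoop client configuration directory contains core-site.xml and yarn-site.xml, which is typical but may differ on some distributions. The command line uses only the flags shown in step 4.

```shell
# Hypothetical helpers, not part of the Sparkling Water distribution.

# Succeeds only if the directory in $1 looks like a Hadoop client
# configuration directory.
# Assumption: core-site.xml and yarn-site.xml are present there.
check_hadoop_conf() {
  for f in core-site.xml yarn-site.xml; do
    if [ ! -f "$1/$f" ]; then
      echo "Missing $1/$f" >&2
      return 1
    fi
  done
}

# Prints the sparkling-shell command line from step 4 for a given
# executor count ($1) and per-executor memory ($2).
yarn_shell_cmd() {
  echo "bin/sparkling-shell --num-executors $1 --executor-memory $2 --master yarn --deploy-mode client"
}
```

For instance, check_hadoop_conf "$HADOOP_CONF_DIR" can be run before launching, and yarn_shell_cmd 3 2g prints the same command shown in step 4, so the executor count and memory can be changed in one place.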

Launch H2O on a Standalone Spark Cluster

1. Download Spark (if not already installed) from the Spark Downloads Page.

Choose Spark release: 2.1.2
Choose a package type: Pre-built for Hadoop 2.7 and later

2. Point SPARK_HOME to an existing installation of Spark:

export SPARK_HOME='/path/to/spark/installation'

3. From your terminal, run:

cd ~/Downloads
unzip sparkling-water-2.1.33.zip
cd sparkling-water-2.1.33
bin/launch-spark-cloud.sh
export MASTER="spark://localhost:7077"
bin/sparkling-shell

4. Create an H₂O cloud inside the Spark cluster:

import org.apache.spark.h2o._
val h2oContext = H2OContext.getOrCreate(spark)
import h2oContext._
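
The MASTER value exported in step 3 follows the spark://host:port form used by standalone Spark masters. As a small sketch, a hypothetical helper (spark_master_url is not part of Sparkling Water) can build that value, defaulting to the localhost and 7077 values used above:

```shell
# Hypothetical helper: builds a standalone master URL.
# Defaults mirror the value used in step 3 (localhost, port 7077).
spark_master_url() {
  echo "spark://${1:-localhost}:${2:-7077}"
}

# Equivalent to: export MASTER="spark://localhost:7077"
export MASTER="$(spark_master_url)"
```

Passing a host name and port as arguments would point the shell at a master running elsewhere, assuming that master is reachable from the machine running sparkling-shell.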

Kluster

Gradle-style specification for Maven artifacts

RSparkling

H2O R Client

PySparkling

PySparkling installed from the PyPI repository

H2O Python Client

Sparkling Water as Spark Package

Documentation