Running Sparkling Water
To run Sparkling Water, the SPARK_HOME environment variable must be set and must point to the Spark distribution.
H2O on Spark can be started either in the Spark shell or from within a Spark application. To start it in the Spark shell, run:
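As a quick sanity check before launching anything, the requirement above can be verified programmatically. The following is a minimal sketch, not part of Sparkling Water: the object name `SparkHomeCheck` and the helper `resolveSparkHome` are hypothetical, and the test for `bin/spark-submit` is only a heuristic for recognizing a Spark distribution.

```scala
import java.nio.file.{Files, Path, Paths}

// Hypothetical helper, not part of Sparkling Water.
object SparkHomeCheck {

  /** Returns the Spark distribution path, or a message describing what is wrong. */
  def resolveSparkHome(env: Map[String, String]): Either[String, Path] =
    env.get("SPARK_HOME").map(_.trim).filter(_.nonEmpty) match {
      case None =>
        Left("SPARK_HOME is not set")
      case Some(value) =>
        val home = Paths.get(value)
        if (!Files.isDirectory(home))
          Left(s"SPARK_HOME is not a directory: $home")
        // Heuristic (assumption): a Spark distribution ships bin/spark-submit.
        else if (!Files.exists(home.resolve("bin").resolve("spark-submit")))
          Left(s"No bin/spark-submit found under $home")
        else
          Right(home)
    }

  def main(args: Array[String]): Unit =
    resolveSparkHome(sys.env) match {
      case Right(home) =>
        println(s"Using Spark distribution at $home")
      case Left(error) =>
        Console.err.println(error)
        sys.exit(1)
    }
}
```

Passing the environment in as a `Map` keeps the check easy to exercise without touching the real process environment.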
./bin/sparkling-shell
In the shell or in an application, Sparkling Water (H2O on Spark) is then initialized with the following call:
val hc = H2OContext.getOrCreate()
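For the application case, the same call can be placed inside a regular Spark program. The sketch below is illustrative only: the object name and application name are made up, and the import path assumes a recent Sparkling Water release that exposes `H2OContext` under `ai.h2o.sparkling`. Older releases used `org.apache.spark.h2o.H2OContext` and passed the Spark session to `getOrCreate`, so check the documentation for the version in use. The Sparkling Water and Spark artifacts must be on the classpath.

```scala
import org.apache.spark.sql.SparkSession
// Assumption: package name used by recent Sparkling Water releases.
import ai.h2o.sparkling.H2OContext

// Hypothetical application skeleton.
object SparklingWaterApp {
  def main(args: Array[String]): Unit = {
    // The master URL and other Spark settings are expected to come from spark-submit.
    val spark = SparkSession.builder()
      .appName("sparkling-water-sketch")
      .getOrCreate()

    // Starts H2O, or connects to it, depending on the configured backend.
    val hc = H2OContext.getOrCreate()
    println(hc)

    spark.stop()
  }
}
```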
The semantics of the call depend on the configured Sparkling Water backend. For more information about the backends, please see Sparkling Water Backends.
In internal backend mode, the call will:
- Collect the number and hostnames of the executors (worker nodes) in the Spark cluster 
- Launch H2O services on each detected executor 
- Create a cloud for H2O services based on the list of executors 
- Verify the H2O cloud status 
In external backend mode, the call will:
- Start H2O in client mode on the Spark driver 
- Start a separate H2O cluster on the configured YARN queue
- Connect to the external cluster from the H2O client 
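The backend is chosen through configuration before `getOrCreate` is called. The following fragment sketches how the external backend with an automatically started cluster might be selected from the Spark shell. It assumes the `H2OConf` builder methods shown here are available in the installed release; the queue name and cluster size are placeholder values, and a real deployment may need further settings, so treat it as a starting point to check against the Sparkling Water Backends page rather than a definitive configuration.

```scala
// Assumption: package name used by recent Sparkling Water releases.
import ai.h2o.sparkling.{H2OConf, H2OContext}

val conf = new H2OConf()
  .setExternalClusterMode()   // use the external backend instead of the internal one
  .useAutoClusterStart()      // let Sparkling Water start the H2O cluster itself
  .setYARNQueue("default")    // placeholder: YARN queue on which the cluster is started
  .setClusterSize(2)          // placeholder: number of H2O nodes to start

// With this configuration, the call follows the external backend steps listed above.
val hc = H2OContext.getOrCreate(conf)
```

Calling `setInternalClusterMode()` on the configuration instead would select the internal backend.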
To see how to run Sparkling Water on Windows, please visit Use Sparkling Water in Windows Environments.