Use Sparkling Water via Spark Packages¶
Sparkling Water is also published as a Spark package. You can use it directly from your Spark distribution. The name of the published package is ai.h2o:sparkling-water-package, and it references all published Sparkling Water modules. Moreover, each module can be used as a Spark package if necessary.
If you would like to use Sparkling Water version 188.8.131.52-1-3.0 on Spark 3.0 and launch example
CraigslistJobTitlesStreamingApp, then you can use the following
$SPARK_HOME/bin/spark-submit --packages ai.h2o:sparkling-water-package_2.12:184.108.40.206-1-3.0 --class ai.h2o.sparkling.examples.CraigslistJobTitlesStreamingApp /dev/null
The Spark option
--packages points to the Duke package and the published Sparkling Water packages in the Maven repository.
A similar command works for
$SPARK_HOME/bin/spark-shell --packages ai.h2o:sparkling-water-package_2.12:220.127.116.11-1-3.0
The same command works for Python programs:
$SPARK_HOME/bin/spark-submit --packages ai.h2o:sparkling-water-package_2.12:18.104.22.168-1-3.0 example.py
Note: When you are using Spark packages, you do not need to download the Sparkling Water distribution. The Spark installation is sufficient.