Change Log

v2.3.31 (2019-06-24)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/31/index.html

  • Bug
    • SW-1140 - Add more logging to discover intermittent RSparkling Issue in jenkins tests
    • SW-1318 - add back to JavaH2OContext method asDataFrame(.., SQLContext) but deprecated
    • SW-1321 - Remove mention of H2O UDP from user documentation
    • SW-1322 - Fix wrong doc in ssl.rst -> val conf: H2OConf = // generate H2OConf file
    • SW-1323 - Model ID not available on our algo pipeline wrappers
    • SW-1338 - Follow up fixes after RSparkling change
    • SW-1339 - Use s3-cli instead of s3cmd because of performance reasons on nightlies
    • SW-1340 - Fix spinx warning
    • SW-1342 - Fix dist
    • SW-1343 - Fix dist structure
    • SW-1345 - Fix missing rsparkling in dist package
    • SW-1347 - Scaladoc not uploaded to S3 after porting make-dist to gradle
    • SW-1359 - Fix wrong links on nightly build page
    • SW-1360 - Explicitly send hearbeat after we have complete flatfile
    • SW-1361 - sparkling water package on maven should assembly jar
    • SW-1362 - gradle.properties in distribution contains wrong version
    • SW-1364 - Rename SVM to SparkSVM
    • SW-1374 - Minor documentation fixes
  • New Feature
    • SW-1021 - Upload RSparkling to S3 in a form of R repository
    • SW-1353 - Introduce logic flatting data frames with arbitrarily nested structures
  • Improvement
    • SW-554 - Include all used dependency licenses in the uber jar.
    • SW-1308 - Bundle Sparkling Water jar into rsparkling -> making rsparkling version dependent on specific sparkling water
    • SW-1317 - Unify repl acros different rel branches
    • SW-1325 - Expose jks_alias in Sparkling Water
    • SW-1326 - Include SW version in more log statements
    • SW-1330 - Add additional log to H2O cloudup in internal backend mode
    • SW-1331 - Create local repo with RSparkling
    • SW-1332 - [RSparkling] Make installation from S3 the default recommended option
    • SW-1333 - Move the conversion logic from Spark Row to H2O RowData to a separate entity
    • SW-1334 - Store H2O models in transient lazy variables of SW Mojo models
    • SW-1335 - Make automl tests more deterministic by using max_models instead of max_runtime_secs
    • SW-1341 - Use readme as main dispatch for documentation
    • SW-1346 - Remove chache and unpersist call in SpreadRDDBuilder
    • SW-1348 - Switch to s3 cli on release pipelines
    • SW-1349 - Use withColumn instead of select in MOJO models
    • SW-1350 - Fix links to doc & scaladoc on nightly builds
    • SW-1352 - Upgrade H2O to 3.24.0.5
    • SW-1365 - Run only last build in jenkins
    • SW-1369 - Download page is missing one step on RSparkling tab -> library(rsparkling)
    • SW-1371 - Create maven repo on our s3 for each release and nightly
    • SW-1373 - Update DBC documentation with respoect to latest RSparkling development

v2.3.30 (2019-06-03)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/30/index.html

  • Bug
    • SW-1259 - Unify ratio param across pipeline api
    • SW-1287 - Use RPC endpoints to orchestrate cloud in internal mode
    • SW-1290 - Fix doc
    • SW-1301 - Fix class-loading for Sparkling Water assembly JAR in PySparkling
    • SW-1311 - Add numpy as PySparkling dependency ( it is required because of Spark but missing from list of dependencies)
    • SW-1312 - Warn that default value of convertUnknownCategoricalLevelsToNa will be changed to false on GridSearch & AutoML
    • SW-1316 - Fix wrong fat jar name
  • Task
    • SW-1292 - Benchmarks: Subproject Skeleton
  • Improvement
    • SW-1212 - Make sure python zip/wheel is downloadable from our release s3
    • SW-1274 - On download page -> list all supported minor versions
    • SW-1286 - Remove Param propagation of MOJOModels from Python to Java
    • SW-1288 - H2OCommonParams in pysparkling
    • SW-1289 - Move shared params to H2OCommonParams
    • SW-1298 - Don't use deprecated methods
    • SW-1299 - Warn user that default value of predictionCol on H2OMOJOModel will change in the next major release to 'prediction'
    • SW-1300 - Upgrade to H2O 3.24.0.4
    • SW-1304 - Definition of assembly jar via transitive exclusions
    • SW-1305 - Move ability to change behavior of MOJO models to MOJOLoader
    • SW-1306 - Move make-dist logic to gradle
    • SW-1307 - Expose binary model in spark pipeline stage
    • SW-1309 - Fix xgboost doc
    • SW-1313 - Rename the 'create_from_mojo' method of H2OMOJOModel and H2OMOJOPipelineModel to 'createFromMojo'

v2.3.29 (2019-05-17)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/29/index.html

  • Bug
    • SW-1256 - Fix constructor of H2OMojoModel
    • SW-1258 - Remove internal constructors & Deprecate implicit constructor parameters for H2O Algo Spark Estimators( to be the same as in PySparkling)
    • SW-1270 - Fix version check in PySpakrling shell
    • SW-1278 - Clean workspace on the hadoop node in integ tests
    • SW-1279 - Fix inconsistencies between H2OAutoML, H2OGridSearch & H2OALgorithm
    • SW-1281 - Fix bad representation of predictionCol on H2OMOJOModel
    • SW-1282 - XGBoost can't be used in H2OGridSearch pipeline wrapper
    • SW-1283 - Correctly return mojo model in pysparkling after fit
  • Story
    • SW-1271 - Remove SparkContext from H2OSchemaUtils
    • SW-1273 - Upgrade to H2O 3.24.0.3
  • New Feature
    • SW-1248 - getFeaturesCols() should not return the fold column or weight column
    • SW-1249 - probability calibration does not work in Sparkling Water Dataframe API
  • Improvement
    • SW-369 - Override spark locality so we use only nodes on which h2o is running.
    • SW-1216 - Improve PySparkling README
    • SW-1261 - Remove binary H2O model from ML pipelines
    • SW-1263 - Don't require initializer call to be called during pysparkling pipelines
    • SW-1264 - Use default params reader in pipelines
    • SW-1268 - Non-named columns are long time deprecated. Switch to named columns by default
    • SW-1269 - Remove six as dependency from PySparkling launcher ( six is no longer dependency)
    • SW-1275 - Remove unnecessary constructor in helper class
    • SW-1280 - Add predictionCol to mojo pipeline model

v2.3.28 (2019-04-26)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/28/index.html

  • Bug
    • SW-1186 - No need to pass properties defined in spark-defaults.conf to cli
    • SW-1189 - Fix Sparkling Water 2.1.x compile on Scala 2.10
    • SW-1194 - RSparkling Can't be used on Spark 2.4
    • SW-1195 - Disable gradle daemon via gradle.properties
    • SW-1196 - Fix org.apache.spark.ml.spark.models.PipelinePredictionTest
    • SW-1203 - Custom metric not evaluated in internal mode of Sparkling Water
    • SW-1227 - Change get-extended-jar to use https instead of http
    • SW-1230 - Fix typo in GLM API - getRemoteCollinearColumns, setRemoteCollinearColumns
    • SW-1232 - Fix RUnits after upgrading to Gradle 5.3.1
    • SW-1234 - Deprecate asDataFrame with implicit argument
  • Story
    • SW-1198 - Introduce new annotation deprecating legacy methods in API
    • SW-1209 - Rename the 'predictionCol' model parameter to 'labelCol'
    • SW-1226 - Introduce mechanism for enabling backward compatibility of MOJO files when properties are renamed
  • New Feature
    • SW-1193 - Expose weights_column parameter
  • Improvement
    • SW-1188 - RSparkling: Add ability to add authentication details when calling h2o_context(sc)
    • SW-1190 - Improve hint description for disabling automatic usage of broadcast joins
    • SW-1199 - Improve memory efficiency of H2OMOJOPipelineModel
    • SW-1202 - Simplify Sparkling Water build
    • SW-1204 - Fix formating in python tests
    • SW-1208 - Create pysparkling tests report file if it does not exist
    • SW-1210 - Add fold column to python and scala pipelines
    • SW-1211 - Automatically download H2O Wheel
    • SW-1213 - Upgrade to H2O 3.24.0.2
    • SW-1214 - Remove PySparkling six dependency as it was removed in H2O
    • SW-1215 - Automatically generate PySparkling README
    • SW-1217 - Automatically generate last pieces of doc subproject
    • SW-1219 - Remove suport for testing external cluster in manual mode
    • SW-1221 - Remove unnecessary branch check
    • SW-1222 - Remove duplicate readme file (contains old info & the correct info is in doc)
    • SW-1223 - Remove confusing meetup dir
    • SW-1224 - Upgrade to Gradle 5.3.1
    • SW-1228 - Rename the 'ignoredColumns' parameter of H2OAutoML to 'ignoredCols'
    • SW-1236 - Reformat few python classes
    • SW-1238 - Parametrize EMR version in templates generation
    • SW-1239 - Remove old README and DEVEL doc files (not just pointer to new doc)
    • SW-1240 - Use minSupportedJava for source and target compatibility in build.gradle

v2.3.27 (2019-04-03)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/27/index.html

  • Bug
    • SW-1162 - Exception when there is a column with BOOLEAN type in dataset during H2OMOJOModel transformation
    • SW-1177 - In Pysparkling script, setting –driver-class-path influences the environment
    • SW-1178 - Upgrade to h2O 3.24.0.1
    • SW-1180 - Use specific metrics in grid search, in the same way as H2O Grid
    • SW-1181 - Document off heap memory configuration for Spark in Standalone mode/IBM conductor
    • SW-1182 - Fix random project name generation in H2OAutoML Spark Wrapper
  • New Feature
    • SW-1167 - Expose search_criteria for H2OGridSearch
    • SW-1174 - expose H2OGridSearch models
    • SW-1183 - Add includeAlgos to H2o AutoML pipeline stage & ability to ignore XGBoost
  • Improvement
    • SW-1164 - Add Sparkling Water to Jupyter spark/pyspark kernels in EMR terraform template
    • SW-1171 - Upgrade build to Gradle 5.2.1
    • SW-1175 - Integrate with H2O native hive support

v2.3.26 (2019-03-15)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/26/index.html

  • Bug
    • SW-1163 - Expose missing variables in shared TF EMR SW tamplate
  • Improvement
    • SW-1145 - Start jupyter notebook with Scala & Python Spark in AWS EMR Terraform template
    • SW-1165 - Upgrade to H2O 3.22.1.6

v2.3.25 (2019-03-07)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/25/index.html

  • Bug
    • SW-1150 - hc.stop() shows 'exit' not defined error
    • SW-1152 - Fix RSparkling in case the jars are being fetched from maven
    • SW-1156 - H2OXgboost pipeline stage does not define updateH2OParams method
    • SW-1159 - Unique project name in automl to avoid sharing one leaderboard
    • SW-1161 - Fix grid search pipeline step on pyspark side
  • Improvement
    • SW-1052 - Document teraform scripts for AWS
    • SW-1089 - Document using Google Cloud Storage In Sparkling Water
    • SW-1135 - Speed up conversion between sparse spark vectors and h2o frames by using sparse new chunk
    • SW-1141 - Improve terraform templates for AWS EMR and make them part of the release process
    • SW-1147 - Integrate with Spark 2.3.3
    • SW-1149 - Allow login via ssh to created cluster using terraform
    • SW-1153 - Add H2OGridSearch pipeline stage to PySpark
    • SW-1155 - Test GBM Grid Search Scala pipeline step
    • SW-1158 - Generalize H2OGridSearch Pipeline step to support other available algos
    • SW-1160 - Upgrade to H2O 3.22.1.5

v2.3.24 (2019-02-18)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/24/index.html

  • Bug
    • SW-1136 - Fix bug affecting loading pipeline in python when stored in scala
    • SW-1138 - Fix several cases in spark vector -> h2o conversion
  • Improvement
    • SW-1134 - Add H2OGLM Wrapper to Sparkling Water
    • SW-1139 - Update mojo2 to 0.3.16
    • SW-1143 - Fix s3 bootstrap templates for nightly builds
    • SW-1144 - Upgrade to H2O 3.22.1.4

v2.3.23 (2019-01-29)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/23/index.html

  • Bug
    • SW-1133 - Upgrade to H2O 3.22.1.3

v2.3.22 (2019-01-21)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/22/index.html

  • Bug
    • SW-1129 - Fix support for unsupervised mojo models
  • Improvement
    • SW-1101 - Update code to work with latest jetty changes
    • SW-1127 - Upgrade H2O to 3.22.1.2

v2.3.21 (2019-01-17)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/21/index.html

  • Bug
    • SW-1116 - Cannot serialize DAI model
  • Improvement
    • SW-1113 - Update to H2O 3.22.0.5
    • SW-1115 - Enable tabs in the documentation based on the language
    • SW-1120 - Prepare Terraform scripts for Sparkling Water on EMR
    • SW-1121 - Use getTimestamp method instead of _timestamp directly

v2.3.20 (2019-01-08)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/20/index.html

  • Bug
    • SW-1107 - NullPointerException at water.H2ONode.openChan(H2ONode.java:417) after upgrade to H2O 3.22.0.3
    • SW-1110 - Fix test suite to test PySparkling YARN integration tests on external backend as well
  • Task
    • SW-1109 - Docs: Change copyright year in docs to include 2019
  • Improvement
    • SW-464 - Publish PySparkling as conda package
    • SW-1111 - Update H2O to 3.22.0.4

v2.3.19 (2018-12-27)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/19/index.html

  • Bug
    • SW-1084 - Documentation link does not work on the Nightly Bleeding Edge download page
    • SW-1100 - Fix Travis builds
    • SW-1102 - Fix Travis builds (test just scala unit tests)
  • Improvement
    • SW-464 - Publish PySparkling as conda package
    • SW-1080 - Fix deprecation warning regarding automl -> AutoML
    • SW-1090 - Upgrade shadowJar plugin
    • SW-1091 - Upgrade to Gradle 5.0
    • SW-1092 - Updates to streaming app
    • SW-1093 - Update to H2O 3.22.0.3
    • SW-1095 - Enable GCS in Sparkling Water
    • SW-1097 - Properly integrate GCS with Sparkling Water, including test in PySparkling
    • SW-1106 - Remove deprecated Gradle option in Gradle 5
  • Docs
    • SW-1083 - Add Installation and Starting instructions to the docs

v2.3.18 (2018-11-27)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/18/index.html

  • Improvement
    • SW-1078 - Upgrade H2O to 3.22.0.2

v2.3.17 (2018-10-27)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/17/index.html

  • Bug
    • SW-1071 - Fallback to original IP discovery in case we can't find the same network
    • SW-1072 - Fix handling time column for mojo pipeline
    • SW-1073 - Upgrade MOJO to 0.3.17
  • Improvement
    • SW-1045 - Upgrade H2O to 3.22.0.1

v2.3.16 (2018-10-17)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/16/index.html

  • Bug
    • SW-930 - Enable AutoML tests in Sparkling Water
    • SW-1065 - Fix isssue with empty queue name by default
    • SW-1066 - In PySparkling, don't reconnect if already connected
    • SW-1068 - Fix warning in doc
  • Improvement
    • SW-1057 - Sparkling shell ignores parameters after last updates
    • SW-1058 - Automatic detection of client ip in external backend
    • SW-1059 - Pysparkling in external backend, manual mode stops the backend cluster, but the cluster should be left intact
    • SW-1060 - Create nightly release for 2.1, 2.2 and 2.3
    • SW-1061 - Upgrade to Mojo 0.3.15
    • SW-1062 - Don't expose mojo internal types
    • SW-1063 - More explicit checks for valid values of Backend mode and external backend start mode
    • SW-1064 - Expose run_as_user for External H2O Backend
    • SW-1069 - Upgrade H2O to 3.20.0.10

v2.3.15 (2018-10-02)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/15/index.html

  • Bug
    • SW-1041 - Fix passing –jars to sparkling-shell
    • SW-1042 - More robust check for python package in PySparkling shell
    • SW-1048 - Add missing six dependency to setup.py for PySparkling
  • Improvement
    • SW-1043 - Mojo pipeline with multiple output columns (and also with dots in the names) does not work in SW
    • SW-1047 - Upgrade to Spark 2.3.2
    • SW-1049 - Upgrade pyspark dependency to 2.3.2
    • SW-1054 - Upgrade H2O dependency to 3.20.0.9

v2.3.14 (2018-09-24)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/14/index.html

  • New Feature
    • SW-1020 - Expose leaderboard on H2OAutoML
    • SW-1022 - Display Release creation date on the download page
  • Improvement
    • SW-1024 - remove call to ./gradlew –help in jenkins pipeline
    • SW-1025 - Ensure that release does not depend on build id
    • SW-1030 - [RSparkling] In case only path to SW jar file is specified, discover the version from JAR file instead of requiring it as parameter
    • SW-1031 - Enable installation ot RSparkling using devtools from Github repo
    • SW-1032 - Upgrade mojo pipeline to 0.13.2
    • SW-1033 - Document automatic certificate creation for Flow UI
    • SW-1034 - PySparkling fails if we specify https argument as part of getOrCreate()
    • SW-1035 - Document using s3a and s3n on Sparkling Water
    • SW-1036 - Upgrade to H2O 3.20.0.8
    • SW-1038 - The shell script bin/pysparkling should print missing dependencies
    • SW-1039 - Upgrade Gradle to 4.10.2
  • Docs
    • SW-1018 - Fix link to Installing RSparkling on Windows

v2.3.14 (2018-09-24)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/14/index.html

  • New Feature
    • SW-1020 - Expose leaderboard on H2OAutoML
    • SW-1022 - Display Release creation date on the download page
  • Improvement
    • SW-1024 - remove call to ./gradlew –help in jenkins pipeline
    • SW-1025 - Ensure that release does not depend on build id
    • SW-1030 - [RSparkling] In case only path to SW jar file is specified, discover the version from JAR file instead of requiring it as parameter
    • SW-1031 - Enable installation ot RSparkling using devtools from Github repo
    • SW-1032 - Upgrade mojo pipeline to 0.13.2
    • SW-1033 - Document automatic certificate creation for Flow UI
    • SW-1034 - PySparkling fails if we specify https argument as part of getOrCreate()
    • SW-1035 - Document using s3a and s3n on Sparkling Water
    • SW-1036 - Upgrade to H2O 3.20.0.8
    • SW-1038 - The shell script bin/pysparkling should print missing dependencies
    • SW-1039 - Upgrade Gradle to 4.10.2
  • Docs
    • SW-1018 - Fix link to Installing RSparkling on Windows

v2.3.13 (2018-09-14)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/13/index.html

  • New Feature
    • SW-1023 - Upgrade Gradle to 4.10.1
  • Improvement
    • SW-1019 - Upgrade H2O to 3.20.0.7
    • SW-1027 - Revert Upgrade to Gradle 4.10.1(bug in Gradle) and upgrade to Gradle 4.0
    • SW-1028 - Update docs and mention that ORC is supported
  • Docs
    • SW-1017 - Docs: Add Parquet to list of supported data formats

v2.3.12 (2018-08-28)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/12/index.html

  • Bug
    • SW-270 - Add test for RDD[TimeStamp] -> H2OFrame[Time] -> RDD[Timestamp] conversion
    • SW-319 - SVMModelTest is failing
    • SW-986 - Fix links on RSparkling Readme page
    • SW-996 - Fix typos in documentation
    • SW-997 - Fix javadoc on JavaH2OContext
    • SW-1000 - Setting context path in pysparkling fails to launch h2o
    • SW-1001 - RSparkling does not respect context path
    • SW-1002 - Automatically generate the keystore for H2O Flow ssl (self-signed certificates)
    • SW-1003 - When running in Local mode, we ignore some configuration
    • SW-1004 - Fix context path value checks
    • SW-1005 - Use correct scheme in sparkling water when ssl on flow is enabled
    • SW-1006 - Fix context path setting on RSparkling
    • SW-1015 - Add context path after value of spark.ext.h2o.client.flow.baseurl.override when specified
  • New Feature
    • SW-980 - Integrate XGBoost in Sparkling Water
    • SW-1012 - Sparkling water External Backend Support in kerberized cluster
  • Task
    • SW-988 - Add to docs that pysparkling has a new dependency pyspark
  • Improvement
    • SW-175 - JavaH2OContext#asRDD implementation is missing
    • SW-920 - Sparkling Water/RSparkling needs to declare additional repository
    • SW-989 - Improve Scala Doc API of the support classes
    • SW-991 - Update Gradle Spinx libraries - faster documentation builds
    • SW-992 - Create abstract class from creating parameters from Enum for Sparkling Water pipelines
    • SW-993 - [PySparkling] Fix Wrong H2O version detection on latest bundled H2Os
    • SW-994 - Add timeouts & retries for docker pull
    • SW-998 - Document using PySparkling on the edge node ( EMR)
    • SW-1007 - Upgrade H2O to 3.20.0.6
    • SW-1011 - Fix EMR bootstrap scripts
    • SW-1013 - Add option which can be used to change the flow address which is printed out after H2OConetext started
    • SW-1014 - Document how to run Sparkling Water on kerberized cluster

v2.3.11 (2018-08-09)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/11/index.html

  • Bug
    • SW-971 - Change maintainer of RSparkling to jakub@h2o.ai
    • SW-972 - Fix Content of RSparkling release table
    • SW-973 - Allow passing custom cars when running ./bin/sparkling/shell
    • SW-975 - Fix CRAN issues of Rsparkling
    • SW-981 - Fix wrong comparison of versions when detecing other h2o versions in PySparkling
    • SW-982 - Set up client_disconnect_timeout correctly in context on External backend, auto mode
    • SW-983 - Fix missing mojo impl artifact when running pysparkling tests in jenkins
  • Task
    • SW-633 - Add to doc that 100 columns are displayed in the preview data by default
  • Improvement
    • SW-528 - Update PySparkling Notebooks to work for Python 3
    • SW-548 - List nodes and driver memory in Spark UI - SParkling Water Tab
    • SW-910 - Use Mojo Pipeline API in Sparkling Water
    • SW-969 - Port documentation for mojo pipeline on Spark to SW repo
    • SW-970 - Upgrade Mojo 2 in SW to 0.11.0
    • SW-976 - Upgrade H2O to 3.20.0.5
    • SW-977 - Need ability to disable Flow UI for Sparkling-Water
    • SW-979 - Verify that we are running on correct Spark for PySparkling at init time
    • SW-984 - Cache also test and runtime dependencies in docker image
  • Docs
    • SW-946 - Add "How to" for using Sparkling Water on Google Cloud Dataproc

v2.3.10 (2018-08-01)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/10/index.html

  • Bug
    • SW-903 - Automate releases of RSparkling and create release pipeline for this release proccess
    • SW-911 - Add missing repository to the documentation
    • SW-944 - Fix Sphinx gradle plugin, the latest version does not work
    • SW-945 - Stabilize releasing to Nexus Repository
    • SW-953 - Do not stop external H2O backend in case of manual start mode
    • SW-958 - Fix RSparkling README style issues
    • SW-959 - Fix address for fetching H2O R package in nightly tests
    • SW-961 - Add option to ignore SPARK_PUBLIC_DNS
    • SW-962 - Add option which ensures that items in flatfile are translated to IP address
    • SW-967 - Deprecate old behaviour of mojo pipeline output in SW
  • Improvement
    • SW-233 - Warn if user's h2o in python env is different then the one bundled in pysparkling
    • SW-921 - Move Rsparkling to Sparkling Water repo
    • SW-941 - Upgrade Gradle to 4.9
    • SW-952 - Fix issues when stopping Sparkling Water (Scala) in yarn-cluster mode for external Backend
    • SW-957 - RSparkling should run tests in both, external and internal mode
    • SW-963 - Upgrade H2O to 3.20.0.4
    • SW-965 - Expose port offset in Sparkling Water
    • SW-968 - Remove confusing message about stopping H2OContext in PySparkling

v2.3.9 (2018-07-16)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/9/index.html

  • Bug
    • SW-898 - Issues with HTTP libraries on SPark 2.3
    • SW-902 - Upgrade Gradle to 4.8.1
    • SW-904 - Upgrade Mojo2 version to 0.10.7
    • SW-908 - Exclude Hadoop dependencies as they are provided by Spark
    • SW-909 - Fix issues when stopping Sparkling Water (Scala) in yarn-cluster mode
    • SW-925 - Fix missing aposthrope in documentation
    • SW-929 - Disable temporarily AutoML tests in Sparkling Water
  • New Feature
    • SW-826 - Implement Synchronous and Asynchronous Scala cell behaviour
  • Improvement
    • SW-846 - Don't parse types again when passing data to mojo pipeline
    • SW-886 - Several Scala cell improvements in H2O flow
    • SW-887 - Make sure that we can use schemes unsupported by H2O in H2O Confoguration
    • SW-889 - Port AWS preparation scripts into SW codebase
    • SW-894 - Add support for queuing of Scala cell jobs
    • SW-914 - Wrong Spark version in documentation
    • SW-917 - Dockerize Sparkling Water release pipeline
    • SW-919 - Clean gradle build with regards to mojo2
    • SW-922 - Upgrade H2O to 3.20.0.3
    • SW-928 - Expose AutoML max models
  • Docs
    • SW-878 - Add section for using Sparkling Water with AWS

v2.3.8 (2018-06-18)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/8/index.html

  • Improvement
    • SW-885 - Upgrade H2O to 3.20.0.2

v2.3.7 (2018-06-18)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/7/index.html

  • Bug
    • SW-858 - SVM throws unsupported operations exception on Spark 2.3
    • SW-861 - Upgrade Gradle to 4.8 (publishing plugin)
    • SW-872 - Fix reference to local-cluster on download page
    • SW-880 - Update Hadoop version on download page
    • SW-881 - Fix Script tests on Dockerized Jenkins infrastructure
    • SW-882 - Call h2oContext.stop after ham or spam Scala example
    • SW-883 - Add mising description in publish.gradle
  • Improvement
    • SW-860 - Modify the hadoop launch command on download page
    • SW-863 - Upgrade infrastructure and references to Spark 2.3.1
    • SW-873 - Upgrade H2O to 3.20.0.1
    • SW-874 - Update Mojo2 to 0.10.4
    • SW-876 - FIx local PySparkling integtest on jenkins infrastracture
    • SW-879 - Print output of script tests

v2.3.6 (2018-06-14)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/6/index.html

  • Bug
    • SW-712 - Test non-distributed sparkling water tests in docker
    • SW-850 - Expose methods to get input/output names in H2OMOJOPipelineModel
    • SW-859 - Print Warning when spark-home is defined on PATH
    • SW-862 - Create & fix test in PySparkling for named mojo columns
    • SW-864 - Fix & more readable test
    • SW-865 - Better Naming of the UDF method to obtain predictions
    • SW-869 - Add repository to build required by xgboost-predictor
  • Story
    • SW-856 - Upgrade Mojo2 to latest version
  • Improvement
    • SW-839 - Verify that Spark time column representation can be digested by Mojo2
    • SW-848 - Document Kerberos on Sparkling Water
    • SW-849 - Update use from maven on sparkling water download page
    • SW-851 - Make use of output types when creating Spark DataFrame out of mojo2 predicted values
    • SW-852 - Create spark UDF used to extract predicted values
    • SW-853 - Sparkling Water py should require pyspark dependency
    • SW-854 - Upgrade MojoPipeline to 0.10.0
    • SW-855 - Upgrade H2O to 3.18.0.11

v2.3.5 (2018-05-23)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/5/index.html

  • Bug
    • SW-842 - Enforce system level properties in SW
  • Improvement
    • SW-845 - Upgrade H2O to 3.18.0.10
    • SW-847 - Remove GA from Sparkling Water

v2.3.4 (2018-05-18)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/4/index.html

  • Bug
    • SW-836 - Add support for converting empty dataframe/RDD in Python and Scala to H2OFrame
    • SW-841 - Remove withCustomCommitsState in pipelines as it's now duplicating Github
    • SW-843 - Fix data obtaining for mojo pipeline
    • SW-844 - Upgrade Mojo pipeline to 0.9.9

v2.3.3 (2018-05-15)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/3/index.html

  • Bug
    • SW-817 - Enable running MOJO spark pipeline without H2O init
    • SW-825 - Local creation of Sparkling Water does not work anymore.
    • SW-831 - Check shape of H2O frame after the conversion from Spark frame
    • SW-834 - External Backend stored sparse vector values incorrectly
  • Improvement
    • SW-829 - Type checking in PySparkling pipelines
    • SW-832 - Small refactoring in identifiers
    • SW-833 - Explicitly set source and target java versions
    • SW-837 - Upgrade H2O to 3.18.0.9
    • SW-838 - Upgrade Mojo pipeline dependency to 0.9.8
    • SW-840 - Add test checking column names and types between spark and mojo2

v2.3.2 (2018-05-02)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/2/index.html

  • Bug
    • SW-574 - Process steam handle and use it for connection to external h2o cluster
    • SW-822 - Require correct colorama version
    • SW-823 - Fix Windows starting scripts
    • SW-824 - Fix NPE in mojo pipeline predictions
  • New Feature
    • SW-827 - Change color highlight in scala cell as it is too dark
  • Improvement
    • SW-815 - Upgrade H2O to 3.18.0.8
    • SW-816 - Update Mojo2 dependency to one which is compatible with Java7
    • SW-818 - Spark Pipeline imports do not work in PySparkling
    • SW-819 - Add ability to convert specific columns to categoricals in Sparkling Water pipelines
    • SW-820 - Sparkling Water pipelines add duplicate response column to the list of features

v2.3.1 (2018-04-19)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/1/index.html

  • Bug
    • SW-672 - Enable using sparkling water maven packages in databricks cloud
    • SW-787 - Documentation fixes
    • SW-788 - Fix Travis tests on Spark 2.3
    • SW-790 - Add missing seed argument to H2OAutoml pipeline step
    • SW-794 - Point to proper web-based docs
    • SW-796 - Use parquet provided by Spark
    • SW-797 - Automatically update redirect table as part of release pipeline
    • SW-806 - Fix exporting and importing of pipeline steps and mojo models to and from HDFS
  • Improvement
    • SW-772 - Integrate & Test Mojo Pipeline with Sparkling Water
    • SW-789 - Upgrade H2O to 3.18.0.7
    • SW-791 - Expose context_path in Sparkling Water
    • SW-793 - Create additional test verifying that the new light endpoint works as expected
    • SW-798 - Additional link to documentation
    • SW-800 - Remove references to Sparkling Water 2.0
    • SW-804 - Reduce time of H2OAutoml step in pipeline tests to 1 minute
    • SW-808 - Upgrade to Gradle 4.7

v2.3.0 (2018-03-29)

Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.3/0/index.html

  • Bug
    • SW-696 - Intermittent script test issue on external backend
    • SW-726 - Mark Spark dependencies as provided on artefacts published to maven
    • SW-740 - Increase timeout for conversion in pyunit test for external cluster
    • SW-760 - Fix doc artefact publication
    • SW-763 - Remove support for downloading H2O logs from Spark UI
    • SW-766 - Fix coding style issue
    • SW-769 - Fix import
    • SW-770 - Fix link to Spark 2.3 in travis tests
    • SW-776 - sparkling water from maven does not know the stacktrace_collector_interval option
    • SW-778 - Handle nulls properly in H2OMojoModel
    • SW-779 - As from Spark 2.3, use H2O ip address to show instead of spark's one
    • SW-783 - Make H2OAutoML pipeline tests deterministic by setting the seed
  • New Feature
    • SW-722 - [PySparkling] Check for correct data type as part of as_h2o_frame
  • Improvement
    • SW-733 - Parametrize pipeline scripts to be able to specify different algorithms
    • SW-746 - Log chunk layout after the conversion of data to external H2O cluster
    • SW-750 - Support for Spark 2.3.0
    • SW-755 - Document GBM Grid Search Pipeline Step
    • SW-765 - Remove test artefacts from the sparkling-water assembly
    • SW-768 - Add missing import
    • SW-771 - Travis edits - no longer need the workaround for JDK7
    • SW-773 - Don't use default value for output dir in external backend, it's not required
    • SW-780 - Upgrade H2O to 3.18.0.5
  • Docs
    • SW-775 - Fix link for documentation on DEVEL.md

v2.2.x (2017-08-17)

  • Sparkling Water 2.2 brings support of Spark 2.2.
  • For detailed changelog, please read rel-2.2/CHANGELOG.

v2.1.x (2017-03-02)

  • Sparkling Water 2.1 brings support of Spark 2.1.
  • For detailed changelog, please read rel-2.1/CHANGELOG.

v2.0.x (2016-09-26)

  • Sparkling Water 2.0 brings support of Spark 2.0.
  • For detailed changelog, please read rel-2.0/CHANGELOG.

v1.6.x (2016-03-15)

  • Sparkling Water 1.6 brings support of Spark 1.6.
  • For detailed changelog, please read rel-1.6/CHANGELOG.

v1.5.x (2015-09-28)

  • Sparkling Water 1.5 brings support of Spark 1.5.
  • For detailed changelog, please read rel-1.5/CHANGELOG.

v1.4.x (2015-07-06)

  • Sparkling Water 1.4 brings support of Spark 1.4.
  • For detailed changelog, please read rel-1.4/CHANGELOG.

v1.3.x (2015-05-25)

  • Sparkling Water 1.3 brings support of Spark 1.3.
  • For detailed changelog, please read rel-1.3/CHANGELOG.

v1.2.x (2015-05-18) and older

  • Sparkling Water 1.2 brings support of Spark 1.2.
  • For detailed changelog, please read rel-1.2/CHANGELOG.