Change Log¶
v2.2.19 (2018-06-18)¶
Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.2/19/index.html
- Improvement
- SW-885 - Upgrade H2O to 3.20.0.2
v2.2.18 (2018-06-18)¶
Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.2/18/index.html
- Bug
- SW-861 - Upgrade Gradle to 4.8 (publishing plugin)
- SW-872 - Fix reference to local-cluster on download page
- SW-880 - Update Hadoop version on download page
- SW-881 - Fix Script tests on Dockerized Jenkins infrastructure
- SW-882 - Call h2oContext.stop after ham or spam Scala example
- SW-883 - Add mising description in publish.gradle
- Improvement
v2.2.17 (2018-06-13)¶
Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.2/17/index.html
- Bug
- SW-712 - Test non-distributed sparkling water tests in docker
- SW-850 - Expose methods to get input/output names in H2OMOJOPipelineModel
- SW-859 - Print Warning when spark-home is defined on PATH
- SW-862 - Create & fix test in PySparkling for named mojo columns
- SW-864 - Fix & more readable test
- SW-865 - Better Naming of the UDF method to obtain predictions
- SW-869 - Add repository to build required by xgboost-predictor
- Story
- SW-856 - Upgrade Mojo2 to latest version
- Improvement
- SW-839 - Verify that Spark time column representation can be digested by Mojo2
- SW-848 - Document Kerberos on Sparkling Water
- SW-849 - Update use from maven on sparkling water download page
- SW-851 - Make use of output types when creating Spark DataFrame out of mojo2 predicted values
- SW-852 - Create spark UDF used to extract predicted values
- SW-853 - Sparkling Water py should require pyspark dependency
- SW-854 - Upgrade MojoPipeline to 0.10.0
- SW-855 - Upgrade H2O to 3.18.0.11
v2.2.16 (2018-05-23)¶
Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.2/16/index.html
v2.2.15 (2018-05-18)¶
Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.2/15/index.html
v2.2.14 (2018-05-15)¶
Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.2/14/index.html
- Bug
- Improvement
- SW-829 - Type checking in PySparkling pipelines
- SW-832 - Small refactoring in identifiers
- SW-833 - Explicitly set source and target java versions
- SW-837 - Upgrade H2O to 3.18.0.9
- SW-838 - Upgrade Mojo pipeline dependency to 0.9.8
- SW-840 - Add test checking column names and types between spark and mojo2
v2.2.13 (2018-05-02)¶
Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.2/13/index.html
- Bug
- New Feature
- SW-827 - Change color highlight in scala cell as it is too dark
- Improvement
- SW-815 - Upgrade H2O to 3.18.0.8
- SW-816 - Update Mojo2 dependency to one which is compatible with Java7
- SW-818 - Spark Pipeline imports do not work in PySparkling
- SW-819 - Add ability to convert specific columns to categoricals in Sparkling Water pipelines
- SW-820 - Sparkling Water pipelines add duplicate response column to the list of features
v2.2.12 (2018-04-19)¶
Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.2/12/index.html
- Bug
- SW-672 - Enable using sparkling water maven packages in databricks cloud
- SW-787 - Documentation fixes
- SW-790 - Add missing seed argument to H2OAutoml pipeline step
- SW-794 - Point to proper web-based docs
- SW-796 - Use parquet provided by Spark
- SW-797 - Automatically update redirect table as part of release pipeline
- SW-806 - Fix exporting and importing of pipeline steps and mojo models to and from HDFS
- Improvement
- SW-772 - Integrate & Test Mojo Pipeline with Sparkling Water
- SW-789 - Upgrade H2O to 3.18.0.7
- SW-791 - Expose context_path in Sparkling Water
- SW-793 - Create additional test verifying that the new light endpoint works as expected
- SW-798 - Additional link to documentation
- SW-800 - Remove references to Sparkling Water 2.0
- SW-804 - Reduce time of H2OAutoml step in pipeline tests to 1 minute
- SW-808 - Upgrade to Gradle 4.7
v2.2.11 (2018-03-29)¶
Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.2/11/index.html
- Bug
- SW-696 - Intermittent script test issue on external backend
- SW-726 - Mark Spark dependencies as provided on artefacts published to maven
- SW-740 - Increase timeout for conversion in pyunit test for external cluster
- SW-760 - Fix doc artefact publication
- SW-763 - Remove support for downloading H2O logs from Spark UI
- SW-766 - Fix coding style issue
- SW-769 - Fix import
- SW-776 - sparkling water from maven does not know the stacktrace_collector_interval option
- SW-778 - Handle nulls properly in H2OMojoModel
- SW-783 - Make H2OAutoML pipeline tests deterministic by setting the seed
- New Feature
- SW-722 - [PySparkling] Check for correct data type as part of as_h2o_frame
- Improvement
- SW-733 - Parametrize pipeline scripts to be able to specify different algorithms
- SW-746 - Log chunk layout after the conversion of data to external H2O cluster
- SW-755 - Document GBM Grid Search Pipeline Step
- SW-765 - Remove test artefacts from the sparkling-water assembly
- SW-768 - Add missing import
- SW-771 - Travis edits - no longer need the workaround for JDK7
- SW-773 - Don't use default value for output dir in external backend, it's not required
- SW-780 - Upgrade H2O to 3.18.0.5
- Docs
- SW-775 - Fix link for documentation on DEVEL.md
v2.2.10 (2018-03-08)¶
Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.2/10/index.html
- Bug
- SW-739 - Sparkling Water Doc artefact is still missing Scala version
- SW-742 - Fix setting up node network mask on external cluster
- SW-743 - Allow to set LDAP and different security options in external backend as well
- SW-747 - Fix bug in documentation for manual mode of external backend
- SW-757 - Fix tests after enabling the stack-trace collection
- Improvement
- SW-744 - Document how to use Sparkling Water with LDAP in Sparkling Water docs
- SW-745 - Expose Grid search as Spark pipeline step in the Scala API
- SW-748 - Upgrade to Gradle 4.6
- SW-752 - Collect stack traces on each h2o node as part of log collecting extension
- SW-754 - Upgrade H2O to 3.18.0.3
- SW-756 - Upgrade H2O to 3.18.0.4
- Docs
- SW-753 - Add "How to" for changing the default H2O port
v2.2.9 (2018-02-26)¶
Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.2/9/index.html
- Bug
- SW-723 - Sparkling water doc artefact is missing scala version
- SW-727 - Improve method for downloading H2O logs
- SW-728 - Use new light endpoint introduced in 3.18.0.1
- SW-734 - Make sure we use the unique key names in split method
- SW-736 - Document how to download logs on Databricks cluster
- SW-737 - Expose downloadH2OLogs on H2OContext in PySparkling
- SW-738 - Move spark.ext.h2o.node.network.mask setter to SharedArguments
- Improvement
v2.2.8 (2018-02-14)¶
Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.2/8/index.html
- Technical task
- SW-652 - Deliver SW documentation in HTML output
- Bug
- SW-685 - Fix Typo in documentation
- SW-695 - Make printHadoopDistributions gradle task available again for testing
- SW-701 - Kill the client when one of the h2o nodes went OOM in external mode
- SW-706 - Fix pysparkling.ml import for non-interactive sessions
- SW-707 - parquet import fails on HDP with Spark 2.0 (azure hdi cluster)
- SW-708 - Make sure H2OMojoModel does not required H2OContext to be initialized
- SW-709 - Fix mojo predictions tests
- SW-710 - In PySparkling pipelines, ensure that if users pass integer to double type we handle that correctly for all possible double values
- SW-713 - Write a simple test for parquet import in Sparkling Water
- SW-714 - Add option to H2OModel pipeline step allowing us to convert unknown categoricals to NAs
- SW-715 - Fix driverif configuration on the external backend
- Improvement
- SW-606 - Verify & Document run of RSparkling on top of Databricks Azure cluster
- SW-678 - Document how to change log location
- SW-683 - H2OContext can't be initalized on Databricks cloud
- SW-686 - Fix typo in documentation
- SW-687 - Upgrade Gradle to 4.5
- SW-688 - Update docs - SparklyR supports Spark 2.2.1 in the latest release
- SW-690 - Log Sparkling Water version during startup of Sparkling Water
- SW-693 - Allow to set driverIf on external H2O backend
- SW-694 - Fix creation of Extended JAR in gradle task
- SW-700 - Report Yarn App ID of spark application in H2OContext
- SW-703 - Upload generated sphinx documentation to S3
- SW-704 - Update links on the download page to point to the new docs
- SW-705 - Increase memory for JUNIT tests
- SW-718 - Upgrade to Gradle 4.5.1
- SW-719 - Upgrade to H2O 3.18.0.1
- SW-720 - Fix parquet import test on external backend
- Docs
v2.2.7 (2018-01-18)¶
Download at: http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.2/7/index.html
- Bug
- SW-273 - Remove workaround introduced by SW-272 for yarn/cluster mode
- SW-551 - Remove hotfix introduced by [SW-541] and implement proper fix
- SW-661 - Use always correct Spark version on the R download page
- SW-662 - Remove extra files that got into the repo
- SW-666 - Kill the cluster when a new executors joins in the internal backend
- SW-668 - Generate download link as part of the release notes
- SW-669 - Remove mentions of local-cluster in public docs
- SW-670 - Deprecated call in H2OContextInitDemo
- SW-671 - Fix jenkinsfile for builds again specific h2o branches
- Improvement
v2.2.6 (2018-01-03)¶
- Bug
- SW-627 - [PySparkling] calling as_spark_frame for the second time results in exception
- SW-630 - Fix ham or spam flow to reflect latest changes in pipelines
- SW-631 - Ensure that we do not access RDDs in pipelines ( to unblock the deployment)
- SW-645 - Fix H2OInterpreter on Scala 2.10
- SW-646 - Fix incosistencies in ham or spam examples between scala and python
- SW-647 - PySparkling shell is failing in Spark 2.2.0
- SW-648 - Fix ham or spam pipeline tests
- SW-649 - Fix ham or spam tests for deeplearning pipeline
- Improvement
- SW-608 - Measure time of conversions to H2OFrame in debug mode
- SW-612 - Port all arguments available to Scala ML to PySparkling ML
- SW-617 - Support for exporting mojo to hdfs
- SW-632 - Dump full spark configuration during H2OContext.getOrCreate into DEBUG
- SW-634 - Integrate with Spark 2.2.1
- SW-635 - Fix wrong instruction at PySparkling download page
- SW-637 - Create new DataFrame with new schema when it actually contain any dot in names
- SW-638 - Port release script into the sw repo
- SW-639 - Use persist layer for exportPOJOModel
- SW-640 - export H2OMOJOMOdel.createFromMOJO to pysparkling
- SW-642 - Create test for mojo predictions in PySparkling
- SW-643 - Add tests for H2ODeeplearning in Scala and Python and Fix potential problems
- SW-644 - Log spark configuration to INFO level
- SW-650 - Upgrade Gradle to 4.4.1
- SW-656 - Upgrade ShadowJar to 2.0.2
v2.2.5 (2017-12-11)¶
- Bug
- SW-615 - pysparkling.__version__ returns incorrectly ‘SUBST_PROJECT_VERSION’
- SW-616 - PySparkling fails on python 3.6 because long time does not exist in python 3.6
- SW-621 - PySParkling failing on Python3.6
- SW-624 - Python build does not support H2O_PYTHON_WHEEL when building against h2o older then 3.16.0.1
- SW-628 - PySparkling fails when installed from pypi
- Improvement
- SW-626 - Upgrade Gradle to 4.4
v2.2.4 (2017-12-01)¶
v2.2.3 (2017-11-25)¶
- Bug
- SW-320 - H2OConfTest Python test blocks test run
- SW-499 - BinaryType handling is not implemented in SparkDataFrameConverter
- SW-535 - asH2OFrame gives error if column names have DOT in it
- SW-547 - Don’t use md5skip in external mode
- SW-569 - pysparkling: h2o on exit does not shut down cleanly
- SW-572 - Additional fix for [SW-571]
- SW-573 - Minor Gradle build improvements and fixes
- SW-575 - Incorrect comment in hamOrSpamMojo pipeline
- SW-576 - Cleanup pysparkling test infrastructure
- SW-577 - Fix conditions in jenkins file
- SW-580 - Fix composite build in Jenkins
- SW-581 - Fix H2OConf test on external cluster
- SW-582 - Opening Chicago Crime Demo Notebook errors on the first opening
- SW-584 - Create extended directory automatically
- SW-588 - Fix links in README
- SW-589 - Wrap stages in try finally in jenkins file
- SW-592 - Properly pass all parameters to algorithm
- SW-593 - H2Conf cannot be initialized on windows
- SW-594 - Gradle ml submodule reports success even though tests fail
- SW-595 - Fix ML tests
- New Feature
- SW-519 - Introduce SW Models into Spark python pipelines
- Task
- SW-609 - Upgrade H2O dependency to 3.16.0.1
- Improvement
- SW-318 - Keep H2O version inside sparklin-water-core.jar and provide utility to query it
- SW-420 - Shell scripts miss-leading error message
- SW-504 - Provides Sparkling Water Spark Uber package which can be used in –packages
- SW-570 - Stop previous jobs in jenkins in case of PR
- SW-571 - In PySparkling, getOrCreate(spark) still incorrectly complains that we should use spark session
- SW-583 - Upgrade to Gradle 4.3
- SW-585 - Add the custom commit status for internal and external pipelines
- SW-586 - [ML] Remove some duplicities, enable mojo for deep learning
- SW-590 - Replace deprecated method call in ChicagoCrime python example
- SW-591 - Repl doesn’t require H2O dependencies to compile
- SW-596 - Minor build improvements
- SW-603 - Upgrade Gradle to 4.3.1
- SW-605 - addFiles doesn’t accept sparkSession
- SW-610 - Change default client mode to INFO, let user to change it at runtime
v2.2.2 (2017-10-23)¶
- Bug
- SW-555 - Fix documentation issue in PySparkling
- SW-558 - Increase default value for client connection retry timeout in
- SW-560 - SW documentation for nthreads is inconsistent with code
- SW-561 - Fix reporting artefacts in Jenkins and remove use of h2o-3-shared-lib
- SW-564 - Clean test workspace in jenkins
- SW-565 - Fix creation of extended jar in jenkins
- SW-567 - Fix failing tests on external backend
- SW-568 - Remove obsolete and failing idea configuration
- SW-559 - GLM fails to build model when weights are specified
- Improvement
- SW-557 - Create 2 jenkins files ( for internal and external backend ) backed by configurable pipeline
- SW-562 - Disable web on external H2O nodes in external cluster mode
- SW-563 - In external cluster mode, print also YARN job ID of the external cluster once context is available
- SW-566 - Upgrade H2O to 3.14.0.7
- SW-553 - Improve handling of sparse vectors in internal cluster
v2.2.1 (2017-10-10)¶
- Bug
- SW-423 - Tests of External Cluster mode fails
- SW-516 - External cluster improperly convert RDD[ml.linalg.Vector]
- SW-524 - Ensure Jenkins uses Java 8 for Sparkling Water 2.2.x
- SW-525 - Don’t use GPU nodes for sparkling water testing in Jenkins
- SW-526 - Add missing when clause to scripts test stage in Jenkinsfile
- SW-527 - Use dX cluster for Jenkins testing
- SW-529 - Code defect in Scala example
- SW-531 - Use code which is compatible between Scala 2.10 and 2.11
- SW-532 - Make auto mode in external cluster default for tests in jenkins
- SW-534 - Ensure that all tests run on both, internal and external backends
- SW-536 - Allow to test sparkling water against specific h2o branch
- SW-537 - Update Gradle to 4.2RC2
- SW-538 - Fix problem in Jenkinsfile where H2O_HOME has higher priority then H2O_PYTHON_WHEEL
- SW-539 - Fix PySparkling issue when running multiple times on the same node
- SW-541 - Model training hangs in SW
- SW-542 - Sparkling Water does not support parquet import
- SW-552 - Fix documentation bug
- New Feature
- Improvement
v2.2.0 (2017-08-23)¶
- Bug
- New Feature
- Improvement
v2.1.x (2017-03-02)¶
- Sparkling Water 2.1 brings support of Spark 2.1.
- For detailed changelog, please read rel-2.1/CHANGELOG.
v2.0.x (2016-09-26)¶
- Sparkling Water 2.0 brings support of Spark 2.0.
- For detailed changelog, please read rel-2.0/CHANGELOG.
v1.6.x (2016-03-15)¶
- Sparkling Water 1.6 brings support of Spark 1.6.
- For detailed changelog, please read rel-1.6/CHANGELOG.
v1.5.x (2015-09-28)¶
- Sparkling Water 1.5 brings support of Spark 1.5.
- For detailed changelog, please read rel-1.5/CHANGELOG.
v1.4.x (2015-07-06)¶
- Sparkling Water 1.4 brings support of Spark 1.4.
- For detailed changelog, please read rel-1.4/CHANGELOG.
v1.3.x (2015-05-25)¶
- Sparkling Water 1.3 brings support of Spark 1.3.
- For detailed changelog, please read rel-1.3/CHANGELOG.
v1.2.x (2015-05-18) and older¶
- Sparkling Water 1.2 brings support of Spark 1.2.
- For detailed changelog, please read rel-1.2/CHANGELOG.