.. _parameters_H2OExtendedIsolationForest:

Parameters of H2OExtendedIsolationForest
----------------------------------------

Affected Class
##############

- ``ai.h2o.sparkling.ml.algos.H2OExtendedIsolationForest``

Parameters
##########

- *Each parameter has also a corresponding getter and setter method.*
*(E.g.:* ``label`` *->* ``getLabel()`` *,* ``setLabel(...)`` *)*

ignoredCols
Names of columns to ignore for training.

*Scala default value:* ``null`` *; Python default value:* ``None``

*Also available on the trained model.*

categoricalEncoding
Encoding scheme for categorical features. Possible values are ``"AUTO"``, ``"OneHotInternal"``, ``"OneHotExplicit"``, ``"Enum"``, ``"Binary"``, ``"Eigen"``, ``"LabelEncoder"``, ``"SortByResponse"``, ``"EnumLimited"``.

*Default value:* ``"AUTO"``

*Also available on the trained model.*

columnsToCategorical
List of columns to convert to categorical before modelling

*Scala default value:* ``Array()`` *; Python default value:* ``[]``

convertInvalidNumbersToNa
If set to 'true', the model converts invalid numbers to NA during making predictions.

*Scala default value:* ``false`` *; Python default value:* ``False``

*Also available on the trained model.*

convertUnknownCategoricalLevelsToNa
If set to 'true', the model converts unknown categorical levels to NA during making predictions.

*Scala default value:* ``false`` *; Python default value:* ``False``

*Also available on the trained model.*

dataFrameSerializer
A full name of a serializer used for serialization and deserialization of Spark DataFrames to a JSON value within NullableDataFrameParam.

*Default value:* ``"ai.h2o.sparkling.utils.JSONDataFrameSerializer"``

*Also available on the trained model.*

detailedPredictionCol
Column containing additional prediction details, its content depends on the model type.

*Default value:* ``"detailed_prediction"``

*Also available on the trained model.*

disableTrainingMetrics
Disable calculating training metrics (expensive on large datasets).

*Scala default value:* ``true`` *; Python default value:* ``True``

*Also available on the trained model.*

extensionLevel
Maximum is N - 1 (N = numCols). Minimum is 0. Extended Isolation Forest with extension_Level = 0 behaves like Isolation Forest.

*Default value:* ``0``

*Also available on the trained model.*

featuresCols
Name of feature columns

*Scala default value:* ``Array()`` *; Python default value:* ``[]``

*Also available on the trained model.*

ignoreConstCols
Ignore constant columns.

*Scala default value:* ``true`` *; Python default value:* ``True``

*Also available on the trained model.*

keepBinaryModels
If set to true, all binary models created during execution of the ``fit`` method will be kept in DKV of H2O-3 cluster.

*Scala default value:* ``false`` *; Python default value:* ``False``

modelId
Destination id for this model; auto-generated if not specified.

*Scala default value:* ``null`` *; Python default value:* ``None``

ntrees
Number of Extended Isolation Forest trees.

*Default value:* ``100``

*Also available on the trained model.*

predictionCol
Prediction column name

*Default value:* ``"prediction"``

*Also available on the trained model.*

sampleSize
Number of randomly sampled observations used to train each Extended Isolation Forest tree.

*Default value:* ``256``

*Also available on the trained model.*

scoreEachIteration
Whether to score during each iteration of model training.

*Scala default value:* ``false`` *; Python default value:* ``False``

*Also available on the trained model.*

scoreTreeInterval
Score the model after every so many trees. Disabled if set to 0.

*Default value:* ``0``

*Also available on the trained model.*

seed
Seed for pseudo random number generator (if applicable).

*Scala default value:* ``-1L`` *; Python default value:* ``-1``

*Also available on the trained model.*

splitRatio
Accepts values in range [0, 1.0] which determine how large part of dataset is used for training and for validation. For example, 0.8 -> 80% training 20% validation. This parameter is ignored when validationDataFrame is set.

*Default value:* ``1.0``

validationDataFrame
A data frame dedicated for a validation of the trained model. If the parameters is not set,a validation frame created via the 'splitRatio' parameter. The parameter is not serializable!

*Scala default value:* ``null`` *; Python default value:* ``None``

withContributions
Enables or disables generating a sub-column of detailedPredictionCol containing Shapley values of original features.

*Scala default value:* ``false`` *; Python default value:* ``False``

*Also available on the trained model.*

withLeafNodeAssignments
Enables or disables computation of leaf node assignments.

*Scala default value:* ``false`` *; Python default value:* ``False``

*Also available on the trained model.*

withStageResults
Enables or disables computation of stage results.

*Scala default value:* ``false`` *; Python default value:* ``False``

*Also available on the trained model.*