Parameters of H2OWord2Vec¶
Affected Class¶
ai.h2o.sparkling.ml.features.H2OWord2Vec
Parameters¶
Each parameter has also a corresponding getter and setter method. (E.g.:
label
->getLabel()
,setLabel(...)
)
- epochs
Number of training iterations to run.
Default value:
5
- exportCheckpointsDir
Automatically export generated models to this directory.
Scala default value:
null
; Python default value:None
- initLearningRate
Set the starting learning rate.
Scala default value:
0.025f
; Python default value:0.025
- inputCol
input column name
Default value:
"No default value"
- maxRuntimeSecs
Maximum allowed runtime in seconds for model training. Use 0 to disable.
Default value:
0.0
- minWordFreq
This will discard words that appear less than <int> times.
Default value:
5
- modelId
Destination id for this model; auto-generated if not specified.
Scala default value:
null
; Python default value:None
- normModel
Use Hierarchical Softmax. Possible values are
"HSM"
.Default value:
"HSM"
- outputCol
output column name
Default value:
"H2OWord2Vec_output"
- sentSampleRate
- Set threshold for occurrence of words. Those that appear with higher frequency in the training data
will be randomly down-sampled; useful range is (0, 1e-5).
Scala default value:
0.001f
; Python default value:0.001
- vecSize
Set size of word vectors.
Default value:
100
- windowSize
Set max skip length between words.
Default value:
5
- wordModel
The word model to use (SkipGram or CBOW). Possible values are
"SkipGram"
,"CBOW"
.Default value:
"SkipGram"