public class VecUtils
extends java.lang.Object
Modifier and Type | Class and Description |
---|---|
static class |
VecUtils.CollectDomainFast
|
static class |
VecUtils.CollectDoubleDomain |
static class |
VecUtils.CollectIntegerDomain
|
static class |
VecUtils.DomainDedupe
|
static class |
VecUtils.MeanResponsePerLevelTask
Compute the mean (weighted) response per categorical level
Skip NA values (those are already a separate bucket in the tree building histograms, for which this is designed)
|
static class |
VecUtils.ReorderTask
Reorder an integer (such as Enum storage) Vec using an int -> int mapping
|
Constructor and Description |
---|
VecUtils() |
Modifier and Type | Method and Description |
---|---|
static Vec |
categoricalDomainsToNumeric(Vec src)
|
static Vec |
categoricalToInt(Vec src)
|
static Vec |
categoricalToStringVec(Vec src)
|
static void |
deleteVecs(Vec[] vs,
int cnt) |
static int[] |
getLocalChunkIds(Vec v) |
static Vec |
numericToCategorical(Vec src)
|
static Vec |
numericToStringVec(Vec src)
|
static Vec |
stringToCategorical(Vec vec)
|
static Vec |
stringToNumeric(Vec src)
|
static Vec |
toCategoricalVec(Vec src)
|
static Vec |
toNumericVec(Vec src)
|
static Vec |
toStringVec(Vec src)
|
static Vec |
UUIDToStringVec(Vec src)
|
public static Vec toCategoricalVec(Vec src)
Vec
of categorical values from an existing Vec
.
This method accepts all Vec
types as input. The original Vec is not mutated.
If src is a categorical Vec
, a copy is returned.
If src is a numeric Vec
, the values are converted to strings used as domain
values.
For all other types, an exception is currently thrown. These need to be replaced
with appropriate conversions.
Throws H2OIllegalArgumentException() if the resulting domain exceeds
Categorical.MAX_CATEGORICAL_COUNT.public static Vec toNumericVec(Vec src)
Vec
of numeric values from an existing Vec
.
This method accepts all Vec
types as input. The original Vec is not mutated.
If src is a categorical Vec
, a copy is returned.
If src is a string Vec
, all values that can be are parsed into reals or integers, and all
others become NA. See stringToNumeric for parsing details.
If src is a numeric Vec
, a copy is made.
If src is a time Vec
, the milliseconds since the epoch are used to populate the new Vec.
If src is a UUID Vec
, the existing numeric storage is used to populate the new Vec.
Throws H2OIllegalArgumentException() if the resulting domain exceeds
Categorical.MAX_CATEGORICAL_COUNT.public static Vec categoricalToInt(Vec src)
Vec
of numeric values from a categorical Vec
.
If the first value in the domain of the src Vec is a stringified ints,
then it will use those ints. Otherwise, it will use the raw enumeration level mapping.
If the domain is stringified ints, then all of the domain must be able to be parsed as
an int. If it cannot be parsed as such, a NumberFormatException will be caught and
rethrown as an H2OIllegalArgumentException that declares the illegal domain value.
Otherwise, the this pointer is copied to a new Vec whose domain is null.
The magic of this method should be eliminated. It should just use enumeration level
maps. If the user wants domains to be used, call categoricalDomainsToNumeric().
PUBDEV-2209public static Vec toStringVec(Vec src)
Vec
of string values from an existing Vec
.
This method accepts all Vec
types as input. The original Vec is not mutated.
If src is a string Vec
, a copy of the Vec
is made.
If src is a categorical Vec
, levels are dropped, and the Vec
only records the string.
For all numeric Vec
s, the number is converted to a string.
For all UUID Vec
s, the hex representation is stored as a string.public static void deleteVecs(Vec[] vs, int cnt)
public static int[] getLocalChunkIds(Vec v)