public static class VecUtils.DomainDedupe extends MRTask<VecUtils.DomainDedupe>
Produces a Vec with deduplicated domains from a categorical Vec.
Categoricals may have the same values after munging, and should have the same domain index in the numerical chunk
representation. Unify categoricals that are the same by remapping their domain indices.
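The remapping described above can be illustrated without any H2O types. The following is a minimal plain-Java sketch, not the H2O implementation: the class `DomainRemapSketch` and its `remap` method are hypothetical names, and the per-row codes are modelled as a simple `int[]` instead of a chunk. It also assumes that codes missing from the map are left unchanged.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.Map;

// Plain-Java illustration only; none of these names come from the H2O API.
final class DomainRemapSketch {

    // Rewrites each categorical code through the old-to-new index map.
    // Codes absent from the map are left unchanged (an assumption of this sketch).
    static int[] remap(int[] codes, Map<Integer, Integer> oldToNew) {
        int[] out = new int[codes.length];
        for (int i = 0; i < codes.length; i++) {
            out[i] = oldToNew.getOrDefault(codes[i], codes[i]);
        }
        return out;
    }

    public static void main(String[] args) {
        // Suppose munging left the domain as {"a", "a", "b"}:
        // old indices 0 and 1 now name the same level.
        Map<Integer, Integer> oldToNew = new HashMap<>();
        oldToNew.put(0, 0);
        oldToNew.put(1, 0);
        oldToNew.put(2, 1);

        int[] codes = {0, 1, 2, 1};
        // prints [0, 0, 1, 0]
        System.out.println(Arrays.toString(remap(codes, oldToNew)));
    }
}
```

After the remap, every row that referred to either copy of a duplicated level carries the same index, which is the property the description asks for.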
The implementation could be more efficient by copying the Vec and replacing domain indices only as needed (PUBDEV-2587).
MRTask.PostMapAction<T extends MRTask.PostMapAction<T>>
DTask.DKeyTask<T extends DTask.DKeyTask,V extends Keyed>, DTask.RemoveCall
Constructor and Description |
---|
DomainDedupe(java.util.HashMap<java.lang.Integer,java.lang.Integer> oldToNewDomainIndex) |
Modifier and Type | Method and Description |
---|---|
static Vec | domainDeduper(Vec vec, java.util.HashMap<java.lang.String,java.util.ArrayList<java.lang.Integer>> substringToOldDomainIndices) |
void | map(Chunk c, NewChunk nc) |
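The two parameter types above suggest a relationship: a map from each distinct value to the old domain indices that share it can be turned into the old-to-new index map the constructor accepts. The sketch below shows one way this could be done in plain Java. It is an assumption about the intent, not the library's code: `DedupeMapSketch`, `buildOldToNew`, and `newDomain` are hypothetical names, and numbering the new indices in sorted order of the values is a choice made for this example.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.Map;
import java.util.TreeMap;

// Plain-Java illustration only; none of these names come from the H2O API.
final class DedupeMapSketch {

    // Assigns one new index per distinct value (in sorted order, an assumption)
    // and points every old index that shares the value at that new index.
    static HashMap<Integer, Integer> buildOldToNew(
            HashMap<String, ArrayList<Integer>> valueToOldIndices) {
        TreeMap<String, ArrayList<Integer>> sorted = new TreeMap<>(valueToOldIndices);
        HashMap<Integer, Integer> oldToNew = new HashMap<>();
        int newIndex = 0;
        for (Map.Entry<String, ArrayList<Integer>> e : sorted.entrySet()) {
            for (int oldIndex : e.getValue()) {
                oldToNew.put(oldIndex, newIndex);
            }
            newIndex++;
        }
        return oldToNew;
    }

    // The deduplicated domain: the distinct values in the same sorted order.
    static String[] newDomain(HashMap<String, ArrayList<Integer>> valueToOldIndices) {
        TreeMap<String, ArrayList<Integer>> sorted = new TreeMap<>(valueToOldIndices);
        return sorted.keySet().toArray(new String[0]);
    }

    public static void main(String[] args) {
        // Old domain {"cat", "dog", "cat"}: indices 0 and 2 are duplicates.
        HashMap<String, ArrayList<Integer>> groups = new HashMap<>();
        groups.put("cat", new ArrayList<>(Arrays.asList(0, 2)));
        groups.put("dog", new ArrayList<>(Arrays.asList(1)));

        System.out.println(buildOldToNew(groups));
        // prints [cat, dog]
        System.out.println(Arrays.toString(newDomain(groups)));
    }
}
```

Sorting the values keeps the new domain deterministic regardless of `HashMap` iteration order, so the same input always yields the same index assignment.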
appendables, asyncExecOnAllNodes, block, closeLocal, compute2, dfork, dfork, dfork, dfork, dfork, dinvoke, doAll, doAll, doAll, doAll, doAll, doAll, doAll, doAll, doAll, doAll, doAll, doAll, doAllNodes, getResult, getResult, isReleasable, map, map, map, map, map, map, map, map, map, map, map, modifiesVolatileVecs, onCompletion, onExceptionalCompletion, outputFrame, outputFrame, outputFrame, postGlobal, profile, profString, reduce, self, setupLocal, withPostMapAction
copyOver, getDException, hasException, logVerbose, onAck, onAckAck, setException
asBytes, clone, compute, compute1, currThrPriority, frozenType, icer, priority, read, readJSON, reloadFromBytes, write, writeJSON
__tryComplete, addToPendingCount, compareAndSetPendingCount, complete, exec, getCompleter, getPendingCount, getRawResult, setCompleter, setPendingCount, setRawResult, tryComplete
adapt, adapt, adapt, cancel, compareAndSetForkJoinTaskTag, completeExceptionally, fork, get, get, get, getException, getForkJoinTaskTag, getPool, getQueuedTaskCount, getSurplusQueuedTaskCount, helpQuiesce, inForkJoinPool, invoke, invokeAll, invokeAll, invokeAll, isCancelled, isCompletedAbnormally, isCompletedNormally, isDone, join, peekNextLocalTask, pollNextLocalTask, pollTask, quietlyComplete, quietlyInvoke, quietlyJoin, reinitialize, setForkJoinTaskTag, tryUnfork