Get a parse setup back for the staged data.

h2o.parseSetup(data, pattern = "", destination_frame = "",
  header = NA, sep = "", col.names = NULL, col.types = NULL,
  na.strings = NULL, parse_type = NULL, chunk_size = NULL,
  decrypt_tool = NULL, skipped_columns = NULL)

Arguments

data

An H2OFrame object to be parsed.

pattern

(Optional) Character string containing a regular expression to match file(s) in the folder.

destination_frame

(Optional) The hex key assigned to the parsed file.

header

(Optional) A logical value indicating whether the first row is the column header. If missing, H2O will automatically try to detect the presence of a header.

sep

(Optional) The field separator character. Values on each line of the file are separated by this character. If sep = "", the parser will automatically detect the separator.

col.names

(Optional) An H2OFrame object containing a single delimited line with the column names for the file. If skipped_columns are specified, only list column names of columns that are not skipped.

col.types

(Optional) A vector specifying the types to attempt to force over columns. If skipped_columns are specified, only list column types of columns that are not skipped.

na.strings

(Optional) H2O will interpret these strings as missing.

parse_type

(Optional) Specify which parser type H2O will use. Valid types are "ARFF", "XLS", "CSV", "SVMLight"

chunk_size

size of chunk of (input) data in bytes

decrypt_tool

(Optional) Specify a Decryption Tool (key-reference acquired by calling h2o.decryptionSetup.

skipped_columns

a list of column indices to be excluded from parsing

See also

h2o.parseRaw