The second phase in the data ingestion step.
h2o.parseRaw(data, pattern = "", destination_frame = "", header = NA, sep = "", col.names = NULL, col.types = NULL, na.strings = NULL, blocking = FALSE, parse_type = NULL, chunk_size = NULL, decrypt_tool = NULL)
data | An H2OFrame object to be parsed. |
---|---|
pattern | (Optional) Character string containing a regular expression to match file(s) in the folder. |
destination_frame | (Optional) The hex key assigned to the parsed file. |
header | (Optional) A logical value indicating whether the first row is the column header. If missing, H2O will automatically try to detect the presence of a header. |
sep | (Optional) The field separator character. Values on each line of
the file are separated by this character. If |
col.names | (Optional) An H2OFrame object containing a single delimited line with the column names for the file. |
col.types | (Optional) A vector specifying the types to attempt to force over columns. |
na.strings | (Optional) H2O will interpret these strings as missing. |
blocking | (Optional) Tell H2O parse call to block synchronously instead of polling. This can be faster for small datasets but loses the progress bar. |
parse_type | (Optional) Specify which parser type H2O will use. Valid types are "ARFF", "XLS", "CSV", "SVMLight" |
chunk_size | size of chunk of (input) data in bytes |
decrypt_tool | (Optional) Specify a Decryption Tool (key-reference acquired by calling h2o.decryptionSetup. |
Parse the Raw Data produced by the import phase.
h2o.importFile, h2o.parseSetup