Download H2O 3.38.0.3

Download and Run Install in R Install in Python Install on Hadoop Use from Maven Kubernetes

Download H₂O

Get started with H₂O in 3 easy steps

1. Download H₂O. This is a zip file that contains everything you need to get started.

2. From your terminal, run:

cd ~/Downloads
unzip h2o-3.38.0.3.zip
cd h2o-3.38.0.3
java -jar h2o.jar

3. Point your browser to http://localhost:54321

Setup H₂O on Kubernetes using Helm

Helm can be used to deploy H2O into a kubernetes cluster. Helm requires the KUBECONFIG environment variable to be set up properly, or stating the kubeconfig destination explicitly. Please refer to Helm's documentation for further information.

helm repo add h2o https://charts.h2o.ai
helm install basic-h2o h2o/h2o
helm test basic-h2o

There are various settings and modifications available. To inspect the configuration options available, use the "helm inspect values h2o/h2o --version 3.38.0.3" command.

Setup H₂O on Kubernetes with kubectl

1. Set-up kubernetes cluster and kubectl.

2. (Optional) Adjust the 'default' namespace in the following YAML, if required.

apiVersion: apps/v1 kind: StatefulSet metadata: name: h2o-cluster-stateful-set namespace: default spec: serviceName: h2o-service podManagementPolicy: "Parallel" replicas: 1 selector: matchLabels: app: h2o-cluster template: metadata: labels: app: h2o-cluster spec: containers: - name: h2o-cluster image: 'h2oai/h2o-open-source-k8s:docker-image-version' command: ["/bin/bash", "-c", "java -XX:+UseContainerSupport -XX:MaxRAMPercentage=50 -jar /opt/h2oai/h2o-3/h2o.jar"] ports: - containerPort: 54321 protocol: TCP readinessProbe: httpGet: path: /kubernetes/isLeaderNode port: 8081 initialDelaySeconds: 5 periodSeconds: 5 failureThreshold: 1 resources: limits: cpu: '1' memory: 1Gi requests: cpu: '1' memory: 1Gi env: - name: H2O_KUBERNETES_SERVICE_DNS value: h2o-cluster-service.default.svc.cluster.local - name: H2O_NODE_LOOKUP_TIMEOUT value: '180' - name: H2O_NODE_EXPECTED_COUNT value: '1' - name: H2O_KUBERNETES_API_PORT value: '8081' --- apiVersion: v1 kind: Service metadata: name: h2o-cluster-service namespace: default spec: type: ClusterIP clusterIP: None selector: app: h2o-cluster ports: - protocol: TCP port: 80 targetPort: 54321

Environment variables:

H2O_KUBERNETES_SERVICE_DNS - [MANDATORY] Crucial for the clustering to work. The format usually follows the {service-name}.{project-name}.svc.cluster.local pattern. This setting enables H2O node discovery via DNS. It must be modified to match the name of the headless service created. Also, pay attention to the rest of the address to match the specifics of your Kubernetes implementation.

H2O_NODE_LOOKUP_TIMEOUT - [OPTIONAL] Node lookup constraint. Time before the node lookup is ended.

H2O_NODE_EXPECTED_COUNT - [OPTIONAL] Node lookup constraint. Expected number of H2O pods to be discovered.

H2O_KUBERNETES_API_PORT - [OPTIONAL] Port for Kubernetes API checks and probes to listen on. Defaults to 8080.

3. Issue "kubectl apply -f filename.yaml" to deploy H2O into Kubernetes.

4. (Optional) Adjust the YAML file to spawn more nodes or allocate more resources for the H2O cluster.

H₂O

Get started with H₂O in 3 easy steps

Use H₂O directly from Python

Conda Installation

Use H₂O directly from R

Run H₂O on Hadoop in just 3 steps

Gradle-style specification for Maven artifacts

Setup H₂O on Kubernetes using Helm

Setup H₂O on Kubernetes with kubectl

User Documentation

Developer Documentation

Booklets

H2O

Get started with H2O in 3 easy steps

Use H2O directly from Python

Conda Installation

Use H2O directly from R

Run H2O on Hadoop in just 3 steps

Gradle-style specification for Maven artifacts

Setup H2O on Kubernetes using Helm

Setup H2O on Kubernetes with kubectl

User Documentation

Developer Documentation

Booklets

H₂O

Get started with H₂O in 3 easy steps

Use H₂O directly from Python

Use H₂O directly from R

Run H₂O on Hadoop in just 3 steps

Setup H₂O on Kubernetes using Helm

Setup H₂O on Kubernetes with kubectl