Productionizing H2O
(Note: This section is new and a work in progress...)
Example Design Patterns
The following example design patterns show ways to deploy H2O models in production.
Consumer loan application
| Characteristic | Value | 
|---|---|
| Pattern name | Jetty servlet | 
| Example training language | R | 
| Example training data source | CSV file | 
| Example scoring data source | User input to JavaScript application running in browser |
| Scoring environment | REST API service provided by Jetty servlet | 
| Scoring engine | H2O POJO | 
| Scoring latency SLA | Real-time | 
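The request-to-prediction flow of this pattern can be sketched as follows. This is only an illustration of the data flow, not the Jetty servlet itself: the real pattern runs a Java servlet that calls the H2O POJO, whereas here `score_row` is a hypothetical stand-in with a made-up rule, and the `/predict` path and the column names `loan_amnt` and `annual_inc` are assumptions chosen for the example.

```python
import json
from urllib.parse import parse_qs, urlparse


def score_row(row):
    """Hypothetical stand-in for the H2O POJO (made-up rule, not a trained model)."""
    # Guard against division by zero for a missing or zero income.
    ratio = float(row["loan_amnt"]) / max(float(row["annual_inc"]), 1.0)
    label = "1" if ratio > 5.0 else "0"
    return {"label": label, "ratio": round(ratio, 3)}


def handle_predict(url):
    """Turn a GET /predict?col=value&... URL into a (status, JSON body) pair."""
    query = parse_qs(urlparse(url).query)
    # parse_qs yields a list per parameter; keep the first value for each column.
    row = {name: values[0] for name, values in query.items()}
    try:
        prediction = score_row(row)
    except (KeyError, ValueError) as exc:
        # Missing or non-numeric columns are reported as a client error.
        return 400, json.dumps({"error": str(exc)})
    return 200, json.dumps(prediction)
```

A browser application would send one such request per form submission and render the returned JSON. Keeping the scoring call a plain in-process function, as the POJO is, is what makes a real-time response feasible.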
Craigslist application
| Characteristic | Value | 
|---|---|
| Pattern name | Sparkling Water streaming |
| Example training language | Scala | 
| Example training data source | CSV file | 
| Example scoring data source | User input to JavaScript application running in browser |
| Scoring engine | H2O cluster | 
| Scoring latency SLA | Real-time |

| Resource | Location |
|---|---|
| Git repos | https://github.com/h2oai/app-ask-craig | 
| Blogs | |
| Slides | http://www.slideshare.net/0xdata/sparkling-water-ask-craig http://www.slideshare.net/0xdata/sparkling-water-applications-meetup-072115  | 
Malicious domain application
| Characteristic | Value | 
|---|---|
| Pattern name | AWS Lambda | 
| Example training language | Python | 
| Example training data source | CSV file | 
| Example scoring data source | User input to JavaScript application running in browser |
| Scoring environment | AWS Lambda REST API endpoint | 
| Scoring engine | H2O POJO | 
| Scoring latency SLA | Real-time | 
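The shape of a Lambda-backed scoring endpoint can be sketched as below. The pattern itself scores with an H2O POJO, which is Java; this Python handler only illustrates the event-in, response-out contract. It assumes the function sits behind an API Gateway proxy integration (hence `queryStringParameters`, `statusCode`, and `body`), and `score_domain` with its rule and the `domain` parameter name are hypothetical stand-ins, not the trained model.

```python
import json


def score_domain(domain):
    """Hypothetical stand-in for the H2O POJO (made-up rule, not a trained model)."""
    name = domain.split(".")[0]
    digits = sum(ch.isdigit() for ch in name)
    # Flag very long names or names where more than 30% of characters are digits.
    malicious = len(name) > 20 or (len(name) > 0 and digits / len(name) > 0.3)
    return {"domain": domain, "malicious": malicious}


def lambda_handler(event, context):
    """Handler in the (event, context) form that AWS Lambda calls for Python."""
    # With a proxy integration the query string arrives as a dict, or None if empty.
    params = event.get("queryStringParameters") or {}
    domain = params.get("domain")
    if not domain:
        return {"statusCode": 400, "body": json.dumps({"error": "missing 'domain'"})}
    return {"statusCode": 200, "body": json.dumps(score_domain(domain))}
```

Because the handler holds no state between calls, each request is scored independently, which suits the per-request scaling that Lambda provides.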
Storm bolt
| Characteristic | Value | 
|---|---|
| Pattern name | Storm bolt | 
| Example training language | R | 
| Example training data source | CSV file | 
| Example scoring data source | Storm spout | 
| Scoring environment | POJO embedded in a Storm bolt | 
| Scoring engine | H2O POJO | 
| Scoring latency SLA | Real-time |

| Resource | Location |
|---|---|
| Git repos | https://github.com/h2oai/h2o-tutorials/tree/master/tutorials/streaming/storm | 
| Tutorials | http://docs.h2o.ai/h2o-tutorials/latest-stable/tutorials/streaming/storm/index.html | 
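The per-tuple flow of this pattern can be sketched with plain generators. This is not Storm code: a real topology implements the bolt in Java and calls the POJO inside it. Here `spout`, `predict_tuple`, `scoring_bolt`, the `value` field, and the threshold are all hypothetical names and values chosen to show how each tuple is scored as it arrives rather than in a batch.

```python
def spout(rows):
    """Hypothetical stand-in for a Storm spout: emits one tuple at a time."""
    for row in rows:
        yield row


def predict_tuple(row):
    """Hypothetical stand-in for the POJO's per-row predict call (made-up rule)."""
    return "high" if row["value"] >= 0.5 else "low"


def scoring_bolt(stream):
    """Consume tuples one by one and emit each with its prediction attached."""
    for row in stream:
        # Copy the incoming fields and add the prediction as a new field.
        yield {**row, "prediction": predict_tuple(row)}
```

For example, `list(scoring_bolt(spout([{"id": 1, "value": 0.9}])))` yields a single tuple carrying the original fields plus a `prediction` field. Downstream bolts would consume that enriched tuple in the same way.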
Invoking POJO directly in R
| Characteristic | Value | 
|---|---|
| Pattern name | POJO in R | 
| Example training language | R | 
| Example training data source | (Need example) | 
| Example scoring data source | (Need example) | 
| Scoring environment | R | 
| Scoring engine | H2O POJO | 
| Scoring latency SLA | Batch | 
Hive UDF
| Characteristic | Value | 
|---|---|
| Pattern name | Hive UDF | 
| Example training language | R | 
| Example training data source | HDFS directory with Hive part files output by a SELECT |
| Example scoring data source | Hive | 
| Scoring environment | Hive SELECT query (parallel MapReduce) running UDF | 
| Scoring engine | H2O POJO | 
| Scoring latency SLA | Batch | 
Steam Scoring Server from H2O.ai
| Characteristic | Value | 
|---|---|
| Pattern name | Steam | 
| Scoring data source | REST API client | 
| Scoring environment | Steam scoring server | 
| Scoring engine | H2O POJO | 
| Scoring latency SLA | Real-time |

| Resource | Location |
|---|---|
| Web sites | http://www.h2o.ai/steam/ |