Productionizing H2O¶
(Note: This section is new and a work in progress...)
Example Design Patterns¶
Here is a collection of example design patterns for how to productionize H2O.
Consumer loan application¶
| Characteristic | Value |
|---|---|
| Pattern name | Jetty servlet |
| Example training language | R |
| Example training data source | CSV file |
| Example scoring data source | User input to Javascript application running in browser |
| Scoring environment | REST API service provided by Jetty servlet |
| Scoring engine | H2O POJO |
| Scoring latency SLA | Real-time |
Craigslist application¶
| Characteristic | Value |
|---|---|
| Pattern name | Sparkling water streaming |
| Example training language | Scala |
| Example training data source | CSV file |
| Example scoring data source | User input to Javascript application running in browser |
| Scoring engine | H2O cluster |
| Scoring latency SLA | Real-time |
| Resource | Location |
|---|---|
| Git repos | https://github.com/h2oai/app-ask-craig |
| Blogs | |
| Slides | http://www.slideshare.net/0xdata/sparkling-water-ask-craig http://www.slideshare.net/0xdata/sparkling-water-applications-meetup-072115 |
Malicious domain application¶
| Characteristic | Value |
|---|---|
| Pattern name | AWS Lambda |
| Example training language | Python |
| Example training data source | CSV file |
| Example scoring data source | User input to Javascript application running in browser |
| Scoring environment | AWS Lambda REST API endpoint |
| Scoring engine | H2O POJO |
| Scoring latency SLA | Real-time |
Storm bolt¶
| Characteristic | Value |
|---|---|
| Pattern name | Storm bolt |
| Example training language | R |
| Example training data source | CSV file |
| Example scoring data source | Storm spout |
| Scoring environment | POJO embedded in a Storm bolt |
| Scoring engine | H2O POJO |
| Scoring latency SLA | Real-time |
| Resource | Location |
|---|---|
| Git repos | https://github.com/h2oai/h2o-tutorials/tree/master/tutorials/streaming/storm |
| Tutorials | http://docs.h2o.ai/h2o-tutorials/latest-stable/tutorials/streaming/storm/index.html |
Invoking POJO directly in R¶
| Characteristic | Value |
|---|---|
| Pattern name | POJO in R |
| Example training language | R |
| Example training data source | (Need example) |
| Example scoring data source | (Need example) |
| Scoring environment | R |
| Scoring engine | H2O POJO |
| Scoring latency SLA | Batch |
Hive UDF¶
| Characteristic | Value |
|---|---|
| Pattern name | Hive UDF |
| Example training language | R |
| Example training data source | HDFS directory with hive part files output by a SELECT |
| Example scoring data source | Hive |
| Scoring environment | Hive SELECT query (parallel MapReduce) running UDF |
| Scoring engine | H2O POJO |
| Scoring latency SLA | Batch |
Steam Scoring Server from H2O.ai¶
| Characteristic | Value |
|---|---|
| Pattern name | Steam |
| Scoring data source | REST API client |
| Scoring environment | Steam scoring server |
| Scoring engine | H2O POJO |
| Scoring latency SLA | Real-time |
| Resource | Location |
|---|---|
| Web sites | http://www.h2o.ai/steam/ |