h2o-3/h2o-persist-drive at 054a599f1b6794047c52fe17592cd87ebccc9ab1 · JakubVe/h2o-3

History

Name		Name	Last commit message	Last commit date
parent directory ..
src		src
tests/python		tests/python
bootstrap.sh		bootstrap.sh
build.gradle		build.gradle
readme.txt		readme.txt
settings.gradle		settings.gradle

readme.txt

This is an experimental support for custom persist implementation relying on python and GraalVM

Currently, our example uses S3 and boto3 python library to download file locally and parse into H2O.

Installation Steps:

1) Download GraalVM
wget https://github.com/graalvm/graalvm-ce-builds/releases/download/vm-22.2.0/graalvm-ce-java17-linux-amd64-22.2.0.tar.gz
tar xfz graalvm-ce-java17-linux-amd64-22.2.0.tar.gz
export GRAAL_HOME=$(ls -d graalvm-*)

2) Create Python environment with boto3 
$GRAAL_HOME/bin/gu install python
$GRAAL_HOME/bin/graalpython -m venv venv
(source venv/bin/activate && pip install boto3)

3) Set AWS credentials into environment variables
export AWS_ACCESS_KEY_ID=...
export AWS_SECRET_ACCESS_KEY=...

4) Run H2O with GraalVM
$GRAAL_HOME/bin/java -jar ../../build/h2o.jar

5) Import data from "drive" (in our case backed by S3)
python
> import h2o
> h2o.connect()
> h2o.import_file("drive://h2o-public-test-data/smalldata/iris/iris.csv")

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

h2o-persist-drive

h2o-persist-drive

readme.txt

Files

h2o-persist-drive

Directory actions

More options

Directory actions

More options

Latest commit

History

h2o-persist-drive

Folders and files

parent directory

readme.txt