Kubeflow pipeline components are implementations of Kubeflow pipeline tasks. Each task takes one or more artifacts as input and may produce one or more artifacts as output.
Example: XGBoost DataProc components
Each task usually includes two parts:

- **Client code**: the code that talks to service endpoints to submit jobs. For example, code that calls the Google Dataproc API to submit a Spark job.
- **Runtime code**: the code that does the actual work and usually runs in the cluster. For example, Spark code that transforms raw data into preprocessed data.

In addition, a container image packages and runs the client code.
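As a sketch of how these pieces fit together, the following example (assuming the kfp v1 Python SDK) defines a one-step pipeline whose task runs a container image holding the client code and declares an output artifact; the image name, arguments, and paths are hypothetical:

```python
import kfp
from kfp import dsl


@dsl.pipeline(
    name="mytask-demo",
    description="Minimal sketch of a single-task pipeline.",
)
def mytask_pipeline(input_path: str = "gs://my-bucket/raw-data"):
    # The task runs the (hypothetical) container image that wraps the
    # client code. file_outputs declares the artifact this step produces
    # so downstream tasks can consume it.
    dsl.ContainerOp(
        name="mytask",
        image="gcr.io/my-project/mytask:latest",  # hypothetical image
        arguments=["--input", input_path, "--output", "/tmp/output.txt"],
        file_outputs={"output": "/tmp/output.txt"},
    )


if __name__ == "__main__":
    # Compile the pipeline to a package that can be uploaded to Kubeflow Pipelines.
    kfp.compiler.Compiler().compile(mytask_pipeline, "mytask_pipeline.yaml")
```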
Note the naming convention for client code and runtime code. For a task named "mytask":

- The `mytask.py` program contains the client code.
- The `mytask` directory contains all the runtime code.
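Under that convention, the client program might look like the sketch below. The argument names and the `submit_job` helper are hypothetical; in a real component the helper would call the target service's API (for example, Dataproc), and the runtime code it launches would live in the `mytask` directory:

```python
# mytask.py -- hypothetical client code for a task named "mytask".
import argparse
import logging


def submit_job(input_path: str, output_path: str) -> None:
    """Placeholder for the call that submits the runtime code
    (e.g. the Spark job in the mytask/ directory) to a cluster."""
    logging.info("Submitting job: input=%s output=%s", input_path, output_path)


def main() -> None:
    parser = argparse.ArgumentParser(description="Client code for mytask")
    parser.add_argument("--input", required=True, help="Path to the raw data")
    parser.add_argument("--output", required=True, help="Path for the preprocessed data")
    args = parser.parse_args()

    logging.basicConfig(level=logging.INFO)
    submit_job(args.input, args.output)


if __name__ == "__main__":
    main()
```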