Skip to content

Commit

Permalink
Merge pull request kubernetes#2205 from nqn/mlwg-charter-update
Browse files Browse the repository at this point in the history
Adding a bit more detail to the mlwg charter
  • Loading branch information
k8s-ci-robot authored Jun 4, 2018
2 parents 4bb68d8 + 151d810 commit bc37bc9
Showing 1 changed file with 12 additions and 4 deletions.
16 changes: 12 additions & 4 deletions wg-machine-learning/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -30,9 +30,17 @@ A working group dedicated towards making Kubernetes work best for Machine Learni

The charter for this working group as [proposed](https://groups.google.com/forum/?utm_medium=email&utm_source=footer#!msg/kubernetes-dev/lOeMjOLilxI/wuQayFDvCQAJ) is as follows:

* Assess the state of the art for ML workloads on K8s
* Identify pain points users currently have with ML on k8s
* Identify, prioritize and execute on improving k8s to better support ML workloads in the near, medium, and long term.
- Assess the state of the art for ML workloads on K8s
- Identify pain points users currently have with ML on k8s
- Identify, prioritize and execute on improving k8s to better support ML workloads in the near, medium, and long term.

## Goals:

Topics include, but are not limited to:

- Ease source changes to execution workflows, as they are a common barrier to entry.
- Scheduler enhancements such as improved bin packing for accelerators, job queueing, fair sharing and gang scheduling.
- Runtime enhancements such as job data loading (common data set sizes in the tens of gigabytes to terabytes), accelerator support, persisting job output (ML workloads can run for days and rely heavily on checkpointing) and multi-tenancy and job isolation (dealing with potential sensitive data sets).
- Job management such as experiment tracking (including enabling hyperparameter tuning systems) and scaling and deployment aspects of inference workloads.

TODO: Finalize and update the charter after the initial kick off meeting on 3/1/2018.
<!-- END CUSTOM CONTENT -->

0 comments on commit bc37bc9

Please sign in to comment.