Responsible for answering scalability-related questions such as: What size clusters should Kubernetes support in the short to medium term? How performant should the control system be at scale? What resource overhead should the Kubernetes control system reasonably consume? For more details about our objectives, please review our Scaling And Performance Goals.
- Thursdays at 16:00 UTC (weekly). Convert to your timezone.
Meeting notes and agenda can be found here. Meeting recordings can be found here.
- Wojciech Tyczynski (@wojtek-t), Google
- Bob Wise (@countspongebob), Samsung SDS
- Joe Beda (@jbeda), Heptio
We officially support two different SLOs:
- "API-responsiveness": 99% of all API calls return in less than 1s
- "Pod startup time": 99% of pods (with pre-pulled images) start within 5s
These SLOs should hold on appropriate hardware for clusters of up to 5000 nodes with 30 pods/node. We eventually want to expand that to 100 pods/node.
For more details on how we measure these, see: http://blog.kubernetes.io/2015_09_01_archive.html
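As a rough illustration of what the two SLOs mean numerically, the sketch below checks a list of latency samples against a "99% within a threshold" target. It is not the measurement method used by the project (see the blog post above for that). The function name `meets_slo`, its parameters, and the sample data are all hypothetical, and it assumes latencies have already been collected in seconds.

```python
# Thresholds taken from the SLO text; everything else here is illustrative.
API_LATENCY_SLO_SECONDS = 1.0   # "less than 1s" (strict comparison)
POD_STARTUP_SLO_SECONDS = 5.0   # "within 5s" (inclusive comparison)


def meets_slo(latencies_s, threshold_s, percent=99, strict=True):
    """Return True if at least `percent`% of samples fall within the threshold.

    strict=True counts samples strictly below the threshold;
    strict=False also counts samples equal to it.
    """
    if not latencies_s:
        raise ValueError("latencies_s must not be empty")
    if strict:
        within = sum(1 for x in latencies_s if x < threshold_s)
    else:
        within = sum(1 for x in latencies_s if x <= threshold_s)
    # Integer comparison avoids floating-point rounding at the boundary.
    return within * 100 >= percent * len(latencies_s)


# Made-up samples: 99 of 100 API calls finish under 1s, so the target is met.
api_latencies = [0.05] * 99 + [1.2]
api_ok = meets_slo(api_latencies, API_LATENCY_SLO_SECONDS)

# Made-up samples: only 98 of 100 pods start within 5s, so the target is missed.
pod_startups = [2.0] * 98 + [6.0, 7.5]
pods_ok = meets_slo(pod_startups, POD_STARTUP_SLO_SECONDS, strict=False)

print(api_ok, pods_ok)
```

Counting samples directly, rather than interpolating a percentile, keeps the check independent of any particular percentile definition.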
We are working on refining existing SLOs and defining more for other areas of the system.