kafkabalancer

Rebalance your kafka topics, partitions, replicas across your cluster

Purpose

kafkabalancer allows you to compute the set of rebalancing operations yielding a minimally-unbalanced kafka cluster, given a set of constraints:

  • set of allowed brokers (globally, or per partition)
  • number of desired replicas (per partition)
  • current distribution of replicas (per partition)
  • leader reassignment enabled/disabled (globally)
  • partition weight (per partition)

The goal is to minimize the workload difference between brokers in the cluster, where the workload of a broker is the sum of the weights of the partitions that have a replica on that broker. Leaders also do more work than followers (producers and consumers operate only on the leader, and followers fetch from the leader as well), so the weight applied to leader partitions is assumed to be proportional to the sum of the number of replicas and the number of consumer groups.
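
The workload definition above can be sketched in a few lines of Python. This is an illustrative model, not kafkabalancer's own code: the dictionary keys (`replicas`, `weight`, `consumers`), the convention that the first replica is the leader, and a proportionality constant of exactly 1 for the leader term are all assumptions made for the example.

```python
def broker_loads(partitions, brokers):
    """Return {broker_id: load} for the given partitions.

    Each partition is a dict with hypothetical keys:
      'replicas'  - list of broker IDs, leader assumed to be first
      'weight'    - relative weight of the partition
      'consumers' - number of consumer groups reading the partition
    """
    loads = {b: 0.0 for b in brokers}
    for p in partitions:
        leader, followers = p["replicas"][0], p["replicas"][1:]
        # Assumption: leader cost = weight * (replicas + consumer groups).
        leader_cost = p["weight"] * (len(p["replicas"]) + p["consumers"])
        loads[leader] = loads.get(leader, 0.0) + leader_cost
        for b in followers:
            # Followers only carry the plain partition weight.
            loads[b] = loads.get(b, 0.0) + p["weight"]
    return loads
```

With two equally weighted partitions replicated on brokers 1 and 2, a broker that leads a partition with a consumer group ends up with a visibly higher load than one that only follows it, which is the difference the balancer tries to even out.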

The tool is designed to be used iteratively: at each iteration only a single reassignment operation is returned by kafkabalancer. This is useful in an automated framework like the following:

  forever:
    if !kafka_cluster_is_nominal:
      continue
    state = get_current_state()
    change = kafkabalancer(state)
    if change:
      apply(change)
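
A runnable version of this loop might look like the sketch below. The four callables are hypothetical hooks that you would have to supply (for example, a health check against your monitoring, and a wrapper that invokes kafkabalancer); the polling interval and the `max_iterations` escape hatch are additions for the example, not part of the tool.

```python
import time


def rebalance_loop(is_nominal, get_state, compute_change, apply_change,
                   poll_seconds=30.0, max_iterations=None, sleep=time.sleep):
    """Apply one reassignment at a time, only while the cluster is healthy.

    All four callables are assumptions of this sketch:
      is_nominal()          -> bool, True when the cluster is healthy
      get_state()           -> current partition assignment
      compute_change(state) -> a reassignment, or a falsy value if none
      apply_change(change)  -> performs the reassignment
    Returns the number of changes applied.
    """
    applied = 0
    iteration = 0
    while max_iterations is None or iteration < max_iterations:
        iteration += 1
        if is_nominal():
            change = compute_change(get_state())
            if change:
                apply_change(change)
                applied += 1
        # Wait before polling again so the loop does not spin.
        sleep(poll_seconds)
    return applied
```

Waiting for the cluster to return to a nominal state before each step means that a reassignment still in progress is never compounded by a second one.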

Installation

go get github.com/cafxx/kafkabalancer

Usage

Run kafkabalancer -help for usage instructions.

Usage of ./kafkabalancer:
  -allow-leader
    	Consider the partition leader eligible for rebalancing
  -broker-ids string
    	Comma-separated list of broker IDs (default "auto")
  -input string
    	Name of the file to read (if no file is specified, read from stdin)
  -input-json
    	Parse the input as JSON
  -max-reassign int
    	Maximum number of reassignments to generate (default 1)
  -min-replicas int
    	Minimum number of replicas for a partition to be eligible for rebalancing (default 2)
  -min-umbalance float
    	Minimum umbalance value required to perform rebalancing (default 1e-05)

How to perform rebalancing

First dump the list of partitions from your Kafka cluster ($ZK is the comma-separated list of your ZooKeeper nodes):

kafka-topics.sh --zookeeper $ZK --describe > kafka-topics.txt
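
To see what kafkabalancer has to extract from this dump, here is a minimal Python sketch of a parser for it. It is not the tool's own parser, and it assumes the per-partition lines look like `Topic: foo  Partition: 0  Leader: 1  Replicas: 1,2  Isr: 1,2`, which may differ between Kafka versions.

```python
import re

# Assumed layout of a per-partition line; topic summary lines do not match.
_PARTITION_LINE = re.compile(
    r"Topic:\s*(?P<topic>\S+)\s+Partition:\s*(?P<partition>\d+)\s+"
    r"Leader:\s*(?P<leader>-?\d+)\s+Replicas:\s*(?P<replicas>\d+(?:,\d+)*)"
)


def parse_describe(text):
    """Return a list of {'topic', 'partition', 'replicas'} dicts."""
    partitions = []
    for line in text.splitlines():
        m = _PARTITION_LINE.search(line)
        if m is None:
            continue  # skip topic summary lines and blank lines
        partitions.append({
            "topic": m.group("topic"),
            "partition": int(m.group("partition")),
            "replicas": [int(b) for b in m.group("replicas").split(",")],
        })
    return partitions
```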

Next run kafkabalancer on the list (note: this assumes that all partitions have the same weight and no consumers; this is functionally OK but could lead to suboptimal load distribution). kafkabalancer will analyze the list and suggest one or more reassignments:

kafkabalancer -input kafka-topics.txt > reassignment.json

To perform the suggested change(s), run the following command:

kafka-reassign-partitions.sh --zookeeper $ZK --reassignment-json-file reassignment.json --execute
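
For reference, the file consumed by kafka-reassign-partitions.sh is a small JSON document with a `version` field and a list of `partitions`, each naming a topic, a partition number and the desired replica list. The Python sketch below builds such a document; the function name and the tuple-based input are choices made for the example, and the topic name in the usage line is made up.

```python
import json


def reassignment_json(changes):
    """Build a reassignment document for kafka-reassign-partitions.sh.

    changes: iterable of (topic, partition, replicas) tuples, where
    replicas is the desired list of broker IDs for that partition.
    """
    doc = {
        "version": 1,
        "partitions": [
            {"topic": topic, "partition": partition, "replicas": list(replicas)}
            for topic, partition, replicas in changes
        ],
    }
    return json.dumps(doc)


# Example: move partition 1 of the (hypothetical) topic "foo" to brokers 1 and 3.
print(reassignment_json([("foo", 1, [1, 3])]))
```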

Features

  • parse the output of kafka-topics.sh --describe
  • parse the reassignment JSON format
  • output the reassignment JSON format
  • minimize leader unbalance (maximize global throughput)

Planned

  • parse the output of kafka-offset.sh to get the per-partition weights (number of messages)
  • fetch additional metrics from elsewhere to refine the weights (e.g. number of consumers, size of messages)
  • minimize same-broker colocation of partitions of the same topic (maximize per-topic throughput)
  • proactively minimize the unbalance caused by one or more brokers failing, taking into consideration how followers are elected to leaders when the leader fails
  • consider N-way rebalancing plans (e.g. swap two replicas) to avoid local minima
  • prefer to relocate "small" partitions to minimize the additional load due to moving data between brokers
  • use something like https://github.com/wvanbergen/kazoo-go to query state directly
  • use something like https://github.com/wvanbergen/kazoo-go to apply changes directly

Scenarios

This section lists some examples of how kafkabalancer operates.

Adding brokers

Setting broker-ids=1,2,3 will move partition 1 from broker 2 to broker 3 to equalize the load:

Partition  Original  Output
1          1,2       1,3
2          2,1       2,1
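
A single rebalancing step of this kind can be sketched as a greedy search: try moving each follower replica to every other allowed broker and keep the move that lowers the unbalance the most. The Python below is a simplified illustration and not kafkabalancer's actual algorithm. It assumes that every partition has weight 1, that the first replica is the leader and is never moved, that all replicas already sit on allowed brokers, and that unbalance is measured as the variance of the broker loads (scaled by n² so the arithmetic stays in integers).

```python
def loads_of(assignment, brokers):
    """Count replicas per broker (every partition has weight 1 here)."""
    loads = {b: 0 for b in brokers}
    for replicas in assignment.values():
        for b in replicas:
            loads[b] += 1
    return loads


def unbalance(loads):
    """n^2 times the variance of the loads, computed exactly in integers."""
    values = list(loads.values())
    return len(values) * sum(v * v for v in values) - sum(values) ** 2


def best_single_move(assignment, brokers):
    """Return (partition, new_replicas) for the best follower move, or None.

    assignment maps a partition ID to its list of broker IDs, leader first.
    """
    best = None
    best_score = unbalance(loads_of(assignment, brokers))
    for part, replicas in assignment.items():
        for i in range(1, len(replicas)):  # index 0 is the leader: keep it
            for target in brokers:
                if target in replicas:
                    continue  # a broker holds at most one replica per partition
                candidate = replicas[:i] + [target] + replicas[i + 1:]
                trial = dict(assignment)
                trial[part] = candidate
                score = unbalance(loads_of(trial, brokers))
                if score < best_score:  # strict: ties keep the earlier move
                    best, best_score = (part, candidate), score
    return best


print(best_single_move({1: [1, 2], 2: [2, 1]}, [1, 2, 3]))  # (1, [1, 3])
```

Under these assumptions the sketch picks the same move as the table above; calling it again on the resulting assignment returns None because no single follower move improves the balance further.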

Setting broker-ids=1,2,3,4 will move partition 1 from brokers 1,2 to brokers 4,3 to equalize the load:

Partition  Original  Output
1          1,2       4,3
2          2,1       2,1

Removing brokers

Setting broker-ids=1,2 will move partition 3 from broker 3 to broker 2 to equalize the load:

Partition  Original  Output
1          1,2       1,2
2          1         1
3          3         2

Setting broker-ids=1 will return an error because partition 1 requires 2 replicas.

Add replicas

Setting NumReplicas=2 for partition 3 will add a replica on broker 2 to equalize the load.

Partition  Original  Output
1          1,2       1,2
2          1,3       1,3
3          3         2,3

Remove replicas

Setting NumReplicas=1 for partition 1 will remove the replica from broker 1 to equalize the load.

Partition  Original  Output
1          1,2       2
2          1         1

Automated rebalancing

If the constraints do not require any change (no brokers or replicas to add or remove), kafkabalancer will simply seek to equalize the load between brokers:

Partition  Original  Output
1          1,2,3     1,4,3
2          1,2,4     1,2,4
3          1,2,3     1,2,3

If leader reassignment is enabled (-allow-leader), the leaders are also eligible for rebalancing:

Partition  Original  Output
1          1,2,3     4,2,1
2          1,2,4     3,2,4
3          1,2,3     1,2,3

Author

Carlo Alberto Ferraris (@cafxx)

License

MIT
