The OpenMessaging Benchmark Framework

This repository houses user-friendly, cloud-ready benchmarking suites for the following messaging platforms:

A benchmarking suite for RocketMQ and RabbitMQ will be available soon.

For each platform, the benchmarking suite includes easy-to-use scripts for deploying that platform on Amazon Web Services (AWS) and then running benchmarks upon deployment. For end-to-end instructions, see platform-specific docs for:

Project goals

The goal of the OpenMessaging Benchmark Framework is to provide benchmarking suites for an ever-expanding variety of messaging platforms. These suites are intended to be:

  • Cloud friendly — All benchmarks are run on cloud infrastructure, not on your laptop
  • Easy to use — Just a few CLI commands get you from zero to completed benchmarks
  • Transparent — All benchmarking code is open source, with pull requests very welcome
  • Realistic — Benchmarks should be largely oriented toward standard use cases rather than bizarre edge cases

Benchmarking workloads

Benchmarking workloads are specified in YAML configuration files that are available in the workloads directory. The table below describes each workload in terms of the following parameters:

  • The number of topics
  • The number of partitions per topic
  • The size of the messages being produced and consumed
  • The number of subscriptions per topic
  • The number of producers per topic
  • The rate at which producers produce messages (per second). Note: a value of 0 means that messages are produced as quickly as possible, with no rate limiting.
  • The size of the consumer's backlog (in gigabytes)
  • The total duration of the test (in minutes)

| Workload | Topics | Partitions per topic | Message size | Subscriptions per topic | Producers per topic | Producer rate (per second) | Consumer backlog size (GB) | Test duration (minutes) |
|---|---|---|---|---|---|---|---|---|
| simple-workload.yaml | 1 | 10 | 1 kB | 1 | 1 | 10000 | 0 | 5 |
| 1-topic-1-partition-1kb.yaml | 1 | 1 | 1 kB | 1 | 1 | 50000 | 0 | 15 |
| 1-topic-1-partition-100b.yaml | 1 | 1 | 100 bytes | 1 | 1 | 50000 | 0 | 15 |
| 1-topic-16-partitions-1kb.yaml | 1 | 16 | 1 kB | 1 | 1 | 50000 | 0 | 15 |
| backlog-1-topic-1-partition-1kb.yaml | 1 | 1 | 1 kB | 1 | 1 | 100000 | 100 | 5 |
| backlog-1-topic-16-partitions-1kb.yaml | 1 | 16 | 1 kB | 1 | 1 | 100000 | 100 | 5 |
| max-rate-1-topic-1-partition-1kb.yaml | 1 | 1 | 1 kB | 1 | 1 | 0 | 0 | 5 |
| max-rate-1-topic-1-partition-100b.yaml | 1 | 1 | 100 bytes | 1 | 1 | 0 | 0 | 5 |
| 1-topic-3-partition-100b-3producers.yaml | 1 | 3 | 100 bytes | 1 | 3 | 0 | 0 | 15 |
| max-rate-1-topic-16-partitions-1kb.yaml | 1 | 16 | 1 kB | 1 | 1 | 0 | 0 | 5 |
| max-rate-1-topic-16-partitions-100b.yaml | 1 | 16 | 100 bytes | 1 | 1 | 0 | 0 | 5 |
| max-rate-1-topic-100-partitions-1kb.yaml | 1 | 100 | 1 kB | 1 | 1 | 0 | 0 | 5 |
| max-rate-1-topic-100-partitions-100b.yaml | 1 | 100 | 100 bytes | 1 | 1 | 0 | 0 | 5 |

Instructions for running specific workloads—or all workloads sequentially—can be found in the platform-specific documentation.
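
As a rough illustration of how the workload parameters relate to each other, the sketch below turns a producer rate and message size into a target publish throughput. The `Workload` class and its field names are hypothetical and are not part of the benchmark code. The sketch also assumes that the producer rate is the aggregate rate for the workload rather than a per-producer rate, which the table does not state.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Workload:
    """Hypothetical container for a few of the workload parameters."""
    name: str
    message_size_bytes: int
    producer_rate: int  # messages per second; 0 means no rate limiting

    def target_bytes_per_second(self) -> Optional[int]:
        """Target publish throughput, or None when the rate is unlimited."""
        if self.producer_rate == 0:
            return None
        # Assumption: producer_rate is the total rate for the workload.
        return self.producer_rate * self.message_size_bytes


limited = Workload("1-topic-1-partition-100b.yaml", 100, 50000)
unlimited = Workload("max-rate-1-topic-1-partition-100b.yaml", 100, 0)

for workload in (limited, unlimited):
    target = workload.target_bytes_per_second()
    if target is None:
        print(f"{workload.name}: no rate limit, throughput is whatever the system sustains")
    else:
        print(f"{workload.name}: {target} bytes/s ({target / 2**20:.2f} MiB/s)")
```

A rate-limited workload therefore has a known throughput ceiling, while the max-rate workloads have no target and measure what the platform can sustain.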

Interpreting the results

Initially, you should see a log message like this, which confirms that a warm-up phase is starting:

22:03:19.125 [main] INFO - ----- Starting warm-up traffic ------

You should then see a handful of readings, followed by an aggregation message that looks like this:

22:04:19.329 [main] INFO - ----- Aggregated Pub Latency (ms) avg:  2.1 - 50%:  1.7 - 95%:  3.0 - 99%: 11.8 - 99.9%: 45.4 - 99.99%: 52.6 - Max: 55.4

At this point, the benchmarking traffic will begin. You'll start to see readings like this emitted every few seconds:

22:03:29.199 [main] INFO - Pub rate 50175.1 msg/s /  4.8 Mb/s | Cons rate 50175.2 msg/s /  4.8 Mb/s | Backlog:  0.0 K | Pub Latency (ms) avg:  3.5 - 50%:  1.9 - 99%: 39.8 - 99.9%: 52.3 - Max: 55.4

The table below breaks down the information presented in the benchmarking log messages (all figures are for the most recent 10-second time window):

| Measure | Meaning | Units |
|---|---|---|
| Pub rate | The rate at which messages are published to the topic | Messages per second / Megabytes per second |
| Cons rate | The rate at which messages are consumed from the topic | Messages per second / Megabytes per second |
| Backlog | The number of messages in the messaging system's backlog | Number of messages (in thousands) |
| Pub Latency (ms) | The publish latency within the time window | Milliseconds (average, 50th percentile, 99th percentile, 99.9th percentile, and maximum) |
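
If you want to chart or post-process these periodic readings, one option is to parse them from the log. The following is a minimal sketch, not part of the benchmark itself. The regular expression is inferred from the single sample reading shown above, so it assumes the log format stays exactly as in that sample, and the function and key names are hypothetical.

```python
import re

# Pattern inferred from the one sample reading in this document.
READING_RE = re.compile(
    r"Pub rate\s+(?P<pub_msgs>[\d.]+) msg/s /\s*(?P<pub_mb>[\d.]+) Mb/s \| "
    r"Cons rate\s+(?P<cons_msgs>[\d.]+) msg/s /\s*(?P<cons_mb>[\d.]+) Mb/s \| "
    r"Backlog:\s*(?P<backlog_k>-?[\d.]+) K \| "
    r"Pub Latency \(ms\) avg:\s*(?P<avg>[\d.]+) - 50%:\s*(?P<p50>[\d.]+) - "
    r"99%:\s*(?P<p99>[\d.]+) - 99\.9%:\s*(?P<p999>[\d.]+) - Max:\s*(?P<max>[\d.]+)"
)


def parse_reading(line):
    """Return the fields of a periodic reading as floats, or None if no match."""
    match = READING_RE.search(line)
    if match is None:
        return None
    return {name: float(value) for name, value in match.groupdict().items()}


SAMPLE = (
    "22:03:29.199 [main] INFO - Pub rate 50175.1 msg/s /  4.8 Mb/s | "
    "Cons rate 50175.2 msg/s /  4.8 Mb/s | Backlog:  0.0 K | "
    "Pub Latency (ms) avg:  3.5 - 50%:  1.9 - 99%: 39.8 - 99.9%: 52.3 - Max: 55.4"
)

reading = parse_reading(SAMPLE)
print(reading["pub_msgs"], reading["cons_msgs"], reading["p99"])
```

Lines that are not periodic readings, such as the warm-up banner, return `None` and can be skipped.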

At the end of each workload, you'll see a log message that aggregates the results:

22:19:20.577 [main] INFO - ----- Aggregated Pub Latency (ms) avg:  1.8 - 50%:  1.7 - 95%:  2.8 - 99%:  3.0 - 99.9%:  8.0 - 99.99%: 17.1 - Max: 58.9
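
The aggregated lines can be compared programmatically, for example to see how the final figures differ from the warm-up figures. The sketch below is one way to do that, assuming the line format matches the two aggregated samples shown in this document. The helper name `parse_aggregate` is hypothetical.

```python
import re

AGG_MARKER = "Aggregated Pub Latency (ms)"
# Matches "avg:  1.8", "99.9%:  8.0", "Max: 58.9", and so on.
STAT_RE = re.compile(r"(avg|Max|[\d.]+%):\s*([\d.]+)")


def parse_aggregate(line):
    """Return {label: latency in ms} for an aggregated line, or None."""
    _, marker, tail = line.partition(AGG_MARKER)
    if not marker:
        return None
    return {label: float(value) for label, value in STAT_RE.findall(tail)}


warmup = parse_aggregate(
    "22:04:19.329 [main] INFO - ----- Aggregated Pub Latency (ms) avg:  2.1 - "
    "50%:  1.7 - 95%:  3.0 - 99%: 11.8 - 99.9%: 45.4 - 99.99%: 52.6 - Max: 55.4"
)
final = parse_aggregate(
    "22:19:20.577 [main] INFO - ----- Aggregated Pub Latency (ms) avg:  1.8 - "
    "50%:  1.7 - 95%:  2.8 - 99%:  3.0 - 99.9%:  8.0 - 99.99%: 17.1 - Max: 58.9"
)

# Print warm-up and final latency side by side for each statistic.
for label, value in final.items():
    print(f"{label:>7}: warm-up {warmup[label]:5.1f} ms, final {value:5.1f} ms")
```

In these two samples the tail percentiles are noticeably lower in the final aggregate than in the warm-up aggregate, while the median is unchanged.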

You'll also see a message like this that tells you which JSON file the benchmarking results have been saved to (all JSON results are saved to the /opt/benchmark directory):

22:19:20.592 [main] INFO - Writing test result into 1-topic-1-partition-100b-Kafka-2018-01-29-22-19-20.json
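
When several workloads have run, it can help to index the result files by workload and platform. The sketch below splits a result file name into its parts and loads each file as generic JSON. The naming pattern (workload, then platform name, then a `yyyy-MM-dd-HH-mm-ss` timestamp) is an assumption inferred from the single file name above, the helper names are hypothetical, and no assumption is made about the fields inside the JSON.

```python
import json
from datetime import datetime
from pathlib import Path

RESULTS_DIR = Path("/opt/benchmark")  # directory named in this document


def parse_result_name(filename):
    """Split '<workload>-<platform>-<timestamp>.json' into its three parts.

    Assumes the platform name contains no hyphens and the timestamp is the
    last six hyphen-separated fields, as in the sample file name.
    """
    parts = Path(filename).stem.split("-")
    if len(parts) < 8:
        raise ValueError(f"unexpected result file name: {filename}")
    timestamp = datetime.strptime("-".join(parts[-6:]), "%Y-%m-%d-%H-%M-%S")
    platform = parts[-7]
    workload = "-".join(parts[:-7])
    return workload, platform, timestamp


def load_results(directory=RESULTS_DIR):
    """Yield (workload, platform, timestamp, data) for each JSON result file."""
    for path in sorted(Path(directory).glob("*.json")):
        workload, platform, timestamp = parse_result_name(path.name)
        with path.open() as handle:
            yield workload, platform, timestamp, json.load(handle)


workload, platform, timestamp = parse_result_name(
    "1-topic-1-partition-100b-Kafka-2018-01-29-22-19-20.json"
)
print(workload, platform, timestamp.isoformat())
```

Calling `load_results()` on the machine that ran the benchmarks would then iterate over every saved result. Inspect the keys of each loaded object before relying on any particular field.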

The process explained above will repeat for each benchmarking workload that you run.

Adding a new platform

In order to add a new platform for benchmarking, you need to provide the following: