✍️ add article on Criterion in Rust

thinkjrs · Dec 20, 2024 · 38c45f6 · 38c45f6
1 parent d3e78f3
commit 38c45f6
Showing 1 changed file with 236 additions and 0 deletions.
diff --git a/data/blog/using-criterion-to-benchmark-rust-code.mdx b/data/blog/using-criterion-to-benchmark-rust-code.mdx
@@ -0,0 +1,236 @@
+---
+title: Using Criterion To Benchmark Rust Code
+date: '2024-12-20'
+tags: ['Rust', 'Benchmarking', 'Code', 'Software', 'Developer tooling']
+draft: false
+summary: "Rust's ecosystem for benchmarking is robust. Here's a dive into using my personal favorite, criterion.rs."
+images: ['/static/images/using-criterion-to-benchmark-rust-code.webp']
+authors: ['default']
+layout: PostLayout
+---
+
+<Image
+  className="rounded-md shadow-md"
+  src="/static/images/using-criterion-to-benchmark-rust-code.webp"
+  alt="An absolutely ripped dude holding two differently sized bricks and wondering which one is heavier. Fortunately, Rust's criterion.rs can help him statistically compare the two so that he can rigorously detect which is heavier."
+  height={928}
+  width={1232}
+/>
+
+The Rust ecosystem has a great tool for benchmarking [Criterion.rs](https://docs.rs/criterion/latest/criterion/index.html), ported from Haskell.
+
+Let's fire up a small test case and discover a few things we can do with it.
+
+## What we're going to benchmark
+
+In solving something this week I had to break down an integer into its digits. Of course, we can always use the base and divide; however, when writing this, I longed for something more idiomatic to the language.
+
+You know, something functional with iterators and all that - because that's how we Rustaceans rock! That said, I was also curious as to the overhead that this introduced, versus the arithmetic solution.
+
+### Breaking down digits
+
+- The arithmetic solution:
+
+  ```rust
+  pub fn get_digits_arithmetic(mut n: u32) -> Vec<u32> {
+      let mut digits = Vec::new();
+      while n > 0 {
+          digits.push(n % 10);
+          n /= 10;
+      }
+      digits.reverse();
+      digits
+  }
+  ```
+
+- The idiomatic version:
+
+  ```rust
+  pub fn get_digits_idiomatic(mut n: u32) -> Vec<u32> {
+      std::iter::from_fn(|| {
+          if n > 0 {
+              let digit = n % 10;
+              n /= 10;
+              Some(digit)
+          } else {
+              None
+          }
+      })
+      .collect::<Vec<_>>()
+      .into_iter()
+      .rev()
+      .collect()
+  }
+  ```
+
+Now we should test these and make sure they work before moving on.
+
+#### Super simple unit tests
+
+We'll do this right in our `lib.rs` file.
+
+```rust
+
+#[cfg(test)]
+mod tests {
+    use super::*;
+
+    #[test]
+    fn get_digits_arithmetic_works() {
+        let result = get_digits_arithmetic(20);
+        assert_eq!(result, vec![2, 0]);
+    }
+
+    #[test]
+    fn get_digits_idiomatic_works() {
+        let result = get_digits_idiomatic(20);
+        assert_eq!(result, vec![2, 0]);
+    }
+}
+```
+
+💥 `cargo test` passes like a charm.
+
+## Using criterion
+
+To use `criterion.rs` we need to add a `benches` directory and a `benchmark.rs` file inside of it, plus a `[[bench]]` directive in our `Cargo.toml`.
+
+Once we've got these we can rock `cargo bench` and get some stats on our functions above.
+
+### `benchmark.rs`
+
+So assuming the functions above are public, we can use the following to do statistical runtime comparison of the two functions.
+
+```rust
+use criterion::{black_box, criterion_group, criterion_main, Criterion};
+use criterion_benchmarks::{get_digits_arithmetic, get_digits_idiomatic};
+
+fn benchmark_digits(c: &mut Criterion) {
+    let test_number = 123456789;
+
+    // Benchmark the arithmetic implementation
+    c.bench_function("get_digits_arithmetic", |b| {
+        b.iter(|| get_digits_arithmetic(black_box(test_number)))
+    });
+
+    // Benchmark the idiomatic implementation
+    c.bench_function("get_digits_idiomatic", |b| {
+        b.iter(|| get_digits_idiomatic(black_box(test_number)))
+    });
+}
+
+criterion_group!(benches, benchmark_digits);
+
+criterion_main!(benches);
+```
+
+Here I've named my crate `criterion_benchmarks` - use whatever you rocked when running `cargo new --lib` prior.
+
+### Update `Cargo.toml`
+
+Now add the `[[bench]]` setup we need:
+
+```toml
+[[bench]]
+name = "benchmark"
+harness = false
+```
+
+And the library itself under `dev-dependencies`:
+
+```toml
+[dev-dependencies]
+criterion = { version = "0.4", features = ["html_reports"] }
+```
+> We'll use the HTML report feature soon!
+
+### Benchmark your heart out
+
+Now we can run our benchmarks with `cargo bench`. You should get something that looks similar to the following:
+
+```zsh
+➜  criterion_benchmarks git:(main) ✗ cargo bench
+   Compiling criterion_benchmarks v0.1.0 (/Users/thinkjrs/repos/learning-rust/criterion_benchmarks)
+    Finished `bench` profile [optimized] target(s) in 1.13s
+     Running unittests src/lib.rs (target/release/deps/criterion_benchmarks-7495786da99396e2)
+
+running 2 tests
+test tests::get_digits_arithmetic_works ... ignored
+test tests::get_digits_idiomatic_works ... ignored
+
+test result: ok. 0 passed; 0 failed; 2 ignored; 0 measured; 0 filtered out; finished in 0.00s
+
+     Running benches/benchmark.rs (target/release/deps/benchmark-3d6e7777710a0ab1)
+Benchmarking get_digits_arithmetic: Collecting 100 samples in estimated 5.0006 s (39M i
+get_digits_arithmetic   time:   [122.35 ns 125.08 ns 128.04 ns]
+                        change: [+8.2527% +12.946% +17.712%] (p = 0.00 < 0.05)
+                        Performance has regressed.
+Found 3 outliers among 100 measurements (3.00%)
+  1 (1.00%) high mild
+  2 (2.00%) high severe
+
+Benchmarking get_digits_idiomatic: Collecting 100 samples in estimated 5.0004 s (32M it
+get_digits_idiomatic    time:   [150.57 ns 155.18 ns 160.08 ns]
+                        change: [+1.1505% +6.5465% +12.033%] (p = 0.01 < 0.05)
+                        Performance has regressed.
+Found 2 outliers among 100 measurements (2.00%)
+  2 (2.00%) high mild
+```
+
+Sweet, eh?
+
+## Visualizations and tracking
+
+One of the awesome features of `criterion.rs` is that the benchmarks are tracked _and_ you can visualize them. I believe they call this "batteries included!"
+
+The killer feature here is that these are already done for you. On most systems, simply run the following to open your browser with the report:
+
+```sh
+open target/criterion/report/index.html
+```
+
+<Image
+  className="rounded-md shadow-md"
+  src="https://res.cloudinary.com/thinkjrs-dev/image/upload/v1734653383/thinkjrs.dev/Screenshot_2024-12-19_at_7.06.28_PM_tmreh7_m77tov.webp"
+  alt="A screengrab of the HTML report output by the Criterion.rs library for comparing and benchmarking Rust code."
+  height={778}
+  width={1200}
+/>
+
+
+### Baselines for tracking
+
+Another amazing feature is the ability to set a baseline against which to measure. Practically, let’s set a baseline to main for measuring feature performance. Sounds reasonable, no?
+
+
+```sh
+git checkout main
+cargo bench -- --save-baseline main
+```
+
+```sh
+git checkout feature/amazing-new-optimization
+cargo bench -- --save-baseline feature/amazing-new-optimization
+```
+
+Then run your benches as normal. And compare them with:
+
+```sh
+cargo bench -- --load-baseline new --baseline main
+```
+
+Sick, no?
+
+## Summing it all up
+
+Benchmarking in Rust is incredibly easy with `criterion.rs`.
+
+**Some highlights:**
+• Easy Setup: A quick addition to your Cargo.toml and benches directory gets you started.
+• Detailed Analysis: Criterion provides time measurements, performance regression checks, and outlier detection.
+• Visualizations: Automatically generated HTML reports let you track and compare benchmarks in a user-friendly format.
+• Baselines: Save and compare results across branches or optimizations, enabling informed decisions about performance improvements.
+
+Now you can ninja-tune your Rust code and feel like a real pro. LFG! 🚀
+
+<BlogNewsletterForm />