This directory defines a large suite of benchmarks for both the memchr and
memmem APIs in this crate. A selection of "competitor" implementations is
chosen for comparison. In general, benchmarks are meant to be a tool for
optimization. That's why there are so many: we want to be sure we get enough
coverage such that our benchmarks approximate real world usage. When some
benchmarks look a bit slower than we expect (for one reason or another), we can
use profiling tools to look at codegen and attempt to improve that case.

Because there are so many benchmarks, if you run all of them, you might want to
step away for a cup of coffee (or two). Therefore, the typical way to run them
is to select a subset. For example,

```
$ cargo bench -- 'memmem/krate/.*never.*'
```

runs all benchmarks for the memmem implementation in this crate with searches
that never produce any matches. This will still take a bit, but perhaps only a
few minutes.
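
To make the filter above concrete: the argument after `--` is a regular expression (note the `.*`) that is matched against each benchmark's full name. The sketch below imitates that selection with `grep -E` over a handful of names. Only the second name is taken from this README; the other three are hypothetical and made up for illustration, and `grep -E` is only an approximation of whatever regex dialect the benchmark harness uses.

```shell
# Benchmark names to filter. Only the second one appears in this README;
# the other three are hypothetical names invented for illustration.
names='memmem/krate/prebuilt/huge-en/never-example
memmem/krate/prebuiltiter/huge-en/common-one-space
memmem/other/prebuilt/huge-en/never-example
memchr1/krate/huge/never'

# The same pattern that is passed to `cargo bench --` above.
# grep -E stands in for the harness's own regex matching (an approximation).
filter='memmem/krate/.*never.*'

# Keep only the names that contain a match for the pattern.
selected=$(printf '%s\n' "$names" | grep -E "$filter")
printf '%s\n' "$selected"
```

Only the first name is selected. The second has no `never` in it, and the last two do not contain `memmem/krate/`, so a pattern like this narrows the run to one implementation and one class of search at the same time.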

Running a specific benchmark can be useful for profiling. For example, if you
want to see where `memmem/krate/prebuiltiter/huge-en/common-one-space` is
spending all of its time, you would first want to run it (to make sure the code
is compiled):

```
$ cargo bench -- memmem/krate/prebuiltiter/huge-en/common-one-space
```

And then run it under your profiling tool (I use `perf` on Linux):

```
$ perfr --callgraph cargo bench -- memmem/krate/prebuiltiter/huge-en/common-one-space --profile-time 3
```

Where
[`perfr` is my own wrapper around `perf`](https://github.com/BurntSushi/dotfiles/blob/master/bin/perfr),
and the `--profile-time 3` flag means, "just run the code for 3 seconds, but
don't do anything else." This makes the benchmark harness get out of the way,
which lets the profile focus as much as possible on the code being measured.

See the README in the `runs` directory for a bit more info on how to use
`critcmp` to look at benchmark data in a way that makes it easy to do
comparisons.