criterion performance measurements

overview

want to understand this report?

readBaseline

270
275
280
285
290
295
300
305
310
readBaseline time densities
mean
2
3
4
5
1 iters
500
750
0 s
250 ms
1 s
1.25
1.5
regression
readBaseline times
lower bound estimate upper bound
OLS regression 233 ms 282 ms 333 ms
R² goodness-of-fit 0.947 0.990 1.000
Mean execution time 272 ms 281 ms 301 ms
Standard deviation 538 μs 15.5 ms 20.0 ms

Outlying measurements have moderate (16.0%) effect on estimated standard deviation.

readText

10.4
10.6
10.8
11.0
11.2
11.4
readText time densities
mean
10
15
20
25
30
5 iters
100
150
200
250
300
350
0 s
50 ms
regression
readText times
lower bound estimate upper bound
OLS regression 10.7 ms 10.9 ms 11.1 ms
R² goodness-of-fit 0.996 0.998 1.000
Mean execution time 10.8 ms 10.8 ms 10.9 ms
Standard deviation 162 μs 228 μs 329 μs

Outlying measurements have slight (3.3%) effect on estimated standard deviation.

readByteString

9
10
8.5
9.5
readByteString time densities
mean
10
15
20
25
30
35
5 iters
100
150
200
250
300
350
0 s
50 ms
regression
readByteString times
lower bound estimate upper bound
OLS regression 8.61 ms 8.90 ms 9.44 ms
R² goodness-of-fit 0.981 0.990 0.999
Mean execution time 8.84 ms 8.94 ms 9.13 ms
Standard deviation 230 μs 380 μs 588 μs

Outlying measurements have moderate (17.8%) effect on estimated standard deviation.

understanding this report

In this report, each function benchmarked by criterion is assigned a section of its own. The charts in each section are active; if you hover your mouse over data points and annotations, you will see more details.

Under the charts is a small table. The first two rows are the results of a linear regression run on the measurements displayed in the right-hand chart.

We use a statistical technique called the bootstrap to provide confidence intervals on our estimates. The bootstrap-derived upper and lower bounds on estimates let you see how accurate we believe those estimates to be. (Hover the mouse over the table headers to see the confidence levels.)

A noisy benchmarking environment can cause some or many measurements to fall far from the mean. These outlying measurements can have a significant inflationary effect on the estimate of the standard deviation. We calculate and display an estimate of the extent to which the standard deviation has been inflated by outliers.