Readable, robust benchmarking: percentile-first comparisons + retrying runner | saasbrowser.ai