Skip to content
This repository has been archived by the owner on Apr 6, 2022. It is now read-only.

A tool that analyses Gradle Profiler benchmarks with the Paired T-tests ๐Ÿ“Š

License

Notifications You must be signed in to change notification settings

dotanuki-labs/gradle-profiler-pttest

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

โš ๏ธ NO LONGER MAINTENED โš ๏ธ

This project is deprecated. If you are still interested in solution, consider forking this repo.

Paired T-tests for Gradle Benchmarks

Flake8 Maintainability codecov PyPI Main License

Context

Complete blog post to come. Stay tunned

gradle-profiler-pttest can analyse the outcomes of two benchmarks generated by Gradle Profiler with the Paired T-test statistical technique.

The goal is provide a super easy way to compare two benchmarks for Gradle builds - for the same task - without being mislead by simple means, since we are leveraging on a more robust statistical evidence on top of the outcomes.

This tool is built on top of pingouin, an opinionated Statistics library which leverages NumPy, Pandas and SciPy. Among other things, gradle-profiler-pttest features :

  • An opinionated hyphotesis test (left-tailed) where we want to check if modified build conditions (h1) have a mean statistically smaller than the one we get from baseline build conditions (h0), which means better execution times given the modifications applied
  • Auto-correction for benchmark samples with different sizes

Note that the ideal size for samples should be a small one - ideally between 10 and 30 measured builds - in order to make sense of T-student analysis.

Installing

Install from your CLI with pip

pip install gradle-profiler-pttest

Requires Python 3.8.5 or newer.

When running this tool on MacOS Catalina or newer, please check instructions in order to have SciPy properly installed in your local machine.

Using

  • Run the benchmarks with Gradle profiler for the status quo (baseline) and for the modifications applied the your Gradle project (modified)

  • Supply the generated CSV files to gradle-profiler-pttest

gradle-profiler-pttest \
	-b <path/to/baseline/benchmark.csv> \
	-m <path/to/modified/benchmark.csv>
  • Profit results

Limitations

Right now gradle-profiler-pttest supports only one Gradle task per supplied benchmark sample, taking the first task executed as reference for the analysis given a multi-task benchmarked scenario.

Contributing

  • Ensure you have Python 3.8.5 or newer installed
  • Ensure you have flake8 installed and supported in your text editor / IDE
  • Ensure you have Poetry installed
  • Check the contribution guidelines
  • Make sure you have a green build
make flake8
make test
  • Submit your PR ๐Ÿ”ฅ

Credits

Author

Coded by Ubiratan Soares (follow me on Twitter)

License

The MIT License (MIT)

Copyright (c) 2020 Dotanuki Labs

Permission is hereby granted, free of charge, to any person obtaining a copy of
this software and associated documentation files (the "Software"), to deal in
the Software without restriction, including without limitation the rights to
use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of
the Software, and to permit persons to whom the Software is furnished to do so,
subject to the following conditions:

The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS
FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR
COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER
IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN
CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.