Skip to content
This repository has been archived by the owner on Dec 19, 2023. It is now read-only.
/ PyStats2MD Public archive

Generate Markdown tables & Plotly charts from Google Benchmark outputs or directly in Python!

Notifications You must be signed in to change notification settings

unum-cloud/PyStats2MD

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

32 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

pystats2md

A simple library to export benchmark results, compatiable with Google Benchmark CSV and JSON output formats. Allows building nice markdown tables and charts in just a few lines of code.

Features

  • Filtering and grouping data from benchmarks files.
  • Rendering GitHub-Markdown tables with easy formatting.
  • Generating and embedding plots with Plotly in just a couple of lines.
  • Included pure-Python benchmarking tool with same output file formats as Google Benchmark.
  • Embeds the source code of the benchmark itself into the log files and reports!

Installation

pip install pystats2md

Examples

Example 1

Let's assume we are benchmarking databases. We have an array of measurements and each of them is a dictionary with at least 3 keys: database_name, benchmark_name and operations_per_second. Let's generate a report.

from pystats2md import *

f = StatsFile('benchmarks.json')
r = Report()
r.add('#### Following section was auto-generated')
r.add(f.table(
    rows='database_name', 
    cols='benchmark_name', 
    cells='operations_per_second'
))
r.add(f.plot(
    title='DB Performance (Ops/Sec)', 
    variants='database_name', 
    groups='benchmark_name', 
    values='operations_per_second'
))
r.print_to('example/README.md')

DB Performance (Ops/Sec)

Example 2

A more complex example. The previous example assumes we have just 1 entry for every combination of database_name, benchmark_name. If it's not the case, we can reduce them in-place and also add all kinds of supplementary columns to make results more visually appealing. Just don't get carried away 😆

r.add('#### Following section was auto-generated')
r.add(f \
    .subset() \
    .group('database_name', 'benchmark_name', **{ 'operations_per_second': Aggregation.take_mean }) \
    .table(rows='database_name', cols='benchmark_name', cells='operations_per_second') \
    .add_ranking(column='insert') \
    .add_emoji(column='insert', log_scale=False) \
    .add_gains(column='insert', baseline='SQLite')
)
# If no context is specified in the input file - we will generate one on the fly - assuming it's the same device.
r.add_current_device_specs()

Example 3

If you need even more control, you can manually build StatsSubset from StatsFile and export it into StatsPlot or StatsTable. Such workflows can be found here.

Related Projects

  • PyWrappedDBs lib for persistent graph data structures us pystats2md to render reports.
  • Unum high-performance GPU-accelerated general-purpose framework.

TODO

  • Add special class MeanAndStdev which when rendered shows the standard deviation of preceding reduction.
  • Add inline HTML color-coding for standard deviation size.
  • Add astriks and subtitles for results that have much older timestamps.

About

Generate Markdown tables & Plotly charts from Google Benchmark outputs or directly in Python!

Topics

Resources

Stars

Watchers

Forks