Yeah interesting! That was what I was after with the multiple dimensions of graphs we talked about. The overall graph of a benchmark would be plotted against git commit and then you could zoom in a see the individual run. Maybe even overlaying a number of them to look for patterns. This combined with results from multiple machines/node versions could further provide data for figuring out what is going on.