Yeah, I've been thinking about the different hardware aspect as well. If we share the output of the benchmark in a git repo, you would be able to see the performance on different machines, and easily see who the people were. There is also the node version aspect that we need to take into account.
Also this should be quite easy to turn into automated test reporting as well.