* print intermediate A/B results when not silent * print raw A/B results when in loud mode * add tests; handle missing metrics more gracefully * use less fancy section header on Windows