Pragmatic and Effective Testing in Go

by bep about Go, Hugo

850 words, 4 minutes read.

Stub at the lowest practical level, if at all, and do not bend tests into unit tests when running the entire flow is simple and blistering fast.

Hugo, the popular static site generator, was released in version 0.17 Friday. And while the most eye-catching news in this release was the impressive improvement in speed in an already very speedy piece of software, the big new functional feature was native multilingual support.

I wrote the main bulk of the code in this release, too, and adding support for multiple languages was more or less a “core Hugo” rewrite.

Hugo already had a fair test line coverage, between 70 and 80 percent, but the tests did not provide enough confidence to support making big structural changes without a fair amount of manual testing.

Keeping full backwards compatibility became more of a testing challenge than anything else.

Stub at the lowest practical level

Or do not stub¹ at all, if possible, I might add. When unit-testing an isolated component, this can be as simple as passing the data it needs as function arguments.

When testing components that read from and write to disk or a database, the best solution isn’t always obvious.

It is possible to remove the dependencies on file systems and databases by providing test implementations of high level interfaces interface such as SomeDataStore.

File Type	Variants
Content	JSON YAML TOML Blackfriday Asciidoctor reStructuredText HTML Ace Go Amber
Config	JSON YAML TOML Blackfriday Asciidoctor reStructuredText HTML Ace Go Amber
Data	JSON YAML TOML Blackfriday Asciidoctor reStructuredText HTML Ace Go Amber
Language	JSON YAML TOML Blackfriday Asciidoctor reStructuredText HTML Ace Go Amber
Layout	JSON YAML TOML Blackfriday Asciidoctor reStructuredText HTML Ace Go Amber
Shortcode	JSON YAML TOML Blackfriday Asciidoctor reStructuredText HTML Ace Go Amber

Add to the mix that Hugo also supports live reloads and partial rebuilds triggered by filesystem events on a variety of platforms, and that most files can be provided by both the project and the theme, you get a massive test matrix.

Steve Francia, the founder of Hugo, lay the foundation some time ago with the introduction of Afero, a file system abstraction.

But even if now the result files were written to a proper file system and the content could be verified, the source files were not. They were either force-fed into the handler chain by a byte-slice-backed file source, or only tested in isolation.

Now every file operation in Hugo is backed by a virtual file system, and the integration tests are as close to the real deal as practically possible.

And these low-level tests matter. Even the most experienced developer can fail when checking if a file exists. This becomes glaringly relevant for applications that are supposed to run on (almost) any platform and operating system.

Nest Table-Driven Tests

And when building “the whole thing” is as cheap as a couple of milliseconds, you might as well do so many times, as in the example below for every configuration format:

func TestMultiSitesBuild(t *testing.T) {
	for _, config := range []struct {
		content string
		suffix  string
	}{
		{multiSiteTOMLConfig, "toml"},
		{multiSiteYAMLConfig, "yml"},
		{multiSiteJSONConfig, "json"},
	} {
		doTestMultiSitesBuild(t, config.content, config.suffix)
	}
}

Table-driven tests are encouraged and easy to write in Go. Exponentially adding more test-variants by nesting the loops can be very powerful, and a practical way to approach the test matrix outlined above.

You will get tests that overlap. But you will discover corner-cases you never would have thought existed. And the superfluous tests are cheap.

Do not force-write unit tests

Testing in isolation, proper unit tests, is a good thing when you can do so with ease, and you should build your code with that in mind.

But when the “unit of test” depends on a chain of preprocessing, going out of the way to run only that small part makes little sense. You’ll end up with tests that depends on a not-so-realistic synthetic data set or tests that look like this:

	prepareStep1()
	prepareStep2() 
	runUnit()

If you have a test setup that allows you to run the whole thing really fast, you might as well do that for most tests, and then verify the unit by narrowing the scope of the assertions.

And if you really need to limit what gets run, make it explicit in the production code. One way of doing this is by adding feature flags, as in the BuildCfg in Hugo:

type BuildCfg struct {
	//...
	// Skip the rendering. Useful in tests.
	SkipRender bool
	//...
}

bepsays

Pragmatic and Effective Testing in Go

Stub at the lowest practical level

Nest Table-Driven Tests

Do not force-write unit tests