The acceptance tests are blackbox* tests that are expected to interact with resources on a real GitHub instance. They are built on top of the go-internal/testscript
package, which provides a framework for building tests for command line tools.
*Note: they aren't strictly blackbox because exec gh
commands delegate to a binary set up by testscript
that calls into ghcmd.Main
. However, since our real func main
is an extremely thin adapter over ghcmd.Main
, this is reasonable. This tradeoff avoids us building the binary ourselves for the tests, and allows us to get code coverage metrics.
The acceptance tests have a build constraint of //go:build acceptance
, this means that go test ./...
will continue to work without any modifications. The acceptance
tag must therefore be provided when running go test
.
The following environment variables are required:
The GitHub host to target e.g. github.com
The organization in which the acceptance tests can manage resources in. Consider using gh-acceptance-testing
on github.com
.
The token to use for authenticatin with the GH_ACCEPTANCE_HOST
. This must already have the necessary scopes for each test, and must have permissions to act in the GH_ACCEPTANCE_ORG
. See Effective Test Authoring for how tests must handle tokens without sufficient scopes.
It's recommended to create and use a Legacy PAT for this; Fine-Grained PATs do not offer all the necessary privileges required. You can use an OAuth token provided via gh auth login --web
and can provide it to the acceptance tests via GH_ACCEPTANCE_TOKEN=$(gh auth token --hostname <host>)
but this can be a bit confusing and annoying if you gh auth login
again without -s
and lose the required scopes.
A full example invocation can be found below:
GH_ACCEPTANCE_HOST=<host> GH_ACCEPTANCE_ORG=<org> GH_ACCEPTANCE_TOKEN=<token> go test -tags=acceptance ./acceptance
While writing a new test, it can be useful to target that specific script by providing the GH_ACCEPTANCE_SCRIPT
env var in combination with the -run
flag, for example:
GH_ACCEPTANCE_SCRIPT=pr-view.txtar GH_ACCEPTANCE_HOST=<host> GH_ACCEPTANCE_ORG=<org> GH_ACCEPTANCE_TOKEN=<token> go test -tags=acceptance -run ^TestPullRequests$ ./acceptance
To get code coverage, go test
can be invoked with coverpkg
and coverprofile
like so:
GH_ACCEPTANCE_HOST=<host> GH_ACCEPTANCE_ORG=<org> GH_ACCEPTANCE_TOKEN=<token> go test -tags=acceptance -coverprofile=coverage.out -coverpkg=./... ./acceptance
This section is to be expanded over time as we write more tests and learn more.
The following custom environment variables are made available to the scripts:
GH_HOST
: Set to value of theGH_ACCEPTANCE_ORG
env var provided togo test
ORG
: Set to the value of theGH_ACCEPTANCE_ORG
env var provided togo test
GH_TOKEN
: Set to the value of theGH_ACCEPTANCE_TOKEN
env var provided togo test
RANDOM_STRING
: Set to a length 10 random string of letters to help isolate globally visible resourcesSCRIPT_NAME
: Set to the name of thetestscript
currently running, without extension e.g.pr-view
HOME
: Set to the initial working directory. Required forgit
operationsGH_CONFIG_DIR
: Set to the initial working directory. Required forgh
operations
Due to the //go:build acceptance
build constraint, some functionality is limited because gopls
isn't being informed about the tag. To resolve this, set the following in your settings.json
:
"gopls": {
"buildFlags": [
"-tags=acceptance"
]
},
You can install the txtar
or vscode-testscript
extensions to get syntax highlighting.
When tests fail they fail like this:
➜ go test -tags=acceptance ./acceptance
--- FAIL: TestPullRequests (0.00s)
--- FAIL: TestPullRequests/pr-merge (11.07s)
testscript.go:584: WORK=/private/var/folders/45/sdnm1hp10nj1s9q57dp3bc5h0000gn/T/go-test-script2778137936/script-pr-merge
# Use gh as a credential helper (0.693s)
# Create a repository with a file so it has a default branch (1.155s)
# Defer repo cleanup (0.000s)
# Clone the repo (1.551s)
# Prepare a branch to PR with a single file (1.168s)
# Create the PR (1.903s)
# Check that the file doesn't exist on the main branch (0.059s)
# Merge the PR (2.426s)
# Check that the state of the PR is now merged (0.571s)
# Pull and check the file exists on the main branch (1.074s)
# And check we had a merge commit (0.462s)
> exec git show HEAD
[stdout]
commit 85d32c1a83ace270f6754c61f3f7e14956be0a47
Author: William Martin <[email protected]>
Date: Fri Oct 11 15:23:56 2024 +0200
Add file.txt
diff --git a/file.txt b/file.txt
new file mode 100644
index 0000000..7449899
--- /dev/null
+++ b/file.txt
@@ -0,0 +1 @@
+Unimportant contents
> stdout 'Merge pull request #1'
FAIL: testdata/pr/pr-merge.txtar:42: no match for `Merge pull request #1` found in stdout
This is generally enough information to understand why a test has failed. However, we can get more information by providing the -v
flag to go test
, which turns on verbose mode and shows each command and any associated stdio
.
Warning
Verbose mode dumps the testscript
environment variables, including the GH_TOKEN
, so be careful.
By default testscript
removes the directory in which it was running the script, and if you've been a conscientious engineer, you should be cleaning up resources using the defer
statement. However, this can be an impediment to debugging. As such you can set GH_ACCEPTANCE_PRESERVE_WORK_DIR=true
and GH_ACCEPTANCE_SKIP_DEFER=true
to skip these cleanup steps.
This section is to be expanded over time as we write more tests and learn more.
The testscript
library creates a somewhat isolated environment for each script. Each script gets a directory with limited environment variables by default. As far as reasonable, we should look to write scripts that depend on nothing more than themselves, the GitHub resources they manage, and limited additional environmental injection from our own testscript
setup.
Here are some guidelines around test isolation:
- Favour duplication in test setup over abstracting a new
testscript
command - Favour a
testscript
owning an entire resource lifecycle over shared resource until we see a performance or rate limiting issue - Use the
RANDOM_STRING
env var for globally visible resources to avoid conflicts
Since these scripts are creating resources on a GitHub instance, we should try our best to cleanup after them. Use the defer
keyword to ensure a command runs at the end of a test even in the case of failure.
TODO: I believe tests should early exit if the correct scopes aren't in place to execute the entire lifecycle. It's extremely annoying if a defer
fails to clean up resources because there's no delete_repo
scope for example. However, I'm not sure yet whether this scope checking should be in the Go tests or in the scripts themselves. It seems very cool to understand required scopes for a script just by looking at the script itself.
https://bitfieldconsulting.com/posts/test-scripts