Open
Description
CI passes here but tests are actually failing. The failures are logged in Datadog. See below context - this is due to a retry mechanism. We should stop retrying as doing so compromises the feedback loop for engineering.
Maybe we're passing the wrong exit codes on Windows?
Posting @hugodutka's comment here:
We automatically retry failing tests now. If the test passes on the next run, we don't report the CI run as failed. The output isn't formatted great, but that's what happened here. The key line here is:
DONE 2 runs, 9735 tests, 229 skipped, 2 failures in 162.909s
2 runs means that the tests were run twice, and on the second run they passed.