Evergreen 'qa-tests' frequently timeout and error. We need to determine why and get tests back to green. Questions:
- Is something hanging?
- Are tests just really long (given all the cluster spin up/down that goes on)?
- Something else?
If the problem is actually just that the tests take a long time, we should explore which can be broken out and parallelized.