The model's own unit tests are not currently executed as a part of PR testing, but they should be: This is not necessarily because we expect frequent work on the model itself, but this would be a great way to detect changes to WiredTiger's semantics early.
In this way, we can catch unintentional changes to the semantics early, and contributors will be required to update the model to match any intentional changes. Otherwise merging a semantics change without the corresponding model changes would create a fallout in Evergreen.