I agree with this when it's done as a blind listening session & the propensity to call it a "blind test" further conditions the participants to the idea that they are under scrutiny - to stick with examinations as mentioned above - it's a pass/fail multiple choice test rather than a discursive essay style test.
This factor, of second-guessing one's first thought, was revealed & discussed in the thread where ABX positive results were reported - it's a naturally arising stress factor for most people, any time an individual feels they are being tested or can be definitively exposed as being wrong in their choice - it's an ego thing. (This doesn't arise when one does a personal blind test)
Furthermore, the argument often used is that a sighted test which shows differences followed by a blind test which no longer reveals differences "proves" that blind testing is "more valid", "more revealing", "the truth" is obviously logically flawed.
What we have are sighted test which have a bias towards false positives followed by a blind test which have a bias towards false negatives. What conclusions can be drawn about contradictory results from these two flawed tests?
That is all that's being said - both tests are flawed in their own way. Yes, some can point out obvious bias possibilities in sighted "tests" but fail to recognise other biases in "blind tests"