I'm with SPXY here, (he says putting his architectural radiosity solution to one side).
The ABX assumes that we know what differences to listen out for. There may be differences, but we may not actively be aware of them and as such we miss them.
Assume you are presented with two images on screen, exactly the same apart from one pixel. Chances are you won't see that pixel, but once it has been pointed out to you, you will use its presence to identify the two images each and every time.
Listening to musical extracts is like being shown picture flash cards, you can't necessarily take it all in immediately. You may become reasonably adept at identifying gross differences quite quickly, as with the flash cards, but only greater long term exposure will help you develop the level of insight required to discern the smallest differences in musical presentation.
I'm all for blind tests, there really is no other way, but I do think they should be extended blind tests. 5 seconds of ABX tells you precious little at the fine end of the scale.