but long-term listening in a relaxed (not analytical) state is the real tell for me.
This statement nails it. I have often noticed that a quick switch between streaming services tells you very little to nothing. But if you relax and "take it in" in your regular listening mood, differences are easily felt OR missed. Applies to almost everything in listening tests.