"The most reliable method is to listen."
The most reliable method is to listen. BLIND. Repeated ABX testing is required. A consistent score of 95% or better indicates a valid distinction. All else is entirely invalid. PERIOD.
Blind listening shows whether there is an audible change or not. Measurements are unnecessary: we are listening, not measuring. That also sidesteps any objection that measurements do not capture everything.
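For anyone who wants to put a number on an ABX run, here is a rough sketch of how likely a given score is from pure guessing. It assumes each trial is an independent 50/50 guess when no difference is audible; the function name `abx_p_value` and the example scores are made up for illustration and are not from any particular ABX tool.

```python
from math import comb


def abx_p_value(correct: int, trials: int) -> float:
    """Chance of getting at least `correct` right out of `trials` by guessing.

    Assumes every trial is an independent coin flip (p = 0.5), which is
    the usual model for a listener who hears no difference.
    """
    if trials <= 0 or not 0 <= correct <= trials:
        raise ValueError("need trials > 0 and 0 <= correct <= trials")
    # Count the outcomes with `correct` or more hits, out of 2**trials total.
    favourable = sum(comb(trials, k) for k in range(correct, trials + 1))
    return favourable / 2 ** trials


if __name__ == "__main__":
    # Hypothetical runs, for illustration only.
    for correct, trials in [(12, 16), (19, 20)]:
        p = abx_p_value(correct, trials)
        print(f"{correct}/{trials} correct: chance of guessing this well = {p:.6f}")
```

The smaller the result, the harder the score is to explain as luck. The same percentage correct means more over many trials than over a handful, which is why the repeated testing matters.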