This is great question. IMHO, we are all different, I associate the advantages of a high end system with accurate dynamics.
But let’s look at your definition for a moment. If a system can accurately portray voices, and acoustic instruments, it seems that it can just as accurately portray distortion that is intentionally woven into the musical fabric by the artist. I don’t think that will make you like the music any more, but it will convey the tonality that the artist intended.
If you are able to stream, pull up the Barbie soundtrack. You will hear a variety of rap, and hip-hop, and though it may not change your outlook, perhaps it will broaden it.