Yes, the requirements for the input/buffer stage are actually quite modest, not even a headphone amp, really. But the current fad for floating 12 or 15V supplies from a switching wall wart limits the output swing and current available. Barely enough for an op-amp (+/- 6 volts), plus losses from local sub-regulation.
It makes sense for the op-amp (or discrete circuit) to be fully isolated from the Class D switching module. The Class D module generates program-modulated switch noise ... it's effectively a low-frequency AM transmitter contained within the chassis. That's where the efficiency comes from, after all ... when there's no program material, switching is still going on at 200~500 kHz, but no power is going through the output devices, and very little is drawn from the support circuits. There's no residual Class A idling as there is with Class AB amplifiers. The output devices are either on or off, with only extremely brief switch transitions.
As program material level increases, more power and switch noise is created by the output switcher, and filtration demands on the speaker output and AC power supply increase. It is not trivial to silence a 200-watt AM transmitter in a can ... that energy is going to escape any way it can. Through the speaker wires (which make a great antenna), through the AC power cord, and even through the input jacks if it can find a way. Or leaks in the metal can itself. The adjacent linear audio equipment will have varying levels of tolerance for nearby RF emitters, which not usually tested in most test scenarios.
Oddly enough, this is an argument for input filtration using transformers to prevent RF emission on nearby equipment. I doubt many will do this, though, since designers that use Class D modules also like the very low distortion of those modules. In the Class D world, distortion specs (and the respect of the ASR crowd) make a difference,