Erik, yes speakers interact with the room but that has nothing to do with the transient response on the speaker.
Mike, unfortunately regular dynamic drivers are a poor impedance match to air. They have to work much harder to get the job done. ESLs and Horns do not have this problem to near the degree.
ESLs and horns are generally described as being very detailed. They also have better transient response, association or causation. I would say the later. Yes, a speaker with a lighter moving system could have better transient response assuming the motor was designed correctly.
Andy, resolution and transient response are very closely related. By dynamic I do not mean loud. I mean snap.