It doesn’t seem hard for a loudspeaker to produce this, though because it is akin to inter modulation distortion, a speaker with lower distortion would do better.
Would be nice if the article included an idea of the Hz of the CT relative to the other two. The publication abstract doesn't give much more information except to say the CT appears close to the violin resonance frequency.