Replay (rehearsal) dataset generation for mitigating catastrophic forgetting during SFT. Instead of mixing public SFT datasets (which are distributionally mismatched), this pipeline reconstructs the ...