Replay (rehearsal) dataset generation for mitigating catastrophic forgetting during SFT. Instead of mixing public SFT datasets (which are distributionally mismatched), this pipeline reconstructs the ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results