本仓库提供用于训练 RModel 的脚本。该 RModel 用于作为大型目标模型的专家选择代理(MoE gate proxy),通过学习 ShareGPT ...
ai-infra-career / resources / books / 01-Programming-Massively-Parallel-Processors-PMMP-4th.pdf 712sir add: resources directory with books, papers, and curated links 9293f34 · last month ...