Takeaways from a Close Reading of "Self-Distilled Reasoner: On-Policy Self-Distillation for Large Language Models"

Published: 2026/6/29 16:12:56
1. Even when reading a paper closely, first skim it once to map out its overall framework.
2. The overall framework of this paper