研究如何在训练中使用模型内部是合理的
Published on February 8, 2026 344 AM GMTThere seems to be a common belief in the AGI safety community that involving interpretability in the training process is the most forbidden technique, including recent criticism of Goodfire for investing in this area。I dont know if it will be net positive to u...