【论文笔记】Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training 企业开发 2023-06-05 00:42 0 阅读 NoSuchKey 猜你喜欢