论文略读:LoRA+: Efficient Low Rank Adaptation of Large Models

ICML 2024

从理论分析了LoRA最优解必然是右矩阵的学习率大于左矩阵的学习率(数量级差距是O(n))

猜你喜欢

转载自blog.csdn.net/qq_40206371/article/details/143428811