ALBEF: Learning vision-language representations with momentum distillation

Origin blog.csdn.net/zag666/article/details/130290466