Paper Walkthrough: X-CLIP, Expanding Language-Image Pretrained Models for General Video Recognition
