목록slip: self-supervision meets language-image pre-training (1)

My Vision, Computer Vision