List: scaling up visual and vision-language representation learning with noisy text supervision (1)

My Vision, Computer Vision