착쓰
[논문 리뷰] (CLIP) Learning Transferable Visual Models From Natural Language Supervision