
Papers: Difference between revisions

84 bytes added, 5 February 2023
no edit summary
Line 4:


'''[[An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale]]''' - https://arxiv.org/abs/2010.11929 - [[Vision Transformer]] ([[ViT]])
'''[[OpenAI CLIP]]''' - https://openai.com/blog/clip/ - Connecting Text and Images