Papers: Difference between revisions

34 bytes added, 5 February 2023
no edit summary
Line 5: Line 5:
'''[[An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale]]''' - https://arxiv.org/abs/2010.11929 - [[Vision Transformer]] ([[ViT]])
'''[[An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale]]''' - https://arxiv.org/abs/2010.11929 - [[Vision Transformer]] ([[ViT]])


'''[[OpenAI CLIP]]''' - https://openai.com/blog/clip/ - Connecting Text and Images
'''[[OpenAI CLIP]]''' - https://arxiv.org/abs/2103.00020, https://openai.com/blog/clip/ - Connecting Text and Images