Papers: Difference between revisions

Revision as of 15:01, 5 February 2023

Attention Is All You Need - https://arxiv.org/abs/1706.03762 - - influential paper that introduced Transformer

OpenAI CLIP - https://arxiv.org/abs/2103.00020, https://openai.com/blog/clip/ - Learning Transferable Visual Models From Natural Language Supervision

Revision as of 15:01, 5 February 2023 (view source) Alpha5 (talk \| contribs) No edit summary ← Older edit		Revision as of 15:01, 5 February 2023 (view source) Alpha5 (talk \| contribs) No edit summary Newer edit →
Line 5:		Line 5:
	'''[[An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale]]''' - https://arxiv.org/abs/2010.11929 - [[Vision Transformer]] ([[ViT]])		'''[[An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale]]''' - https://arxiv.org/abs/2010.11929 - [[Vision Transformer]] ([[ViT]])

	'''[[OpenAI CLIP]]''' - https://arxiv.org/abs/2103.00020, https://openai.com/blog/clip/ - ~~Connecting Text and Images~~		'''[[OpenAI CLIP]]''' - https://arxiv.org/abs/2103.00020, https://openai.com/blog/clip/ - Learning Transferable Visual Models From Natural Language Supervision