Papers: Difference between revisions

Revision as of 15:06, 5 February 2023

Attention Is All You Need - https://arxiv.org/abs/1706.03762 - influential paper that introduced the Transformer

Memorizing Transformers - https://arxiv.org/abs/2203.08913

OpenAI CLIP - https://arxiv.org/abs/2103.00020, https://openai.com/blog/clip/ - Learning Transferable Visual Models From Natural Language Supervision

Transformer-XL - https://arxiv.org/abs/1901.02860 - Attentive Language Models Beyond a Fixed-Length Context

@@ Line 3: / Line 3: @@
 '''[[Attention Is All You Need]]''' - https://arxiv.org/abs/1706.03762 - - influential paper that introduced [[Transformer]]
-'''[[An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale]]''' - https://arxiv.org/abs/2010.11929 - [[Vision Transformer]] ([[ViT]])
+'''[[An Image is Worth 16x16 Words]]''' - https://arxiv.org/abs/2010.11929 - Transformers for Image Recognition at Scale - [[Vision Transformer]] ([[ViT]])
+'''[[Language Models are Few-Shot Learners]]''' - https://arxiv.org/abs/2005.14165 - [[GPT]]
 '''[[Memorizing Transformers]]''' - https://arxiv.org/abs/2203.08913 -