Papers: Difference between revisions
No edit summary |
No edit summary |
||
Line 5: | Line 5: | ||
'''[[An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale]]''' - https://arxiv.org/abs/2010.11929 - [[Vision Transformer]] ([[ViT]]) | '''[[An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale]]''' - https://arxiv.org/abs/2010.11929 - [[Vision Transformer]] ([[ViT]]) | ||
'''[[OpenAI CLIP]]''' - https://openai.com/blog/clip/ - Connecting Text and Images | '''[[OpenAI CLIP]]''' - https://arxiv.org/abs/2103.00020, https://openai.com/blog/clip/ - Connecting Text and Images |
Revision as of 15:01, 5 February 2023
https://arxiv.org/abs/2301.13779 (FLAME: A small language model for spreadsheet formulas) - Small model specifically for spreadsheets by Miscrofot
Attention Is All You Need - https://arxiv.org/abs/1706.03762 - - influential paper that introduced Transformer
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale - https://arxiv.org/abs/2010.11929 - Vision Transformer (ViT)
OpenAI CLIP - https://arxiv.org/abs/2103.00020, https://openai.com/blog/clip/ - Connecting Text and Images