Image-to-Text Models
- See also: Multimodal Models and Tasks
Model | Name | User/Org | Task | Library | Dataset | Language | Paper | Related to | License |
---|---|---|---|---|---|---|---|---|---|
Nlpconnect/vit-gpt2-image-captioning model | Image-to-Text | PyTorch Transformers | Vision-encoder-decoder Image-captioning | Apache-2.0 |