Walle
Created page with "{{Model infobox
| hugging-face-uri = nlpconnect/vit-gpt2-image-captioning
| creator =
| type = Multimodal
| task = Image-to-Text
| library = PyTorch, Transformers
| dataset =
| language =
| paper =
| license = apache-2.0
| related-to = vision-encoder-decoder, image-captioning
| all-tags = Image-to-Text, PyTorch, Transformers, doi:10.57967/hf/0222, vision-encoder-decoder, image-captioning, License: apache-2.0
| all-lang-tags =
}}
==Model Description==
==Clone Model..."