Computer Vision Models

From AI Wiki
See also: Computer Vision and Models
ModelHF NameCreatorTaskLibraryDatasetLanguagePaperRelated toLicense
All-In-One-Pixel modelPublicPrompts/All-In-One-Pixel-ModelPublicPromptsDiffusersCreativeml-openrail-m
BEiT (base-sized, fine-tuned on ImageNet-22k) modelmicrosoft/beit-base-patch16-224-pt22k-ft22kMicrosoftImage ClassificationPyTorch
JAX
Transformers
Imagenet
Imagenet-21k
2106.08254Apache-2.0
CLIP ViT base patch16 modelopenai/clip-vit-base-patch16OpenaiZero-Shot Image ClassificationPyTorch
JAX
Transformers
2103.00020
1908.04913
CLIP ViT base patch32 modelopenai/clip-vit-base-patch32OpenaiZero-Shot Image ClassificationPyTorch
TensorFlow
JAX
Transformers
2103.00020
1908.04913
CLIP ViT-B/32 - LAION-2B modellaion/CLIP-ViT-B-32-laion2B-s34B-b79KLaionPyTorch
OpenCLIP
1910.04867Mit
CLIP ViT-H-14 - LAION-2B modellaion/CLIP-ViT-H-14-laion2B-s32B-b79KLaionPyTorch
OpenCLIP
1910.04867Mit
CLIP modelopenai/clip-vit-large-patch14OpenaiZero-Shot Image ClassificationPyTorch
TensorFlow
JAX
Transformers
2103.00020
1908.04913
CLIP-ViT-large-patch14-336 modelopenai/clip-vit-large-patch14-336OpenaiZero-Shot Image ClassificationPyTorch
TensorFlow
Transformers
CLIPSeg modelCIDAS/clipseg-rd64-refinedCIDASImage SegmentationPyTorch
Transformers
2112.10003Apache-2.0
DETR (End-to-End Object Detection) with ResNet-101 backbone modelfacebook/detr-resnet-101MetaObject DetectionPyTorch
Transformers
Coco2005.12872Apache-2.0
DETR (End-to-End Object Detection) with ResNet-50 backbone modelfacebook/detr-resnet-50MetaObject DetectionPyTorch
Transformers
Coco2005.12872Apache-2.0
EimisAnimeDiffusion 1.0v modeleimiss/EimisAnimeDiffusion_1.0vEimissText-to-Image
Image-to-Image
DiffusersEnglishCreativeml-openrail-m
Fantasy Card Diffusion modelvolrath50/fantasy-card-diffusionVolrath50Text-to-Image
Image-to-Image
DiffusersEnglishCreativeml-openrail-m
Ghibli Diffusion modelnitrosocke/Ghibli-DiffusionNitrosockeText-to-Image
Image-to-Image
DiffusersEnglishCreativeml-openrail-m
MaskFormer ADE20k modelfacebook/maskformer-swin-large-adeMetaImage SegmentationPyTorch
Transformers
Scene parse 1502107.06278Other
MaskFormer Coco modelfacebook/maskformer-swin-large-cocoMetaImage SegmentationPyTorch
Transformers
Coco2107.06278Other
Midjourney style on Stable Diffusion modelsd-concepts-library/midjourney-styleSd-concepts-libraryMit
Nitro Diffusion modelnitrosocke/Nitro-DiffusionNitrosockeText-to-Image
Image-to-Image
DiffusersEnglishCreativeml-openrail-m
OWL-ViT modelgoogle/owlvit-base-patch32GoogleObject DetectionPyTorch
Transformers
2205.06230Apache-2.0
Redshift Diffusion modelnitrosocke/redshift-diffusionNitrosockeText-to-Image
Image-to-Image
DiffusersEnglishCreativeml-openrail-m
RuCLIP-ViT-base-patch32-224 modelsberbank-ai/ruclip-vit-base-patch32-224Sberbank-aiPyTorch
Transformers
Stable Diffusion Image Variations modellambdalabs/sd-image-variations-diffusersLambdalabsImage-to-ImageDiffusersChristophSchuhmann/improved aesthetics 6plusCreativeml-openrail-m
ViT For Age Classification modelnateraw/vit-age-classifierNaterawImage ClassificationPyTorch
Transformers
Fairface
ViT large patch14 CLIP 224.openai ft in12k in1k modeltimm/vit_large_patch14_clip_224.openai_ft_in12k_in1kTimmImage ClassificationPyTorch
Timm
Apache-2.0
Vision Transformer (base-sized) 224x224 modelgoogle/vit-base-patch16-224GoogleImage ClassificationPyTorch
TensorFlow
JAX
Transformers
Imagenet-21k
Imagenet-1k
2010.11929
2006.03677
Apache-2.0
Vision Transformer (base-sized) 384x384 modelgoogle/vit-base-patch16-384GoogleImage ClassificationPyTorch
TensorFlow
JAX
Transformers
Imagenet
Imagenet-21k
2010.11929
2006.03677
Apache-2.0
Vision Transformer (base-sized) patch-16-384 modelgoogle/vit-base-patch16-384GoogleImage ClassificationPyTorch
TensorFlow
JAX
Transformers
Imagenet
Imagenet-21k
2010.11929
2006.03677
Apache-2.0