vision-transformer-models
Here are 7 public repositories matching this topic...
Language:All
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
- Updated
Dec 16, 2025 - Python
Multi-label classification based on timm.
- Updated
Sep 13, 2021 - Jupyter Notebook
Florence-2 is a novel vision foundation model with a unified, prompt-based representation for a variety of computer vision and vision-language tasks.
- Updated
Jul 3, 2024 - Jupyter Notebook
Multi-label classification based on timm, and add SimCLR to timm.
- Updated
Sep 13, 2021 - Jupyter Notebook
Solution for NeurIPS 2023 - MedFM Challenge
- Updated
Sep 22, 2023 - Python
This project focuses on evaluating Convolutional Neural Networks (CNN) and Vision Transformers (ViT) for image classification tasks, specifically distinguishing between Asian elephants and African elephants.
- Updated
Apr 8, 2024 - Jupyter Notebook
Code for the base version of the the model vision transformer in pytorch.
- Updated
Mar 17, 2023 - Python
Improve this page
Add a description, image, and links to thevision-transformer-models topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thevision-transformer-models topic, visit your repo's landing page and select "manage topics."