Skip to content

ViT

Classification Target Model

What is ViT?

ViT is a classification model pre-trained on ImageNet-21k, developed by Google. You can train ViT classification models using Autodistill.

Read the full Autodistill documentation.

Read the ViT Autodistill documentation.

Installation

To use the ViT target model, you will need to install the following dependency:

pip3 install autodistill-vit

Quickstart

from autodistill_vit import ViT

target_model = ViT()

# train a model from a classification folder structure
target_model.train("./context_images_labeled/", epochs=200)

# run inference on the new model
pred = target_model.predict("./context_images_labeled/train/images/dog-7.jpg", conf=0.01)