Skip to content


Classification Base Model

What is RemoteCLIP?

RemoteCLIP is a vision-language CLIP model trained on remote sensing data. According to the RemoteCLIP README:

RemoteCLIP outperforms previous SoTA by 9.14% mean recall on the RSICD dataset and by 8.92% on RSICD dataset. For zero-shot classification, our RemoteCLIP outperforms the CLIP baseline by up to 6.39% average accuracy on 12 downstream datasets.


To use RemoteCLIP with autodistill, you need to install the following dependency:

pip3 install autodistill-remote-clip


from autodistill_remote_clip import RemoteCLIP
from autodistill.detection import CaptionOntology

# define an ontology to map class names to our RemoteCLIP prompt
# the ontology dictionary has the format {caption: class}
# where caption is the prompt sent to the base model, and class is the label that will
# be saved for that caption in the generated annotations
# then, load the model
base_model = RemoteCLIP(
            "airport runway": "runway",
            "countryside": "countryside",

predictions = base_model.predict("runway.jpg")



This project is covered under an Apache 2.0 license.