Skip to content
/ SegTTO Public

Official Implementation of "Test-Time Optimization for Domain Adaptive Open Vocabulary Segmentation"

Notifications You must be signed in to change notification settings

UlinduP/SegTTO

Repository files navigation

Test-Time Optimization for Domain Adaptive Open Vocabulary Segmentation

Ulindu De Silva*, Didula Samaraweera*, Sasini Wanigathunga*, Kavindu Kariyawasam*, Kanchana Ranasinghe, Muzammal Naseer, Ranga Rodrigo

*Equal Contribution

Release Notes

  • [2024/01/02] 🔥 Seg-TTO is out! We release the paper and code for inference and deployment. Seg-TTO achieves SOTA results on supervised and unsupervised open-vocabulary semantic segmentation.

Introduction

We present Seg-TTO, a plug-in-play module focusing on segmentation-specific test time prompt tuning and visual attributes to address the lack of knowledge of vision language models in domain-specific datasets.

For further details and visualization results, please check out our [paper TODO] and our [project page TODO].

Installation

Please follow this to setup environment for CAT-Seg version and this to setup environment for CLIP-DINOiser version of Seg-TTO.

Data Preparation

Please follow dataset preparation.

Demo

If you want to try your own images locally, please try cat seg demo and clip dinoiser demo.

Evaluation

Please follow this to evaluate CAT-Seg version and this to evaluate CLIP-DINOiser version of Seg-TTO.

Evaluation Results


Figure 2: Zero-Shot Semantic Segmentation on Out-of-Domain Datasets

Figure 3: Zero-Shot Unsupervised Semantic Segmentation on Out-of-Domain Datasets

Pretrained Models

We provide pre-trained weights for our models reported in the paper. All of the models were evaluated with two 24GB NVIDIA RTX A5000 or 16GB NVIDIA Quadro RTX 5000 and can be reproduced with the evaluation script above. Please note that the results may vary slightly with installed module versions.

Name CLIP General Earth Monitoring Medical Sciences Engineering Agri. and Biology Download
CLIP-DINO-TTO ViT-B/16 25.38 27.92 47.87 35.6 31.35 ckpt 
CAT-Seg-B-TTO ViT-B/16 35.75 35.61 45.64 29.44 29.66 ckpt 
CAT-Seg-L-TTO ViT-L/14 41.52 40.59 51.4 27.71 38.4 ckpt 

Acknowledgement

We would like to acknowledge the contributions of public projects CAT-Seg, MESS and CLIP-Dinoiser, whose code has been utilized in this repository.

Citing Seg-TTO:

@misc{desilva2025SegTTO,
      title={Test-Time Optimization for Domain Adaptive Open Vocabulary Segmentation}, 
      author={Ulindu De Silva and Didula Samaraweera and Sasini Wanigathunga and Kavindu Kariyawasam and Kanchana Ranasinghe and Muzammal Naseer and Ranga Rodrigo},
      year={2025},
}

About

Official Implementation of "Test-Time Optimization for Domain Adaptive Open Vocabulary Segmentation"

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •