We propose MaskCut approach to generate pseudo-masks for multiple objects in an image. CutLER can learn unsupervised object detectors and instance segmentors solely on ImageNet-1K. CutLER exhibits ...
[2024/4/23] We have added an audio-grounding feature that tracks the sound-making object within the video's soundtrack. [2023/5/12] We have authored a technical report for SAM-Track. [2023/5/7] We ...