🏢 Meta FAIR

Cluster and Predict Latent Patches for Improved Masked Image Modeling
·7222 words·34 mins
AI Generated · 🤗 Daily Papers · Computer Vision · Image Segmentation · 🏢 Meta FAIR
CAPI is a masked image modeling framework that improves self-supervised visual representation learning by predicting latent clusterings, achieving state-of-the-art ImageNet accuracy and segmentation mIoU.