🏢 Yonsei University

DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding
·5447 words·26 mins
AI Generated 🤗 Daily Papers Computer Vision Action Recognition 🏢 Yonsei University
DisCoRD: Rectified flow decodes discrete motion tokens into continuous, natural movement, balancing faithfulness and realism.
MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation
·4014 words·19 mins
AI Generated 🤗 Daily Papers Multimodal Learning Vision-Language Models 🏢 Yonsei University
MaskRIS improves referring image segmentation by combining masking-based data augmentation with contextual learning, achieving state-of-the-art results.