🏢 Idiap Research Institute

Toward Semantic Gaze Target Detection

26 September 2024·2529 words·12 mins

Multimodal Learning Vision-Language Models 🏢 Idiap Research Institute

Researchers developed a novel architecture for semantic gaze target detection, achieving state-of-the-art results by simultaneously predicting the gaze target's location and its semantic label, surpassing e…

MTGS: A Novel Framework for Multi-Person Temporal Gaze Following and Social Gaze Prediction

26 September 2024·3398 words·16 mins

AI Generated Computer Vision Video Understanding 🏢 Idiap Research Institute

MTGS is a unified framework that jointly predicts gaze and social gaze (shared attention, mutual gaze) for multiple people in videos, achieving state-of-the-art results using a temporal transformer model and…