Skip to main content

🏢 Idiap Research Institute

Toward Semantic Gaze Target Detection
·2529 words·12 mins· loading · loading
Multimodal Learning Vision-Language Models 🏢 Idiap Research Institute
Researchers developed a novel architecture for semantic gaze target detection, achieving state-of-the-art results by simultaneously predicting gaze target localization and semantic label, surpassing e…
MTGS: A Novel Framework for Multi-Person Temporal Gaze Following and Social Gaze Prediction
·3398 words·16 mins· loading · loading
AI Generated Computer Vision Video Understanding 🏢 Idiap Research Institute
MTGS: a unified framework jointly predicts gaze and social gaze (shared attention, mutual gaze) for multiple people in videos, achieving state-of-the-art results using a temporal transformer model and…