🏢 Idiap Research Institute
Toward Semantic Gaze Target Detection
·2529 words·12 mins·
loading
·
loading
Multimodal Learning
Vision-Language Models
🏢 Idiap Research Institute
Researchers developed a novel architecture for semantic gaze target detection, achieving state-of-the-art results by simultaneously predicting gaze target localization and semantic label, surpassing e…
MTGS: A Novel Framework for Multi-Person Temporal Gaze Following and Social Gaze Prediction
·3398 words·16 mins·
loading
·
loading
AI Generated
Computer Vision
Video Understanding
🏢 Idiap Research Institute
MTGS: a unified framework jointly predicts gaze and social gaze (shared attention, mutual gaze) for multiple people in videos, achieving state-of-the-art results using a temporal transformer model and…