
🏢 Pohang University of Science and Technology

ActFusion: a Unified Diffusion Model for Action Segmentation and Anticipation
·3953 words·19 mins
AI Generated Computer Vision Action Recognition 🏢 Pohang University of Science and Technology
ActFusion is a unified diffusion model that achieves state-of-the-art performance in both action segmentation and anticipation by jointly learning the visible and invisible parts of video sequences.
3D Equivariant Pose Regression via Direct Wigner-D Harmonics Prediction
·2707 words·13 mins
Computer Vision 3D Vision 🏢 Pohang University of Science and Technology
A novel SO(3)-equivariant network for 3D pose estimation that directly predicts Wigner-D harmonics, achieving state-of-the-art accuracy and efficiency.