Skip to main content

🏢 Sun Yat-Sen University

UniFL: Improve Latent Diffusion Model via Unified Feedback Learning
·2732 words·13 mins· loading · loading
Computer Vision Image Generation 🏢 Sun Yat-Sen University
UniFL: Unified Feedback Learning revolutionizes latent diffusion models by improving image quality, aesthetics, and inference speed through a unified feedback learning framework, surpassing existing m…
State Space Models on Temporal Graphs: A First-Principles Study
·1794 words·9 mins· loading · loading
AI Generated Machine Learning Deep Learning 🏢 Sun Yat-Sen University
GRAPHSSM: a novel graph state space model efficiently captures temporal graph dynamics, overcoming limitations of existing sequence models.
Real-time Stereo-based 3D Object Detection for Streaming Perception
·2407 words·12 mins· loading · loading
Computer Vision Object Detection 🏢 Sun Yat-Sen University
StreamDSGN: a real-time stereo 3D object detection framework significantly boosts streaming perception accuracy by leveraging historical information, a feature-flow fusion method, and a motion consist…
PIVOT-R: Primitive-Driven Waypoint-Aware World Model for Robotic Manipulation
·3082 words·15 mins· loading · loading
Multimodal Learning Vision-Language Models 🏢 Sun Yat-Sen University
PIVOT-R, a novel primitive-driven waypoint-aware world model, significantly boosts robotic manipulation performance and efficiency via an asynchronous hierarchical executor.
Neural Combinatorial Optimization for Robust Routing Problem with Uncertain Travel Times
·2186 words·11 mins· loading · loading
AI Theory Optimization 🏢 Sun Yat-Sen University
Neural networks efficiently solve robust routing problems with uncertain travel times, minimizing worst-case deviations from optimal routes under the min-max regret criterion.
CHASE: Learning Convex Hull Adaptive Shift for Skeleton-based Multi-Entity Action Recognition
·2356 words·12 mins· loading · loading
Computer Vision Action Recognition 🏢 Sun Yat-Sen University
CHASE: A novel method for skeleton-based multi-entity action recognition that cleverly adapts skeleton positions to minimize data bias and boost accuracy.
Bayesian Domain Adaptation with Gaussian Mixture Domain-Indexing
·2584 words·13 mins· loading · loading
AI Generated Machine Learning Transfer Learning 🏢 Sun Yat-Sen University
GMDI: a novel Bayesian domain adaptation algorithm significantly improves adaptation by dynamically modeling domain indices using Gaussian Mixture Models, outperforming state-of-the-art methods.
AttnDreamBooth: Towards Text-Aligned Personalized Text-to-Image Generation
·4145 words·20 mins· loading · loading
Multimodal Learning Vision-Language Models 🏢 Sun Yat-Sen University
AttnDreamBooth: A novel approach to text-to-image generation that overcomes limitations of prior methods by separating learning processes, resulting in significantly improved identity preservation and…
An Offline Adaptation Framework for Constrained Multi-Objective Reinforcement Learning
·2513 words·12 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 Sun Yat-Sen University
This work introduces PDOA, an offline adaptation framework for constrained multi-objective RL, using demonstrations instead of manually designed preferences to infer optimal policies while satisfying …
A Swiss Army Knife for Heterogeneous Federated Learning: Flexible Coupling via Trace Norm
·2684 words·13 mins· loading · loading
Machine Learning Federated Learning 🏢 Sun Yat-Sen University
FedSAK, a novel federated multi-task learning framework, flexibly handles data, model, and task heterogeneity using tensor trace norm to learn correlations among client models, achieving superior perf…