Hong Kong University of Science and Technology
Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability
·4202 words·20 mins·
AI Generated
AI Applications
Autonomous Vehicles
Hong Kong University of Science and Technology
Vista: a novel driving world model achieving high-fidelity prediction and versatile controllability, outperforming state-of-the-art models in generalization and prediction accuracy.
VeXKD: The Versatile Integration of Cross-Modal Fusion and Knowledge Distillation for 3D Perception
·3369 words·16 mins·
AI Applications
Autonomous Vehicles
Hong Kong University of Science and Technology
VeXKD: A versatile framework boosts 3D perception by cleverly combining cross-modal fusion and knowledge distillation, improving single-modal student model accuracy without extra inference time.
UrbanKGent: A Unified Large Language Model Agent Framework for Urban Knowledge Graph Construction
·3215 words·16 mins·
AI Generated
AI Applications
Smart Cities
Hong Kong University of Science and Technology
UrbanKGent: A unified LLM agent framework revolutionizes urban knowledge graph construction, achieving significantly improved accuracy and efficiency.
UPS: Unified Projection Sharing for Lightweight Single-Image Super-resolution and Beyond
·2752 words·13 mins·
Computer Vision
Image Generation
Hong Kong University of Science and Technology
UPS: A novel algorithm for lightweight single-image super-resolution, decoupling feature extraction and similarity modeling for enhanced efficiency and robustness.
Unveiling and Mitigating Backdoor Vulnerabilities based on Unlearning Weight Changes and Backdoor Activeness
·3237 words·16 mins·
AI Generated
AI Applications
Security
Hong Kong University of Science and Technology
Two-Stage Backdoor Defense (TSBD) unveils and mitigates backdoor vulnerabilities by cleverly unlearning weight changes and suppressing backdoor neuron activeness, significantly improving the robustnes…
Uncovering, Explaining, and Mitigating the Superficial Safety of Backdoor Defense
·3401 words·16 mins·
AI Theory
Safety
Hong Kong University of Science and Technology
Current backdoor defenses, while effective at reducing attack success rates, are vulnerable to rapid re-learning. This work unveils this superficial safety, proposes a novel attack, and introduces a p…
UltraPixel: Advancing Ultra High-Resolution Image Synthesis to New Peaks
·3265 words·16 mins·
Computer Vision
Image Generation
Hong Kong University of Science and Technology
UltraPixel generates high-quality images at various resolutions (1K-6K) efficiently using cascade diffusion models, achieving state-of-the-art performance.
Training for Stable Explanation for Free
·2565 words·13 mins·
AI Theory
Interpretability
Hong Kong University of Science and Technology
R2ET: training for robust ranking explanations by an effective regularizer.
Towards Stable Representations for Protein Interface Prediction
·2364 words·12 mins·
AI Generated
Machine Learning
Representation Learning
Hong Kong University of Science and Technology
ATProt: Adversarial training makes protein interface prediction robust to flexibility!
Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection
·2705 words·13 mins·
Computer Vision
3D Vision
Hong Kong University of Science and Technology
Object-centric occupancy completion boosts 3D object detection accuracy by using temporal information from long sequences to precisely reconstruct object shapes, particularly for incomplete or distant…
Time-FFM: Towards LM-Empowered Federated Foundation Model for Time Series Forecasting
·2669 words·13 mins·
Machine Learning
Federated Learning
Hong Kong University of Science and Technology
TIME-FFM: a Federated Foundation Model empowers time series forecasting using pre-trained Language Models, tackling data scarcity and privacy concerns for superior few-shot and zero-shot predictions.
The Limits of Differential Privacy in Online Learning
·440 words·3 mins·
AI Theory
Privacy
Hong Kong University of Science and Technology
This paper reveals fundamental limits of differential privacy in online learning, demonstrating a clear separation between pure, approximate, and non-private settings.
The Implicit Bias of Adam on Separable Data
·1356 words·7 mins·
AI Theory
Optimization
Hong Kong University of Science and Technology
Adam’s implicit bias revealed: On separable data, Adam converges towards the maximum ℓ∞-margin solution, a finding contrasting with gradient descent’s ℓ2-margin preference. This polynomial-time conver…
Tackling Uncertain Correspondences for Multi-Modal Entity Alignment
·1671 words·8 mins·
Multimodal Learning
Vision-Language Models
Hong Kong University of Science and Technology
TMEA: A novel approach significantly boosts multi-modal entity alignment accuracy by effectively handling uncertain correspondences between modalities, improving data integration for diverse knowledge…
SubgDiff: A Subgraph Diffusion Model to Improve Molecular Representation Learning
·2691 words·13 mins·
Machine Learning
Representation Learning
Hong Kong University of Science and Technology
SubgDiff enhances molecular representation learning by incorporating substructural information into a diffusion model framework, achieving superior performance in molecular force predictions.
Spiking Neural Network as Adaptive Event Stream Slicer
·2956 words·14 mins·
Computer Vision
Object Detection
Hong Kong University of Science and Technology
SpikeSlicer: An adaptive event stream slicer using a spiking neural network (SNN) to efficiently split events for improved downstream processing in object tracking and recognition.
SpGesture: Source-Free Domain-adaptive sEMG-based Gesture Recognition with Jaccard Attentive Spiking Neural Network
·2466 words·12 mins·
AI Generated
AI Applications
Human-AI Interaction
Hong Kong University of Science and Technology
SpGesture: A source-free domain-adaptive sEMG gesture recognition system using a novel Jaccard Attentive Spiking Neural Network achieves real-time performance with high accuracy.
Should We Really Edit Language Models? On the Evaluation of Edited Language Models
·3638 words·18 mins·
AI Generated
Natural Language Processing
Large Language Models
Hong Kong University of Science and Technology
Language model editing’s limitations exposed: Scaling current methods leads to knowledge loss and compromised safety, urging research into more robust techniques.
RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling Large Language Models
·3132 words·15 mins·
AI Generated
Natural Language Processing
Large Language Models
Hong Kong University of Science and Technology
RouterDC: A query-based router trained via dual contrastive learning assembles multiple LLMs, significantly outperforming individual LLMs and existing routing methods on both in- and out-of-distributi…
RL-GPT: Integrating Reinforcement Learning and Code-as-policy
·2705 words·13 mins·
AI Applications
Robotics
Hong Kong University of Science and Technology
RL-GPT seamlessly integrates Large Language Models (LLMs) and Reinforcement Learning (RL) to create highly efficient agents mastering complex tasks in open-world environments.