🏢 Hong Kong University of Science and Technology

Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability

26 September 2024·4202 words·20 mins· loading · loading

AI Generated AI Applications Autonomous Vehicles 🏢 Hong Kong University of Science and Technology

Vista: a novel driving world model achieving high-fidelity prediction and versatile controllability, outperforming state-of-the-art models in generalization and prediction accuracy.

VeXKD: The Versatile Integration of Cross-Modal Fusion and Knowledge Distillation for 3D Perception

26 September 2024·3369 words·16 mins· loading · loading

AI Applications Autonomous Vehicles 🏢 Hong Kong University of Science and Technology

VeXKD: A versatile framework boosts 3D perception by cleverly combining cross-modal fusion and knowledge distillation, improving single-modal student model accuracy without extra inference time.

UrbanKGent: A Unified Large Language Model Agent Framework for Urban Knowledge Graph Construction

26 September 2024·3215 words·16 mins· loading · loading

AI Generated AI Applications Smart Cities 🏢 Hong Kong University of Science and Technology

UrbanKGent: A unified LLM agent framework revolutionizes urban knowledge graph construction, achieving significantly improved accuracy and efficiency.

UPS: Unified Projection Sharing for Lightweight Single-Image Super-resolution and Beyond

26 September 2024·2752 words·13 mins· loading · loading

Computer Vision Image Generation 🏢 Hong Kong University of Science and Technology

UPS: A novel algorithm for lightweight single-image super-resolution, decoupling feature extraction and similarity modeling for enhanced efficiency and robustness.

Unveiling and Mitigating Backdoor Vulnerabilities based on Unlearning Weight Changes and Backdoor Activeness

26 September 2024·3237 words·16 mins· loading · loading

AI Generated AI Applications Security 🏢 Hong Kong University of Science and Technology

Two-Stage Backdoor Defense (TSBD) unveils and mitigates backdoor vulnerabilities by cleverly unlearning weight changes and suppressing backdoor neuron activeness, significantly improving the robustnes…

Uncovering, Explaining, and Mitigating the Superficial Safety of Backdoor Defense

26 September 2024·3401 words·16 mins· loading · loading

AI Theory Safety 🏢 Hong Kong University of Science and Technology

Current backdoor defenses, while effective at reducing attack success rates, are vulnerable to rapid re-learning. This work unveils this superficial safety, proposes a novel attack, and introduces a p…

UltraPixel: Advancing Ultra High-Resolution Image Synthesis to New Peaks

26 September 2024·3265 words·16 mins· loading · loading

Computer Vision Image Generation 🏢 Hong Kong University of Science and Technology

UltraPixel generates high-quality images at various resolutions (1K-6K) efficiently using cascade diffusion models, achieving state-of-the-art performance.

Training for Stable Explanation for Free

26 September 2024·2565 words·13 mins· loading · loading

AI Theory Interpretability 🏢 Hong Kong University of Science and Technology

R2ET: training for robust ranking explanations by an effective regularizer.

Towards Stable Representations for Protein Interface Prediction

26 September 2024·2364 words·12 mins· loading · loading

AI Generated Machine Learning Representation Learning 🏢 Hong Kong University of Science and Technology

ATProt: Adversarial training makes protein interface prediction robust to flexibility!

Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection

26 September 2024·2705 words·13 mins· loading · loading

Computer Vision 3D Vision 🏢 Hong Kong University of Science and Technology

Object-centric occupancy completion boosts 3D object detection accuracy by using temporal information from long sequences to precisely reconstruct object shapes, particularly for incomplete or distant…

Time-FFM: Towards LM-Empowered Federated Foundation Model for Time Series Forecasting

26 September 2024·2669 words·13 mins· loading · loading

Machine Learning Federated Learning 🏢 Hong Kong University of Science and Technology

TIME-FFM: a Federated Foundation Model empowers time series forecasting using pre-trained Language Models, tackling data scarcity and privacy concerns for superior few-shot and zero-shot predictions.

The Limits of Differential Privacy in Online Learning

26 September 2024·440 words·3 mins· loading · loading

AI Theory Privacy 🏢 Hong Kong University of Science and Technology

This paper reveals fundamental limits of differential privacy in online learning, demonstrating a clear separation between pure, approximate, and non-private settings.

The Implicit Bias of Adam on Separable Data

26 September 2024·1356 words·7 mins· loading · loading

AI Theory Optimization 🏢 Hong Kong University of Science and Technology

Adam’s implicit bias revealed: On separable data, Adam converges towards the maximum l∞-margin solution, a finding contrasting with gradient descent’s l2-margin preference. This polynomial-time conver…

Tackling Uncertain Correspondences for Multi-Modal Entity Alignment

26 September 2024·1671 words·8 mins· loading · loading

Multimodal Learning Vision-Language Models 🏢 Hong Kong University of Science and Technology

TMEA: A novel approach significantly boosts multi-modal entity alignment accuracy by effectively handling uncertain correspondences between modalities, improving data integration for diverse knowledge…

SubgDiff: A Subgraph Diffusion Model to Improve Molecular Representation Learning

26 September 2024·2691 words·13 mins· loading · loading

Machine Learning Representation Learning 🏢 Hong Kong University of Science and Technology

SubgDiff enhances molecular representation learning by incorporating substructural information into a diffusion model framework, achieving superior performance in molecular force predictions.

Spiking Neural Network as Adaptive Event Stream Slicer

26 September 2024·2956 words·14 mins· loading · loading

Computer Vision Object Detection 🏢 Hong Kong University of Science and Technology

SpikeSlicer: An adaptive event stream slicer using a spiking neural network (SNN) to efficiently split events for improved downstream processing in object tracking and recognition.

SpGesture: Source-Free Domain-adaptive sEMG-based Gesture Recognition with Jaccard Attentive Spiking Neural Network

26 September 2024·2466 words·12 mins· loading · loading

AI Generated AI Applications Human-AI Interaction 🏢 Hong Kong University of Science and Technology

SpGesture: A source-free domain-adaptive SEMG gesture recognition system using a novel Spiking Jaccard Attentive Neural Network achieves real-time performance with high accuracy.

Should We Really Edit Language Models? On the Evaluation of Edited Language Models

26 September 2024·3638 words·18 mins· loading · loading

AI Generated Natural Language Processing Large Language Models 🏢 Hong Kong University of Science and Technology

Language model editing’s limitations exposed: Scaling current methods leads to knowledge loss and compromised safety, urging research into more robust techniques.

RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling Large Language Models

26 September 2024·3132 words·15 mins· loading · loading

AI Generated Natural Language Processing Large Language Models 🏢 Hong Kong University of Science and Technology

RouterDC: A query-based router trained via dual contrastive learning assembles multiple LLMs, significantly outperforming individual LLMs and existing routing methods on both in- and out-of-distributi…

RL-GPT: Integrating Reinforcement Learning and Code-as-policy

26 September 2024·2705 words·13 mins· loading · loading

AI Applications Robotics 🏢 Hong Kong University of Science and Technology

RL-GPT seamlessly integrates Large Language Models (LLMs) and Reinforcement Learning (RL) to create highly efficient agents mastering complex tasks in open-world environments.