๐ข Beihang University
Why Go Full? Elevating Federated Learning Through Partial Network Updates
ยท3064 wordsยท15 minsยท
loading
ยท
loading
AI Generated
Machine Learning
Federated Learning
๐ข Beihang University
FedPart boosts federated learning by updating only parts of the network, solving the layer mismatch problem, and achieving faster convergence with higher accuracy.
Uncovering the Redundancy in Graph Self-supervised Learning Models
ยท2804 wordsยท14 minsยท
loading
ยท
loading
AI Generated
Machine Learning
Self-Supervised Learning
๐ข Beihang University
Graph self-supervised learning models surprisingly exhibit high redundancy, allowing for significant parameter reduction without performance loss. A novel framework, SLIDE, leverages this discovery fโฆ
QUEST: Quadruple Multimodal Contrastive Learning with Constraints and Self-Penalization
ยท1744 wordsยท9 minsยท
loading
ยท
loading
Multimodal Learning
Vision-Language Models
๐ข Beihang University
QUEST: Quadruple Multimodal Contrastive Learning tackles feature suppression by using quaternion embedding to extract unique information while penalizing excessive shared information influence, achievโฆ
Long-Tailed Out-of-Distribution Detection via Normalized Outlier Distribution Adaptation
ยท2335 wordsยท11 minsยท
loading
ยท
loading
Computer Vision
Object Detection
๐ข Beihang University
AdaptOD: a novel approach for robust OOD detection in long-tailed recognition, dynamically adapting outlier distributions to true OOD distributions using a dual-normalized energy loss for improved accโฆ
LLMs as Zero-shot Graph Learners: Alignment of GNN Representations with LLM Token Embeddings
ยท1927 wordsยท10 minsยท
loading
ยท
loading
Natural Language Processing
Large Language Models
๐ข Beihang University
TEA-GLM leverages LLMs for zero-shot graph learning by aligning GNN representations with LLM token embeddings, achieving state-of-the-art performance on unseen datasets and tasks.
How to Use Diffusion Priors under Sparse Views?
ยท2930 wordsยท14 minsยท
loading
ยท
loading
Computer Vision
3D Vision
๐ข Beihang University
Inline Prior Guided Score Matching (IPSM) improves sparse-view 3D reconstruction by leveraging visual inline priors from pose relationships to rectify rendered image distribution and effectively guideโฆ
From Text to Trajectory: Exploring Complex Constraint Representation and Decomposition in Safe Reinforcement Learning
ยท3972 wordsยท19 minsยท
loading
ยท
loading
AI Generated
Machine Learning
Reinforcement Learning
๐ข Beihang University
TTCT translates natural language constraints into effective training signals for safe reinforcement learning, enabling agents to learn safer policies with lower violation rates and zero-shot transfer โฆ
Exploring Structured Semantic Priors Underlying Diffusion Score for Test-time Adaptation
ยท2718 wordsยท13 minsยท
loading
ยท
loading
AI Generated
Computer Vision
Image Segmentation
๐ข Beihang University
DUSA:Unlocking Diffusion Modelsโ Discriminative Power for Efficient Test-Time Adaptation
Enhancing Graph Transformers with Hierarchical Distance Structural Encoding
ยท3923 wordsยท19 minsยท
loading
ยท
loading
AI Generated
Machine Learning
Representation Learning
๐ข Beihang University
Hierarchical Distance Structural Encoding (HDSE) empowers graph transformers to better capture hierarchical graph structures, leading to improved performance in graph classification and regression tasโฆ
Dual Defense: Enhancing Privacy and Mitigating Poisoning Attacks in Federated Learning
ยท2489 wordsยท12 minsยท
loading
ยท
loading
AI Generated
Machine Learning
Federated Learning
๐ข Beihang University
Dual Defense Federated Learning (DDFed) simultaneously boosts privacy and thwarts poisoning attacks in federated learning without altering the existing framework.
CLIP in Mirror: Disentangling text from visual images through reflection
ยท4284 wordsยท21 minsยท
loading
ยท
loading
AI Generated
Multimodal Learning
Vision-Language Models
๐ข Beihang University
MirrorCLIP disentangles text from images in CLIP using mirror reflection differences, enhancing robustness against text-visual image confusion.
Block Sparse Bayesian Learning: A Diversified Scheme
ยท3512 wordsยท17 minsยท
loading
ยท
loading
Machine Learning
Deep Learning
๐ข Beihang University
Diversified Block Sparse Bayesian Learning (DivSBL) improves block sparse signal recovery by adapting to unknown block structures, enhancing accuracy and robustness over existing methods.
BiDM: Pushing the Limit of Quantization for Diffusion Models
ยท2390 wordsยท12 minsยท
loading
ยท
loading
Computer Vision
Image Generation
๐ข Beihang University
BiDM achieves full 1-bit quantization in diffusion models, significantly improving storage and speed without sacrificing image quality, setting a new state-of-the-art.
4Diffusion: Multi-view Video Diffusion Model for 4D Generation
ยท2302 wordsยท11 minsยท
loading
ยท
loading
Computer Vision
3D Vision
๐ข Beihang University
4Diffusion generates high-quality, temporally consistent 4D content from monocular videos using a unified multi-view diffusion model and novel loss functions.