🏢 Huazhong University of Science and Technology

Unlearnable 3D Point Clouds: Class-wise Transformation Is All You Need

26 September 2024·3088 words·15 mins· loading · loading

Computer Vision 3D Vision 🏢 Huazhong University of Science and Technology

New unlearnable framework secures 3D point cloud data by using class-wise transformations, enabling authorized training while preventing unauthorized access.

United We Stand, Divided We Fall: Fingerprinting Deep Neural Networks via Adversarial Trajectories

26 September 2024·2453 words·12 mins· loading · loading

Machine Learning Deep Learning 🏢 Huazhong University of Science and Technology

ADV-TRA uses adversarial trajectories to robustly fingerprint deep neural networks, outperforming state-of-the-art methods against various removal attacks.

Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging

26 September 2024·2935 words·14 mins· loading · loading

AI Generated Natural Language Processing Large Language Models 🏢 Huazhong University of Science and Technology

Twin-Merging dynamically merges modular model expertise, significantly improving multitask performance without retraining, and adapting to diverse data.

Self-Distilled Depth Refinement with Noisy Poisson Fusion

26 September 2024·2691 words·13 mins· loading · loading

Computer Vision 3D Vision 🏢 Huazhong University of Science and Technology

Self-Distilled Depth Refinement (SDDR) tackles noisy depth maps via a novel noisy Poisson fusion approach, achieving significant improvements in depth accuracy and edge quality.

On Giant's Shoulders: Effortless Weak to Strong by Dynamic Logits Fusion

26 September 2024·2220 words·11 mins· loading · loading

Natural Language Processing Large Language Models 🏢 Huazhong University of Science and Technology

Effortlessly boost large language model performance by dynamically fusing knowledge from smaller, task-specific models – achieving near full fine-tuning results with minimal computational cost!

MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision Tasks

26 September 2024·2159 words·11 mins· loading · loading

Computer Vision Image Classification 🏢 Huazhong University of Science and Technology

MoE Jetpack efficiently transforms readily available dense checkpoints into high-performing MoE models, drastically accelerating convergence and improving accuracy.

Long-range Meta-path Search on Large-scale Heterogeneous Graphs

26 September 2024·2383 words·12 mins· loading · loading

Machine Learning Representation Learning 🏢 Huazhong University of Science and Technology

LMSPS: a novel framework efficiently leverages long-range dependencies in large heterogeneous graphs by dynamically identifying effective meta-paths, mitigating computational costs and over-smoothing.

LION: Linear Group RNN for 3D Object Detection in Point Clouds

26 September 2024·3911 words·19 mins· loading · loading

AI Generated Computer Vision Object Detection 🏢 Huazhong University of Science and Technology

LION: Linear Group RNNs conquer 3D object detection in sparse point clouds by enabling efficient long-range feature interaction, significantly outperforming transformer-based methods.

Lightweight Frequency Masker for Cross-Domain Few-Shot Semantic Segmentation

26 September 2024·3232 words·16 mins· loading · loading

AI Generated Computer Vision Image Segmentation 🏢 Huazhong University of Science and Technology

Lightweight Frequency Masker significantly improves cross-domain few-shot semantic segmentation by cleverly filtering frequency components of images, thereby reducing inter-channel correlation and enh…

Leveraging Contrastive Learning for Enhanced Node Representations in Tokenized Graph Transformers

26 September 2024·1604 words·8 mins· loading · loading

Machine Learning Representation Learning 🏢 Huazhong University of Science and Technology

GCFormer, a novel graph Transformer, enhances node representation learning by employing a hybrid token generator and contrastive learning, outperforming existing methods on various datasets.

Is the MMI Criterion Necessary for Interpretability? Degenerating Non-causal Features to Plain Noise for Self-Rationalization

26 September 2024·1904 words·9 mins· loading · loading

Natural Language Processing Text Classification 🏢 Huazhong University of Science and Technology

New criterion maximizes remaining discrepancy after rationale removal, treating spurious features as noise, improving rationale extraction.

IR-CM: The Fast and Universal Image Restoration Method Based on Consistency Model

26 September 2024·2449 words·12 mins· loading · loading

Computer Vision Image Generation 🏢 Huazhong University of Science and Technology

IR-CM: One-step image restoration using a novel consistency model for fast and universal performance.

How Sparse Can We Prune A Deep Network: A Fundamental Limit Perspective

26 September 2024·2596 words·13 mins· loading · loading

Machine Learning Deep Learning 🏢 Huazhong University of Science and Technology

Deep network pruning’s fundamental limits are characterized, revealing how weight magnitude and network sharpness determine the maximum achievable sparsity.

Generate Universal Adversarial Perturbations for Few-Shot Learning

26 September 2024·2582 words·13 mins· loading · loading

Machine Learning Few-Shot Learning 🏢 Huazhong University of Science and Technology

Researchers developed FSAFW, a novel framework generating universal adversarial perturbations effective against various Few-Shot Learning paradigms, surpassing baseline methods by over 16%.

FasterDiT: Towards Faster Diffusion Transformers Training without Architecture Modification

26 September 2024·2727 words·13 mins· loading · loading

AI Generated Computer Vision Image Generation 🏢 Huazhong University of Science and Technology

FasterDiT accelerates Diffusion Transformers training 7x without architecture modification by analyzing SNR probability density functions and implementing a new supervision method.

Diffusion Priors for Variational Likelihood Estimation and Image Denoising

26 September 2024·2184 words·11 mins· loading · loading

Image Generation 🏢 Huazhong University of Science and Technology

Adaptive likelihood estimation and MAP inference during reverse diffusion tackles real-world image noise.

DarkSAM: Fooling Segment Anything Model to Segment Nothing

26 September 2024·3441 words·17 mins· loading · loading

AI Generated Computer Vision Image Segmentation 🏢 Huazhong University of Science and Technology

DarkSAM, a novel prompt-free attack, renders the Segment Anything Model incapable of segmenting objects across diverse images, highlighting its vulnerability to universal adversarial perturbations.

Coupled Mamba: Enhanced Multimodal Fusion with Coupled State Space Model

26 September 2024·2541 words·12 mins· loading · loading

AI Generated Multimodal Learning Vision-Language Models 🏢 Huazhong University of Science and Technology

Coupled Mamba: Enhanced multi-modal fusion via coupled state space model boosts accuracy and efficiency.

Attention Temperature Matters in ViT-Based Cross-Domain Few-Shot Learning

26 September 2024·2330 words·11 mins· loading · loading

Computer Vision Few-Shot Learning 🏢 Huazhong University of Science and Technology

Boosting Vision Transformer’s transferability in cross-domain few-shot learning is achieved by a simple yet effective method: strategically adjusting attention temperature to remedy ineffective target…

Are Your Models Still Fair? Fairness Attacks on Graph Neural Networks via Node Injections

26 September 2024·2258 words·11 mins· loading · loading

AI Theory Fairness 🏢 Huazhong University of Science and Technology

Node Injection-based Fairness Attack (NIFA) reveals GNNs’ vulnerability to realistic fairness attacks by injecting a small percentage of nodes, significantly undermining fairness even in fairness-awar…