🏢 Huazhong University of Science and Technology
Unlearnable 3D Point Clouds: Class-wise Transformation Is All You Need
·3088 words·15 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 Huazhong University of Science and Technology
New unlearnable framework secures 3D point cloud data by using class-wise transformations, enabling authorized training while preventing unauthorized access.
United We Stand, Divided We Fall: Fingerprinting Deep Neural Networks via Adversarial Trajectories
·2453 words·12 mins·
loading
·
loading
Machine Learning
Deep Learning
🏢 Huazhong University of Science and Technology
ADV-TRA uses adversarial trajectories to robustly fingerprint deep neural networks, outperforming state-of-the-art methods against various removal attacks.
Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging
·2935 words·14 mins·
loading
·
loading
AI Generated
Natural Language Processing
Large Language Models
🏢 Huazhong University of Science and Technology
Twin-Merging dynamically merges modular model expertise, significantly improving multitask performance without retraining, and adapting to diverse data.
Self-Distilled Depth Refinement with Noisy Poisson Fusion
·2691 words·13 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 Huazhong University of Science and Technology
Self-Distilled Depth Refinement (SDDR) tackles noisy depth maps via a novel noisy Poisson fusion approach, achieving significant improvements in depth accuracy and edge quality.
On Giant's Shoulders: Effortless Weak to Strong by Dynamic Logits Fusion
·2220 words·11 mins·
loading
·
loading
Natural Language Processing
Large Language Models
🏢 Huazhong University of Science and Technology
Effortlessly boost large language model performance by dynamically fusing knowledge from smaller, task-specific models – achieving near full fine-tuning results with minimal computational cost!
MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision Tasks
·2159 words·11 mins·
loading
·
loading
Computer Vision
Image Classification
🏢 Huazhong University of Science and Technology
MoE Jetpack efficiently transforms readily available dense checkpoints into high-performing MoE models, drastically accelerating convergence and improving accuracy.
Long-range Meta-path Search on Large-scale Heterogeneous Graphs
·2383 words·12 mins·
loading
·
loading
Machine Learning
Representation Learning
🏢 Huazhong University of Science and Technology
LMSPS: a novel framework efficiently leverages long-range dependencies in large heterogeneous graphs by dynamically identifying effective meta-paths, mitigating computational costs and over-smoothing.
LION: Linear Group RNN for 3D Object Detection in Point Clouds
·3911 words·19 mins·
loading
·
loading
AI Generated
Computer Vision
Object Detection
🏢 Huazhong University of Science and Technology
LION: Linear Group RNNs conquer 3D object detection in sparse point clouds by enabling efficient long-range feature interaction, significantly outperforming transformer-based methods.
Lightweight Frequency Masker for Cross-Domain Few-Shot Semantic Segmentation
·3232 words·16 mins·
loading
·
loading
AI Generated
Computer Vision
Image Segmentation
🏢 Huazhong University of Science and Technology
Lightweight Frequency Masker significantly improves cross-domain few-shot semantic segmentation by cleverly filtering frequency components of images, thereby reducing inter-channel correlation and enh…
Leveraging Contrastive Learning for Enhanced Node Representations in Tokenized Graph Transformers
·1604 words·8 mins·
loading
·
loading
Machine Learning
Representation Learning
🏢 Huazhong University of Science and Technology
GCFormer, a novel graph Transformer, enhances node representation learning by employing a hybrid token generator and contrastive learning, outperforming existing methods on various datasets.
Is the MMI Criterion Necessary for Interpretability? Degenerating Non-causal Features to Plain Noise for Self-Rationalization
·1904 words·9 mins·
loading
·
loading
Natural Language Processing
Text Classification
🏢 Huazhong University of Science and Technology
New criterion maximizes remaining discrepancy after rationale removal, treating spurious features as noise, improving rationale extraction.
IR-CM: The Fast and Universal Image Restoration Method Based on Consistency Model
·2449 words·12 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 Huazhong University of Science and Technology
IR-CM: One-step image restoration using a novel consistency model for fast and universal performance.
How Sparse Can We Prune A Deep Network: A Fundamental Limit Perspective
·2596 words·13 mins·
loading
·
loading
Machine Learning
Deep Learning
🏢 Huazhong University of Science and Technology
Deep network pruning’s fundamental limits are characterized, revealing how weight magnitude and network sharpness determine the maximum achievable sparsity.
Generate Universal Adversarial Perturbations for Few-Shot Learning
·2582 words·13 mins·
loading
·
loading
Machine Learning
Few-Shot Learning
🏢 Huazhong University of Science and Technology
Researchers developed FSAFW, a novel framework generating universal adversarial perturbations effective against various Few-Shot Learning paradigms, surpassing baseline methods by over 16%.
FasterDiT: Towards Faster Diffusion Transformers Training without Architecture Modification
·2727 words·13 mins·
loading
·
loading
AI Generated
Computer Vision
Image Generation
🏢 Huazhong University of Science and Technology
FasterDiT accelerates Diffusion Transformers training 7x without architecture modification by analyzing SNR probability density functions and implementing a new supervision method.
Diffusion Priors for Variational Likelihood Estimation and Image Denoising
·2184 words·11 mins·
loading
·
loading
Image Generation
🏢 Huazhong University of Science and Technology
Adaptive likelihood estimation and MAP inference during reverse diffusion tackles real-world image noise.
DarkSAM: Fooling Segment Anything Model to Segment Nothing
·3441 words·17 mins·
loading
·
loading
AI Generated
Computer Vision
Image Segmentation
🏢 Huazhong University of Science and Technology
DarkSAM, a novel prompt-free attack, renders the Segment Anything Model incapable of segmenting objects across diverse images, highlighting its vulnerability to universal adversarial perturbations.
Coupled Mamba: Enhanced Multimodal Fusion with Coupled State Space Model
·2541 words·12 mins·
loading
·
loading
AI Generated
Multimodal Learning
Vision-Language Models
🏢 Huazhong University of Science and Technology
Coupled Mamba: Enhanced multi-modal fusion via coupled state space model boosts accuracy and efficiency.
Attention Temperature Matters in ViT-Based Cross-Domain Few-Shot Learning
·2330 words·11 mins·
loading
·
loading
Computer Vision
Few-Shot Learning
🏢 Huazhong University of Science and Technology
Boosting Vision Transformer’s transferability in cross-domain few-shot learning is achieved by a simple yet effective method: strategically adjusting attention temperature to remedy ineffective target…
Are Your Models Still Fair? Fairness Attacks on Graph Neural Networks via Node Injections
·2258 words·11 mins·
loading
·
loading
AI Theory
Fairness
🏢 Huazhong University of Science and Technology
Node Injection-based Fairness Attack (NIFA) reveals GNNs’ vulnerability to realistic fairness attacks by injecting a small percentage of nodes, significantly undermining fairness even in fairness-awar…