🏢 Chinese Academy of Sciences
Visual Anchors Are Strong Information Aggregators For Multimodal Large Language Model
·2150 words·11 mins·
loading
·
loading
Multimodal Learning
Vision-Language Models
🏢 Chinese Academy of Sciences
AcFormer, a novel vision-language connector for MLLMs, leverages ‘visual anchors’ to reduce computation cost by ~66% while improving accuracy.
Understanding and Improving Adversarial Collaborative Filtering for Robust Recommendation
·2042 words·10 mins·
loading
·
loading
AI Generated
AI Theory
Robustness
🏢 Chinese Academy of Sciences
PamaCF, a novel personalized adversarial collaborative filtering technique, significantly improves recommendation robustness and accuracy against poisoning attacks by dynamically adjusting perturbatio…
Improving Robustness of 3D Point Cloud Recognition from a Fourier Perspective
·2312 words·11 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 Chinese Academy of Sciences
Boosting 3D point cloud recognition robustness, Frequency Adversarial Training (FAT) leverages frequency-domain adversarial examples to improve model resilience against corruptions, achieving state-of…
Hallo3D: Multi-Modal Hallucination Detection and Mitigation for Consistent 3D Content Generation
·2871 words·14 mins·
loading
·
loading
AI Generated
Computer Vision
3D Vision
🏢 Chinese Academy of Sciences
Hallo3D: a tuning-free method resolving 3D generation hallucinations via multi-modal inconsistency detection and mitigation for consistent 3D content.
Generalizablity of Memorization Neural Network
·1319 words·7 mins·
loading
·
loading
AI Theory
Generalization
🏢 Chinese Academy of Sciences
Unlocking deep learning’s generalization mystery, this research pioneers a theoretical understanding of memorization neural network generalizability, revealing critical network structural requirements…
Beyond Efficiency: Molecular Data Pruning for Enhanced Generalization
·2607 words·13 mins·
loading
·
loading
AI Generated
Machine Learning
Transfer Learning
🏢 Chinese Academy of Sciences
MolPeg, a novel molecular data pruning framework, enhances model generalization in transfer learning by using a source-free approach and consistently outperforming other methods, even surpassing full-…