Posters
2024
Attack-Resilient Image Watermarking Using Stable Diffusion
·3069 words·15 mins·
loading
·
loading
Computer Vision
Image Generation
π’ University of Massachusetts Amherst
ZoDiac: a novel image watermarking framework leveraging pre-trained stable diffusion models for robust, invisible watermarks resistant to state-of-the-art attacks.
Attack-Aware Noise Calibration for Differential Privacy
·2558 words·13 mins·
loading
·
loading
AI Generated
AI Theory
Privacy
π’ Lausanne University Hospital
Boosting machine learning model accuracy in privacy-preserving applications, this research introduces novel noise calibration methods directly targeting desired attack risk levels, bypassing conventio…
Atlas3D: Physically Constrained Self-Supporting Text-to-3D for Simulation and Fabrication
·2999 words·15 mins·
loading
·
loading
AI Generated
Multimodal Learning
Vision-Language Models
π’ UC Los Angeles
Atlas3D enhances text-to-3D generation by integrating physics-based simulations, producing self-supporting 3D models for seamless real-world applications.
Asynchronous Perception Machine for Efficient Test Time Training
·5559 words·27 mins·
loading
·
loading
AI Generated
Computer Vision
Image Classification
π’ University of Central Florida
APM: Asynchronous Perception Machine, a computationally-efficient architecture for test-time training (TTT), processes image patches asynchronously, encoding semantic awareness without pre-training, a…
AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising
·2720 words·13 mins·
loading
·
loading
Computer Vision
Image Generation
π’ National University of Singapore
AsyncDiff accelerates diffusion model inference by 2.8x using asynchronous denoising and model parallelism, maintaining near-perfect image quality.
Asymptotics of Alpha-Divergence Variational Inference Algorithms with Exponential Families
·1536 words·8 mins·
loading
·
loading
Machine Learning
Optimization
π’ Telecom Sud-Paris
This paper rigorously analyzes alpha-divergence variational inference, proving its convergence and providing convergence rates, thereby advancing the theoretical foundations of this increasingly impor…
Association Pattern-aware Fusion for Biological Entity Relationship Prediction
·1918 words·10 mins·
loading
·
loading
AI Applications
Healthcare
π’ Zhejiang University
Pattern-BERP, a novel method, boosts biological entity relationship prediction accuracy by 4-23% using association pattern-aware fusion, enhancing interpretability for real-world applications.
Assembly Fuzzy Representation on Hypergraph for Open-Set 3D Object Retrieval
·2034 words·10 mins·
loading
·
loading
Computer Vision
3D Vision
π’ Tsinghua University
Hypergraph-Based Assembly Fuzzy Representation (HAFR) excels at open-set 3D object retrieval by using part-level shapes and fuzzy representations to overcome challenges posed by unseen object categori…
Ask, Attend, Attack: An Effective Decision-Based Black-Box Targeted Attack for Image-to-Text Models
·3219 words·16 mins·
loading
·
loading
AI Generated
Natural Language Processing
Vision-Language Models
π’ Xiamen University
This paper introduces AAA, a novel three-stage decision-based black-box targeted attack against image-to-text models. AAA efficiently generates semantically consistent adversarial examples by asking …
AsCAN: Asymmetric Convolution-Attention Networks for Efficient Recognition and Generation
·2478 words·12 mins·
loading
·
loading
Computer Vision
Image Generation
π’ Snap Inc.
AsCAN, a novel hybrid architecture, achieves superior efficiency and performance in image recognition and generation by asymmetrically combining convolutional and transformer blocks.
Artificial Generational Intelligence: Cultural Accumulation in Reinforcement Learning
·1937 words·10 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
π’ University of Oxford
Reinforcement learning agents achieve emergent cultural accumulation by balancing social and independent learning, outperforming single-lifetime agents.
Articulate your NeRF: Unsupervised articulated object modeling via conditional view synthesis
·3848 words·19 mins·
loading
·
loading
AI Generated
Computer Vision
3D Vision
π’ University of Edinburgh
Unsupervised Articulated Object Modeling using Conditional View Synthesis learns pose and part segmentation from only two object observations, achieving significantly better performance than previous …
Artemis: Towards Referential Understanding in Complex Videos
·3373 words·16 mins·
loading
·
loading
AI Generated
Natural Language Processing
Large Language Models
π’ University of Chinese Academy of Sciences
Artemis: A new MLLM excels at video-based referential understanding, accurately describing targets within complex videos using natural language questions and bounding boxes, surpassing existing models…
ART: Automatic Red-teaming for Text-to-Image Models to Protect Benign Users
·3873 words·19 mins·
loading
·
loading
AI Generated
Computer Vision
Image Generation
π’ Nanyang Technological University
ART: A novel automatic red-teaming framework reveals safety vulnerabilities in popular text-to-image models by identifying unsafe outputs even from seemingly harmless prompts.
AROMA: Preserving Spatial Structure for Latent PDE Modeling with Local Neural Fields
·3676 words·18 mins·
loading
·
loading
Machine Learning
Deep Learning
π’ Sorbonne UniversitΓ©
AROMA: Attentive Reduced Order Model with Attention enhances PDE modeling with local neural fields, offering efficient processing of diverse geometries and superior performance in simulating 1D and 2D…
ArkVale: Efficient Generative LLM Inference with Recallable Key-Value Eviction
·2152 words·11 mins·
loading
·
loading
Natural Language Processing
Large Language Models
π’ Peking University
ARKVALE boosts LLM inference efficiency by intelligently evicting and recalling key-value pairs from cache, improving latency and throughput without significant accuracy loss.
Are Your Models Still Fair? Fairness Attacks on Graph Neural Networks via Node Injections
·2258 words·11 mins·
loading
·
loading
AI Theory
Fairness
π’ Huazhong University of Science and Technology
Node Injection-based Fairness Attack (NIFA) reveals GNNs’ vulnerability to realistic fairness attacks by injecting a small percentage of nodes, significantly undermining fairness even in fairness-awar…
Are We on the Right Way for Evaluating Large Vision-Language Models?
·2514 words·12 mins·
loading
·
loading
Multimodal Learning
Vision-Language Models
π’ University of Science and Technology of China
MMStar benchmark tackles flawed LVLMs evaluation by focusing on vision-critical samples, minimizing data leakage, and introducing new metrics for fair multi-modal gain assessment.
Are Uncertainty Quantification Capabilities of Evidential Deep Learning a Mirage?
·2360 words·12 mins·
loading
·
loading
Machine Learning
Deep Learning
π’ MIT
Evidential deep learning’s uncertainty quantification is unreliable; this paper reveals its limitations, proposes model uncertainty incorporation for improved performance.
Are Self-Attentions Effective for Time Series Forecasting?
·3575 words·17 mins·
loading
·
loading
Machine Learning
Deep Learning
π’ Seoul National University
Cross-Attention-only Time Series Transformer (CATS) outperforms existing models by removing self-attention, improving long-term forecasting accuracy, and reducing computational cost.