Posters

2024

Attack-Resilient Image Watermarking Using Stable Diffusion
·3069 words·15 mins
Computer Vision Image Generation 🏒 University of Massachusetts Amherst
ZoDiac: a novel image watermarking framework leveraging pre-trained stable diffusion models for robust, invisible watermarks resistant to state-of-the-art attacks.
Attack-Aware Noise Calibration for Differential Privacy
·2558 words·13 mins
AI Generated AI Theory Privacy 🏒 Lausanne University Hospital
Boosting machine learning model accuracy in privacy-preserving applications, this research introduces novel noise calibration methods directly targeting desired attack risk levels, bypassing conventio…
Atlas3D: Physically Constrained Self-Supporting Text-to-3D for Simulation and Fabrication
·2999 words·15 mins
AI Generated Multimodal Learning Vision-Language Models 🏒 UC Los Angeles
Atlas3D enhances text-to-3D generation by integrating physics-based simulations, producing self-supporting 3D models for seamless real-world applications.
Asynchronous Perception Machine for Efficient Test Time Training
·5559 words·27 mins
AI Generated Computer Vision Image Classification 🏒 University of Central Florida
APM: Asynchronous Perception Machine, a computationally-efficient architecture for test-time training (TTT), processes image patches asynchronously, encoding semantic awareness without pre-training, a…
AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising
·2720 words·13 mins
Computer Vision Image Generation 🏒 National University of Singapore
AsyncDiff accelerates diffusion model inference by 2.8x using asynchronous denoising and model parallelism, maintaining near-perfect image quality.
Asymptotics of Alpha-Divergence Variational Inference Algorithms with Exponential Families
·1536 words·8 mins
Machine Learning Optimization 🏒 Télécom SudParis
This paper rigorously analyzes alpha-divergence variational inference, proving its convergence and providing convergence rates, thereby advancing the theoretical foundations of this increasingly impor…
Association Pattern-aware Fusion for Biological Entity Relationship Prediction
·1918 words·10 mins
AI Applications Healthcare 🏒 Zhejiang University
Pattern-BERP, a novel method, boosts biological entity relationship prediction accuracy by 4-23% using association pattern-aware fusion, enhancing interpretability for real-world applications.
Assembly Fuzzy Representation on Hypergraph for Open-Set 3D Object Retrieval
·2034 words·10 mins
Computer Vision 3D Vision 🏒 Tsinghua University
Hypergraph-Based Assembly Fuzzy Representation (HAFR) excels at open-set 3D object retrieval by using part-level shapes and fuzzy representations to overcome challenges posed by unseen object categori…
Ask, Attend, Attack: An Effective Decision-Based Black-Box Targeted Attack for Image-to-Text Models
·3219 words·16 mins
AI Generated Natural Language Processing Vision-Language Models 🏒 Xiamen University
This paper introduces AAA, a novel three-stage decision-based black-box targeted attack against image-to-text models. AAA efficiently generates semantically consistent adversarial examples by asking …
AsCAN: Asymmetric Convolution-Attention Networks for Efficient Recognition and Generation
·2478 words·12 mins
Computer Vision Image Generation 🏒 Snap Inc.
AsCAN, a novel hybrid architecture, achieves superior efficiency and performance in image recognition and generation by asymmetrically combining convolutional and transformer blocks.
Artificial Generational Intelligence: Cultural Accumulation in Reinforcement Learning
·1937 words·10 mins
Machine Learning Reinforcement Learning 🏒 University of Oxford
Reinforcement learning agents achieve emergent cultural accumulation by balancing social and independent learning, outperforming single-lifetime agents.
Articulate your NeRF: Unsupervised articulated object modeling via conditional view synthesis
·3848 words·19 mins
AI Generated Computer Vision 3D Vision 🏒 University of Edinburgh
Unsupervised Articulated Object Modeling using Conditional View Synthesis learns pose and part segmentation from only two object observations, achieving significantly better performance than previous …
Artemis: Towards Referential Understanding in Complex Videos
·3373 words·16 mins
AI Generated Natural Language Processing Large Language Models 🏒 University of Chinese Academy of Sciences
Artemis: A new MLLM excels at video-based referential understanding, accurately describing targets within complex videos using natural language questions and bounding boxes, surpassing existing models…
ART: Automatic Red-teaming for Text-to-Image Models to Protect Benign Users
·3873 words·19 mins
AI Generated Computer Vision Image Generation 🏒 Nanyang Technological University
ART: A novel automatic red-teaming framework reveals safety vulnerabilities in popular text-to-image models by identifying unsafe outputs even from seemingly harmless prompts.
AROMA: Preserving Spatial Structure for Latent PDE Modeling with Local Neural Fields
·3676 words·18 mins
Machine Learning Deep Learning 🏒 Sorbonne Université
AROMA: Attentive Reduced Order Model with Attention enhances PDE modeling with local neural fields, offering efficient processing of diverse geometries and superior performance in simulating 1D and 2D…
ArkVale: Efficient Generative LLM Inference with Recallable Key-Value Eviction
·2152 words·11 mins
Natural Language Processing Large Language Models 🏒 Peking University
ArkVale boosts LLM inference efficiency by intelligently evicting and recalling key-value pairs from cache, improving latency and throughput without significant accuracy loss.
Are Your Models Still Fair? Fairness Attacks on Graph Neural Networks via Node Injections
·2258 words·11 mins
AI Theory Fairness 🏒 Huazhong University of Science and Technology
Node Injection-based Fairness Attack (NIFA) reveals GNNs’ vulnerability to realistic fairness attacks by injecting a small percentage of nodes, significantly undermining fairness even in fairness-awar…
Are We on the Right Way for Evaluating Large Vision-Language Models?
·2514 words·12 mins
Multimodal Learning Vision-Language Models 🏒 University of Science and Technology of China
The MMStar benchmark addresses flawed LVLM evaluation by focusing on vision-critical samples, minimizing data leakage, and introducing new metrics for fair assessment of multi-modal gain.
Are Uncertainty Quantification Capabilities of Evidential Deep Learning a Mirage?
·2360 words·12 mins
Machine Learning Deep Learning 🏒 MIT
Evidential deep learning’s uncertainty quantification is unreliable; this paper reveals its limitations and proposes incorporating model uncertainty to improve performance.
Are Self-Attentions Effective for Time Series Forecasting?
·3575 words·17 mins
Machine Learning Deep Learning 🏒 Seoul National University
Cross-Attention-only Time Series Transformer (CATS) outperforms existing models by removing self-attention, improving long-term forecasting accuracy, and reducing computational cost.