Posters

Attack-Resilient Image Watermarking Using Stable Diffusion

26 September 2024·3069 words·15 mins· loading · loading

Computer Vision Image Generation 🏢 University of Massachusetts Amherst

ZoDiac: a novel image watermarking framework leveraging pre-trained stable diffusion models for robust, invisible watermarks resistant to state-of-the-art attacks.

Attack-Aware Noise Calibration for Differential Privacy

26 September 2024·2558 words·13 mins· loading · loading

AI Generated AI Theory Privacy 🏢 Lausanne University Hospital

Boosting machine learning model accuracy in privacy-preserving applications, this research introduces novel noise calibration methods directly targeting desired attack risk levels, bypassing conventio…

Atlas3D: Physically Constrained Self-Supporting Text-to-3D for Simulation and Fabrication

26 September 2024·2999 words·15 mins· loading · loading

AI Generated Multimodal Learning Vision-Language Models 🏢 UC Los Angeles

Atlas3D enhances text-to-3D generation by integrating physics-based simulations, producing self-supporting 3D models for seamless real-world applications.

Asynchronous Perception Machine for Efficient Test Time Training

26 September 2024·5559 words·27 mins· loading · loading

AI Generated Computer Vision Image Classification 🏢 University of Central Florida

APM: Asynchronous Perception Machine, a computationally-efficient architecture for test-time training (TTT), processes image patches asynchronously, encoding semantic awareness without pre-training, a…

AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising

26 September 2024·2720 words·13 mins· loading · loading

Computer Vision Image Generation 🏢 National University of Singapore

AsyncDiff accelerates diffusion model inference by 2.8x using asynchronous denoising and model parallelism, maintaining near-perfect image quality.

Asymptotics of Alpha-Divergence Variational Inference Algorithms with Exponential Families

26 September 2024·1536 words·8 mins· loading · loading

Machine Learning Optimization 🏢 Telecom Sud-Paris

This paper rigorously analyzes alpha-divergence variational inference, proving its convergence and providing convergence rates, thereby advancing the theoretical foundations of this increasingly impor…

Association Pattern-aware Fusion for Biological Entity Relationship Prediction

26 September 2024·1918 words·10 mins· loading · loading

AI Applications Healthcare 🏢 Zhejiang University

Pattern-BERP, a novel method, boosts biological entity relationship prediction accuracy by 4-23% using association pattern-aware fusion, enhancing interpretability for real-world applications.

Assembly Fuzzy Representation on Hypergraph for Open-Set 3D Object Retrieval

26 September 2024·2034 words·10 mins· loading · loading

Computer Vision 3D Vision 🏢 Tsinghua University

Hypergraph-Based Assembly Fuzzy Representation (HAFR) excels at open-set 3D object retrieval by using part-level shapes and fuzzy representations to overcome challenges posed by unseen object categori…

Ask, Attend, Attack: An Effective Decision-Based Black-Box Targeted Attack for Image-to-Text Models

26 September 2024·3219 words·16 mins· loading · loading

AI Generated Natural Language Processing Vision-Language Models 🏢 Xiamen University

This paper introduces AAA, a novel three-stage decision-based black-box targeted attack against image-to-text models. AAA efficiently generates semantically consistent adversarial examples by asking …

AsCAN: Asymmetric Convolution-Attention Networks for Efficient Recognition and Generation

26 September 2024·2478 words·12 mins· loading · loading

Computer Vision Image Generation 🏢 Snap Inc.

AsCAN, a novel hybrid architecture, achieves superior efficiency and performance in image recognition and generation by asymmetrically combining convolutional and transformer blocks.

Artificial Generational Intelligence: Cultural Accumulation in Reinforcement Learning

26 September 2024·1937 words·10 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 University of Oxford

Reinforcement learning agents achieve emergent cultural accumulation by balancing social and independent learning, outperforming single-lifetime agents.

Articulate your NeRF: Unsupervised articulated object modeling via conditional view synthesis

26 September 2024·3848 words·19 mins· loading · loading

AI Generated Computer Vision 3D Vision 🏢 University of Edinburgh

Unsupervised Articulated Object Modeling using Conditional View Synthesis learns pose and part segmentation from only two object observations, achieving significantly better performance than previous …

Artemis: Towards Referential Understanding in Complex Videos

26 September 2024·3373 words·16 mins· loading · loading

AI Generated Natural Language Processing Large Language Models 🏢 University of Chinese Academy of Sciences

Artemis: A new MLLM excels at video-based referential understanding, accurately describing targets within complex videos using natural language questions and bounding boxes, surpassing existing models…

ART: Automatic Red-teaming for Text-to-Image Models to Protect Benign Users

26 September 2024·3873 words·19 mins· loading · loading

AI Generated Computer Vision Image Generation 🏢 Nanyang Technological University

ART: A novel automatic red-teaming framework reveals safety vulnerabilities in popular text-to-image models by identifying unsafe outputs even from seemingly harmless prompts.

AROMA: Preserving Spatial Structure for Latent PDE Modeling with Local Neural Fields

26 September 2024·3676 words·18 mins· loading · loading

Machine Learning Deep Learning 🏢 Sorbonne Université

AROMA: Attentive Reduced Order Model with Attention enhances PDE modeling with local neural fields, offering efficient processing of diverse geometries and superior performance in simulating 1D and 2D…

ArkVale: Efficient Generative LLM Inference with Recallable Key-Value Eviction

26 September 2024·2152 words·11 mins· loading · loading

Natural Language Processing Large Language Models 🏢 Peking University

ARKVALE boosts LLM inference efficiency by intelligently evicting and recalling key-value pairs from cache, improving latency and throughput without significant accuracy loss.

Are Your Models Still Fair? Fairness Attacks on Graph Neural Networks via Node Injections

26 September 2024·2258 words·11 mins· loading · loading

AI Theory Fairness 🏢 Huazhong University of Science and Technology

Node Injection-based Fairness Attack (NIFA) reveals GNNs’ vulnerability to realistic fairness attacks by injecting a small percentage of nodes, significantly undermining fairness even in fairness-awar…

Are We on the Right Way for Evaluating Large Vision-Language Models?

26 September 2024·2514 words·12 mins· loading · loading

Multimodal Learning Vision-Language Models 🏢 University of Science and Technology of China

MMStar benchmark tackles flawed LVLMs evaluation by focusing on vision-critical samples, minimizing data leakage, and introducing new metrics for fair multi-modal gain assessment.

Are Uncertainty Quantification Capabilities of Evidential Deep Learning a Mirage?

26 September 2024·2360 words·12 mins· loading · loading

Machine Learning Deep Learning 🏢 MIT

Evidential deep learning’s uncertainty quantification is unreliable; this paper reveals its limitations, proposes model uncertainty incorporation for improved performance.

Are Self-Attentions Effective for Time Series Forecasting?

26 September 2024·3575 words·17 mins· loading · loading

Machine Learning Deep Learning 🏢 Seoul National University

Cross-Attention-only Time Series Transformer (CATS) outperforms existing models by removing self-attention, improving long-term forecasting accuracy, and reducing computational cost.