Posters

Eye-gaze Guided Multi-modal Alignment for Medical Representation Learning

26 September 2024·2845 words·14 mins· loading · loading

AI Generated Multimodal Learning Vision-Language Models 🏢 Harvard University

Eye-gaze data boosts medical image-text alignment!

Extracting Training Data from Molecular Pre-trained Models

26 September 2024·2322 words·11 mins· loading · loading

AI Generated AI Theory Privacy 🏢 Zhejiang University

Researchers reveal a high risk of training data extraction from molecular pre-trained models, challenging the assumption that model sharing alone adequately protects against data theft.

Externally Valid Policy Evaluation from Randomized Trials Using Additional Observational Data

26 September 2024·3848 words·19 mins· loading · loading

AI Theory Causality 🏢 Uppsala University

This paper introduces a novel nonparametric method to make policy evaluations from randomized trials externally valid, even when trial and target populations differ. It leverages additional covariate…

Extending Video Masked Autoencoders to 128 frames

26 September 2024·2466 words·12 mins· loading · loading

Computer Vision Video Understanding 🏢 Google Research

Long-video masked autoencoders (LVMAE) achieve state-of-the-art performance by using an adaptive masking strategy that prioritizes important video tokens, enabling efficient training on 128 frames.

Extending Multi-modal Contrastive Representations

26 September 2024·2089 words·10 mins· loading · loading

Multimodal Learning Vision-Language Models 🏢 Zhejiang University

Ex-MCR: Efficiently build unified multi-modal representations by extending, not connecting, pre-trained spaces, achieving superior performance with less paired data and training.

Expressive Gaussian Human Avatars from Monocular RGB Video

26 September 2024·1431 words·7 mins· loading · loading

Computer Vision 3D Vision 🏢 University of Texas at Austin

EVA: a novel method generates expressive 3D Gaussian human avatars from monocular RGB videos, excelling in detailed hand and facial expressions via context-aware density control and improved SMPL-X al…

Exponential Quantum Communication Advantage in Distributed Inference and Learning

26 September 2024·2117 words·10 mins· loading · loading

AI Generated Machine Learning Deep Learning 🏢 Google Quantum AI

Quantum computing drastically reduces communication needs for distributed machine learning, enabling faster and more private AI.

eXponential FAmily Dynamical Systems (XFADS): Large-scale nonlinear Gaussian state-space modeling

26 September 2024·1596 words·8 mins· loading · loading

Machine Learning Deep Learning 🏢 Champalimaud Research

XFADS: a novel low-rank structured VAE framework for large-scale nonlinear Gaussian state-space modeling, achieving high predictive accuracy and scalability.

Exploring Token Pruning in Vision State Space Models

26 September 2024·1749 words·9 mins· loading · loading

Computer Vision Image Classification 🏢 Northeastern University

This paper introduces a novel token pruning method for vision state space models, achieving significant computational reduction with minimal performance impact, addressing the limitations of directly …

Exploring the trade-off between deep-learning and explainable models for brain-machine interfaces

26 September 2024·2641 words·13 mins· loading · loading

AI Generated AI Applications Healthcare 🏢 University of Michigan

KalmanNet, a novel BMI decoder, achieves state-of-the-art performance by integrating recurrent neural networks into Kalman filtering, balancing accuracy and explainability.

Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models

26 September 2024·2598 words·13 mins· loading · loading

AI Generated Natural Language Processing Large Language Models 🏢 SenseTime Research

LLM-Infused Diffuser boosts text-to-image generation by smartly integrating LLMs, surpassing existing models in prompt understanding and image quality.

Exploring the Precise Dynamics of Single-Layer GAN Models: Leveraging Multi-Feature Discriminators for High-Dimensional Subspace Learning

26 September 2024·1590 words·8 mins· loading · loading

Machine Learning Representation Learning 🏢 Koç University

Single-layer GANs learn data subspaces more effectively using multi-feature discriminators, enabling faster training and better feature representation than conventional methods.

Exploring the Edges of Latent State Clusters for Goal-Conditioned Reinforcement Learning

26 September 2024·3530 words·17 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 Rutgers University

CE2: A new goal-directed exploration algorithm for efficient reinforcement learning in unknown environments, prioritizing accessible frontier goals via latent state clustering.

Exploring Structured Semantic Priors Underlying Diffusion Score for Test-time Adaptation

26 September 2024·2718 words·13 mins· loading · loading

AI Generated Computer Vision Image Segmentation 🏢 Beihang University

DUSA:Unlocking Diffusion Models’ Discriminative Power for Efficient Test-Time Adaptation

Exploring Molecular Pretraining Model at Scale

26 September 2024·2151 words·11 mins· loading · loading

AI Generated Machine Learning Self-Supervised Learning 🏢 Peking University

Uni-Mol2, a groundbreaking 1.1B parameter molecular pretraining model, reveals power-law scaling in molecular representation learning, achieving significant performance improvements on downstream task…

Exploring Low-Dimensional Subspace in Diffusion Models for Controllable Image Editing

26 September 2024·2111 words·10 mins· loading · loading

Computer Vision Image Generation 🏢 University of Michigan

LOCO Edit achieves precise, localized image editing in diffusion models via a single-step, training-free method leveraging low-dimensional semantic subspaces.

Exploring Fixed Point in Image Editing: Theoretical Support and Convergence Optimization

26 September 2024·2322 words·11 mins· loading · loading

Computer Vision Image Generation 🏢 East China Normal University

This paper theoretically proves the existence and uniqueness of fixed points in DDIM inversion, optimizing the loss function for improved image editing and extending this approach to unsupervised imag…

Exploring DCN-like architecture for fast image generation with arbitrary resolution

26 September 2024·1909 words·9 mins· loading · loading

Computer Vision Image Generation 🏢 Nanjing University

FlowDCN: A purely convolutional generative model achieves state-of-the-art image generation speed and quality at arbitrary resolutions, surpassing transformer-based models.

Exploring Consistency in Graph Representations: from Graph Kernels to Graph Neural Networks

26 September 2024·2605 words·13 mins· loading · loading

AI Generated Machine Learning Representation Learning 🏢 Dartmouth College

Boost GNN graph classification accuracy by enforcing consistency in learned representations across layers using a novel loss function!

Exploring Behavior-Relevant and Disentangled Neural Dynamics with Generative Diffusion Models

26 September 2024·3176 words·15 mins· loading · loading

AI Generated Machine Learning Deep Learning 🏢 Georgia Institute of Technology

BeNeDiff uses generative diffusion models to disentangle and interpret neural dynamics linked to specific behaviors, providing interpretable quantifications of behavior in multi-brain region datasets.