Posters
2024
Eye-gaze Guided Multi-modal Alignment for Medical Representation Learning
·2845 words·14 mins·
loading
·
loading
AI Generated
Multimodal Learning
Vision-Language Models
๐ข Harvard University
Eye-gaze data boosts medical image-text alignment!
Extracting Training Data from Molecular Pre-trained Models
·2322 words·11 mins·
loading
·
loading
AI Generated
AI Theory
Privacy
๐ข Zhejiang University
Researchers reveal a high risk of training data extraction from molecular pre-trained models, challenging the assumption that model sharing alone adequately protects against data theft.
Externally Valid Policy Evaluation from Randomized Trials Using Additional Observational Data
·3848 words·19 mins·
loading
·
loading
AI Theory
Causality
๐ข Uppsala University
This paper introduces a novel nonparametric method to make policy evaluations from randomized trials externally valid, even when trial and target populations differ. It leverages additional covariate…
Extending Video Masked Autoencoders to 128 frames
·2466 words·12 mins·
loading
·
loading
Computer Vision
Video Understanding
๐ข Google Research
Long-video masked autoencoders (LVMAE) achieve state-of-the-art performance by using an adaptive masking strategy that prioritizes important video tokens, enabling efficient training on 128 frames.
Extending Multi-modal Contrastive Representations
·2089 words·10 mins·
loading
·
loading
Multimodal Learning
Vision-Language Models
๐ข Zhejiang University
Ex-MCR: Efficiently build unified multi-modal representations by extending, not connecting, pre-trained spaces, achieving superior performance with less paired data and training.
Expressive Gaussian Human Avatars from Monocular RGB Video
·1431 words·7 mins·
loading
·
loading
Computer Vision
3D Vision
๐ข University of Texas at Austin
EVA: a novel method generates expressive 3D Gaussian human avatars from monocular RGB videos, excelling in detailed hand and facial expressions via context-aware density control and improved SMPL-X al…
Exponential Quantum Communication Advantage in Distributed Inference and Learning
·2117 words·10 mins·
loading
·
loading
AI Generated
Machine Learning
Deep Learning
๐ข Google Quantum AI
Quantum computing drastically reduces communication needs for distributed machine learning, enabling faster and more private AI.
eXponential FAmily Dynamical Systems (XFADS): Large-scale nonlinear Gaussian state-space modeling
·1596 words·8 mins·
loading
·
loading
Machine Learning
Deep Learning
๐ข Champalimaud Research
XFADS: a novel low-rank structured VAE framework for large-scale nonlinear Gaussian state-space modeling, achieving high predictive accuracy and scalability.
Exploring Token Pruning in Vision State Space Models
·1749 words·9 mins·
loading
·
loading
Computer Vision
Image Classification
๐ข Northeastern University
This paper introduces a novel token pruning method for vision state space models, achieving significant computational reduction with minimal performance impact, addressing the limitations of directly …
Exploring the trade-off between deep-learning and explainable models for brain-machine interfaces
·2641 words·13 mins·
loading
·
loading
AI Generated
AI Applications
Healthcare
๐ข University of Michigan
KalmanNet, a novel BMI decoder, achieves state-of-the-art performance by integrating recurrent neural networks into Kalman filtering, balancing accuracy and explainability.
Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models
·2598 words·13 mins·
loading
·
loading
AI Generated
Natural Language Processing
Large Language Models
๐ข SenseTime Research
LLM-Infused Diffuser boosts text-to-image generation by smartly integrating LLMs, surpassing existing models in prompt understanding and image quality.
Exploring the Precise Dynamics of Single-Layer GAN Models: Leveraging Multi-Feature Discriminators for High-Dimensional Subspace Learning
·1590 words·8 mins·
loading
·
loading
Machine Learning
Representation Learning
๐ข Koรง University
Single-layer GANs learn data subspaces more effectively using multi-feature discriminators, enabling faster training and better feature representation than conventional methods.
Exploring the Edges of Latent State Clusters for Goal-Conditioned Reinforcement Learning
·3530 words·17 mins·
loading
·
loading
AI Generated
Machine Learning
Reinforcement Learning
๐ข Rutgers University
CE2: A new goal-directed exploration algorithm for efficient reinforcement learning in unknown environments, prioritizing accessible frontier goals via latent state clustering.
Exploring Structured Semantic Priors Underlying Diffusion Score for Test-time Adaptation
·2718 words·13 mins·
loading
·
loading
AI Generated
Computer Vision
Image Segmentation
๐ข Beihang University
DUSA:Unlocking Diffusion Models’ Discriminative Power for Efficient Test-Time Adaptation
Exploring Molecular Pretraining Model at Scale
·2151 words·11 mins·
loading
·
loading
AI Generated
Machine Learning
Self-Supervised Learning
๐ข Peking University
Uni-Mol2, a groundbreaking 1.1B parameter molecular pretraining model, reveals power-law scaling in molecular representation learning, achieving significant performance improvements on downstream task…
Exploring Low-Dimensional Subspace in Diffusion Models for Controllable Image Editing
·2111 words·10 mins·
loading
·
loading
Computer Vision
Image Generation
๐ข University of Michigan
LOCO Edit achieves precise, localized image editing in diffusion models via a single-step, training-free method leveraging low-dimensional semantic subspaces.
Exploring Fixed Point in Image Editing: Theoretical Support and Convergence Optimization
·2322 words·11 mins·
loading
·
loading
Computer Vision
Image Generation
๐ข East China Normal University
This paper theoretically proves the existence and uniqueness of fixed points in DDIM inversion, optimizing the loss function for improved image editing and extending this approach to unsupervised imag…
Exploring DCN-like architecture for fast image generation with arbitrary resolution
·1909 words·9 mins·
loading
·
loading
Computer Vision
Image Generation
๐ข Nanjing University
FlowDCN: A purely convolutional generative model achieves state-of-the-art image generation speed and quality at arbitrary resolutions, surpassing transformer-based models.
Exploring Consistency in Graph Representations: from Graph Kernels to Graph Neural Networks
·2605 words·13 mins·
loading
·
loading
AI Generated
Machine Learning
Representation Learning
๐ข Dartmouth College
Boost GNN graph classification accuracy by enforcing consistency in learned representations across layers using a novel loss function!
Exploring Behavior-Relevant and Disentangled Neural Dynamics with Generative Diffusion Models
·3176 words·15 mins·
loading
·
loading
AI Generated
Machine Learning
Deep Learning
๐ข Georgia Institute of Technology
BeNeDiff uses generative diffusion models to disentangle and interpret neural dynamics linked to specific behaviors, providing interpretable quantifications of behavior in multi-brain region datasets.