Skip to main content

Posters

2024

MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps
·2981 words·14 mins· loading · loading
AI Generated Computer Vision 3D Vision 🏢 National University of Singapore
MVSDet uses efficient plane sweeps for accurate indoor 3D object detection from multiple images, significantly outperforming previous NeRF-based methods.
MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and 3D Editing
·3629 words·18 mins· loading · loading
Computer Vision 3D Vision 🏢 Fudan University
MVInpainter: Pose-free multi-view consistent inpainting bridges 2D and 3D editing by simplifying 3D editing to a multi-view 2D inpainting task.
MVGamba: Unify 3D Content Generation as State Space Sequence Modeling
·2497 words·12 mins· loading · loading
Computer Vision 3D Vision 🏢 Nanyang Technological University
MVGamba: A unified, feed-forward 3D content generation model achieving state-of-the-art quality and speed using an RNN-like state space model for efficient multi-view Gaussian reconstruction.
MV2Cyl: Reconstructing 3D Extrusion Cylinders from Multi-View Images
·3293 words·16 mins· loading · loading
Computer Vision 3D Vision 🏢 Korea Advanced Institute of Science and Technology
MV2Cyl: A novel method reconstructs 3D extrusion cylinder CAD models directly from multi-view images, surpassing accuracy of methods using raw 3D geometry.
MUVERA: Multi-Vector Retrieval via Fixed Dimensional Encoding
·2545 words·12 mins· loading · loading
Natural Language Processing Information Retrieval 🏢 Google Research
MUVERA: Revolutionizing multi-vector retrieval with single-vector speed and accuracy!
Mutual Information Estimation via Normalizing Flows
·2080 words·10 mins· loading · loading
AI Theory Representation Learning 🏢 Skoltech
Researchers introduce a novel approach to mutual information (MI) estimation using normalizing flows, providing accurate estimates even in high dimensions.
Mutual Information Estimation via $f$-Divergence and Data Derangements
·3089 words·15 mins· loading · loading
Machine Learning Deep Learning 🏢 University of Klagenfurt
f-DIME: a novel class of discriminative mutual information estimators using f-divergence outperforms state-of-the-art methods by achieving an excellent bias-variance trade-off. This is achieved throug…
Mutli-Armed Bandits with Network Interference
·1421 words·7 mins· loading · loading
AI Theory Causality 🏢 UC Berkeley
New algorithms conquer regret in multi-armed bandits challenged by network interference, achieving provably low regret with both known and unknown network structures.
MutaPLM: Protein Language Modeling for Mutation Explanation and Engineering
·2665 words·13 mins· loading · loading
Natural Language Processing Large Language Models 🏢 Tsinghua University
MutaPLM: a novel protein language model, provides human-understandable mutation explanations and designs novel mutations with desirable properties using a unique protein delta network and chain-of-tho…
Multiview Scene Graph
·2365 words·12 mins· loading · loading
Computer Vision Scene Understanding 🏢 New York University
AI models struggle to understand 3D space like humans do. This paper introduces Multiview Scene Graphs (MSGs) – a new topological scene representation using interconnected place and object nodes buil…
Multivariate Stochastic Dominance via Optimal Transport and Applications to Models Benchmarking
·1726 words·9 mins· loading · loading
AI Generated Natural Language Processing Large Language Models 🏢 Cornell University
This paper introduces an efficient multivariate stochastic dominance test using optimal transport, enabling robust model benchmarking by considering metric dependencies.
Multivariate Probabilistic Time Series Forecasting with Correlated Errors
·6695 words·32 mins· loading · loading
AI Generated Machine Learning Deep Learning 🏢 McGill University
Boost multivariate time series forecasting accuracy by efficiently learning the complex correlation structure of prediction errors, enhancing reliability without expanding model size.
Multistep Distillation of Diffusion Models via Moment Matching
·2156 words·11 mins· loading · loading
Computer Vision Image Generation 🏢 Google DeepMind
New method distills slow diffusion models into fast, few-step models by matching data expectations, achieving state-of-the-art results on ImageNet.
MultiPull: Detailing Signed Distance Functions by Pulling Multi-Level Queries at Multi-Step
·3626 words·18 mins· loading · loading
Computer Vision 3D Vision 🏢 Tsinghua University
MultiPull: a novel method reconstructing detailed 3D surfaces from raw point clouds using multi-step optimization of multi-level features, significantly improving accuracy and detail.
Multiple Physics Pretraining for Spatiotemporal Surrogate Models
·3133 words·15 mins· loading · loading
Machine Learning Self-Supervised Learning 🏢 Flatiron Institute
Multiple Physics Pretraining (MPP) revolutionizes spatiotemporal physical surrogate modeling by pretraining transformers on diverse physics simultaneously, enabling accurate predictions on unseen syst…
Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning
·2386 words·12 mins· loading · loading
Multimodal Learning Vision-Language Models 🏢 IBM Research
Large Multimodal Models (LMMs) are limited by their context length during many-shot in-context learning. This paper introduces Multimodal Task Vectors (MTV), a method to compress numerous in-context …
Multimodal Large Language Models Make Text-to-Image Generative Models Align Better
·4263 words·21 mins· loading · loading
Multimodal Learning Vision-Language Models 🏢 Microsoft Research
AI-generated preference data improves text-to-image alignment.
Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization
·4050 words·20 mins· loading · loading
AI Theory Interpretability 🏢 Queen Mary University of London
Multilinear Mixture of Experts (μMoE) achieves scalable expert specialization in deep neural networks through tensor factorization, enabling efficient fine-tuning and interpretable model editing.
Multidimensional Fractional Programming for Normalized Cuts
·1661 words·8 mins· loading · loading
Machine Learning Unsupervised Learning 🏢 School of Science and Engineering, the Chinese University of Hong Kong (Shenzhen)
Multidimensional Fractional Programming (MFP) efficiently solves the challenging Normalized Cut (NCut) problem for multi-class clustering, outperforming existing methods.
Multi-Winner Reconfiguration
·1937 words·10 mins· loading · loading
AI Theory Optimization 🏢 TU Wien
This paper introduces a novel model for multi-winner reconfiguration, analyzing the computational complexity of transitioning between committees using four approval-based voting rules, demonstrating b…