Posters

2024

A Comprehensive Analysis on the Learning Curve in Kernel Ridge Regression
·2395 words·12 mins
AI Theory Generalization 🏢 University of Basel
This study provides a unified theory for kernel ridge regression’s learning curve, improving existing bounds and validating the Gaussian Equivalence Property under minimal assumptions.
A Compositional Atlas for Algebraic Circuits
·1573 words·8 mins
AI Theory Causality 🏢 UC Los Angeles
This paper introduces a compositional framework for algebraic circuits, deriving novel tractability conditions for compositional inference queries and unifying existing results.
A Combinatorial Algorithm for the Semi-Discrete Optimal Transport Problem
·1938 words·10 mins
AI Theory Optimization 🏢 Duke University
A new combinatorial algorithm dramatically speeds up semi-discrete optimal transport calculations, offering an efficient solution for large datasets and higher dimensions.
A Closer Look at the CLS Token for Cross-Domain Few-Shot Learning
·3699 words·18 mins
AI Generated Computer Vision Few-Shot Learning 🏢 Huazhong University of Science and Technology
Leaving the CLS token of a Vision Transformer randomly initialized during cross-domain few-shot learning consistently improves performance; a novel method leveraging this phenomenon achieves state-of-…
A Closer Look at AUROC and AUPRC under Class Imbalance
·2353 words·12 mins
AI Theory Fairness 🏢 Harvard University
Debunking a common myth, this paper proves that AUPRC is not superior to AUROC for imbalanced datasets and can in fact worsen algorithmic bias.
A Cat Is A Cat (Not A Dog!): Unraveling Information Mix-ups in Text-to-Image Encoders through Causal Analysis and Embedding Optimization
·3344 words·16 mins
Multimodal Learning Vision-Language Models 🏢 National Yang Ming Chiao Tung University
Researchers unveil how causal text encoding in text-to-image models leads to information loss and bias, proposing a novel training-free optimization method that significantly improves information bala…
A Canonicalization Perspective on Invariant and Equivariant Learning
·2927 words·14 mins
AI Generated Machine Learning Deep Learning 🏢 Peking University
Canonicalization simplifies invariant and equivariant learning by connecting frames to canonical forms, leading to novel, superior frame designs for eigenvector symmetries.
A Boosting-Type Convergence Result for AdaBoost.MH with Factorized Multi-Class Classifiers
·358 words·2 mins
AI Generated AI Theory Optimization 🏢 Wuhan University
This paper solves a long-standing open problem by proving a convergence rate for factorized AdaBoost.MH.
A Best-of-both-worlds Algorithm for Bandits with Delayed Feedback with Robustness to Excessive Delays
·484 words·3 mins
Machine Learning Reinforcement Learning 🏢 Churney ApS
New best-of-both-worlds bandit algorithm tolerates arbitrary excessive delays, overcoming limitations of prior work that required prior knowledge of maximal delay and suffered linear regret dependence…
A Bayesian Approach to Data Point Selection
·3079 words·15 mins
AI Generated Machine Learning Deep Learning 🏢 Microsoft Research
BADS, a novel Bayesian approach to data point selection, efficiently optimizes deep learning models by jointly inferring instance weights and model parameters using stochastic gradient Langevin dynamic…
A Bayesian Approach for Personalized Federated Learning in Heterogeneous Settings
·1730 words·9 mins
Machine Learning Federated Learning 🏢 University of Texas at Austin
FedBNN, a novel Bayesian framework for personalized federated learning, achieves superior performance in heterogeneous settings while ensuring strict privacy via differential privacy.
4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models
·1721 words·9 mins
Computer Vision Video Understanding 🏢 Snap Inc.
4Real: Photorealistic 4D scene generation from text prompts using video diffusion models, exceeding object-centric approaches for higher realism and efficiency.
4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities
·5846 words·28 mins
AI Generated Multimodal Learning Vision-Language Models 🏢 Swiss Federal Institute of Technology Lausanne (EPFL)
4M-21 achieves any-to-any predictions across 21 diverse vision modalities using a single model, exceeding prior state-of-the-art performance.
4Diffusion: Multi-view Video Diffusion Model for 4D Generation
·2302 words·11 mins
Computer Vision 3D Vision 🏢 Beihang University
4Diffusion generates high-quality, temporally consistent 4D content from monocular videos using a unified multi-view diffusion model and novel loss functions.
4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization
·1909 words·9 mins
Computer Vision 3D Vision 🏢 Seoul National University
Uncertainty-aware 4D Gaussian Splatting enhances dynamic scene reconstruction from monocular videos by selectively applying regularization to uncertain regions, improving both novel view synthesis and…
4-bit Shampoo for Memory-Efficient Network Training
·3782 words·18 mins
AI Generated Machine Learning Deep Learning 🏢 Beijing Normal University
4-bit Shampoo achieves comparable performance to its 32-bit counterpart while drastically reducing memory usage, enabling efficient training of significantly larger neural networks.
3DET-Mamba: Causal Sequence Modelling for End-to-End 3D Object Detection
·1690 words·8 mins
Computer Vision 3D Vision 🏢 Fudan University
3DET-Mamba: A novel end-to-end 3D object detector leveraging the Mamba state space model for efficient and accurate object detection in complex indoor scenes, outperforming previous 3DETR models.
3D Structure Prediction of Atomic Systems with Flow-based Direct Preference Optimization
·2483 words·12 mins
AI Generated AI Applications Healthcare 🏢 Tsinghua University
FlowDPO combines flexible probability paths with Direct Preference Optimization to improve the accuracy of 3D structure prediction and reduce hallucinations.
3D Gaussian Rendering Can Be Sparser: Efficient Rendering via Learned Fragment Pruning
·1720 words·9 mins
Computer Vision 3D Vision 🏢 Georgia Institute of Technology
Learned fragment pruning accelerates 3D Gaussian splatting rendering by selectively removing fragments, achieving up to 1.71x speedup on edge GPUs and 0.16 PSNR improvement.
3D Focusing-and-Matching Network for Multi-Instance Point Cloud Registration
·1762 words·9 mins
Computer Vision 3D Vision 🏢 Northwestern Polytechnical University
3DFMNet: A novel two-stage network for multi-instance point cloud registration, achieving state-of-the-art accuracy by focusing on object centers first and then performing pairwise registration.