Posters

A Comprehensive Analysis on the Learning Curve in Kernel Ridge Regression

26 September 2024·2395 words·12 mins· loading · loading

AI Theory Generalization 🏢 University of Basel

This study provides a unified theory for kernel ridge regression’s learning curve, improving existing bounds and validating the Gaussian Equivalence Property under minimal assumptions.

A Compositional Atlas for Algebraic Circuits

26 September 2024·1573 words·8 mins· loading · loading

AI Theory Causality 🏢 UC Los Angeles

This paper introduces a compositional framework for algebraic circuits, deriving novel tractability conditions for compositional inference queries and unifying existing results.

A Combinatorial Algorithm for the Semi-Discrete Optimal Transport Problem

26 September 2024·1938 words·10 mins· loading · loading

AI Theory Optimization 🏢 Duke University

A new combinatorial algorithm dramatically speeds up semi-discrete optimal transport calculations, offering an efficient solution for large datasets and higher dimensions.

A Closer Look at the CLS Token for Cross-Domain Few-Shot Learning

26 September 2024·3699 words·18 mins· loading · loading

AI Generated Computer Vision Few-Shot Learning 🏢 Huazhong University of Science and Technology

Leaving the CLS token of a Vision Transformer randomly initialized during cross-domain few-shot learning consistently improves performance; a novel method leveraging this phenomenon achieves state-of-…

A Closer Look at AUROC and AUPRC under Class Imbalance

26 September 2024·2353 words·12 mins· loading · loading

AI Theory Fairness 🏢 Harvard University

Debunking a common myth, this paper proves that AUPRC is not superior to AUROC for imbalanced datasets, and in fact, can worsen algorithmic bias.

A Cat Is A Cat (Not A Dog!): Unraveling Information Mix-ups in Text-to-Image Encoders through Causal Analysis and Embedding Optimization

26 September 2024·3344 words·16 mins· loading · loading

Multimodal Learning Vision-Language Models 🏢 National Yang Ming Chiao Tung University

Researchers unveil how causal text encoding in text-to-image models leads to information loss and bias, proposing a novel training-free optimization method that significantly improves information bala…

A Canonicalization Perspective on Invariant and Equivariant Learning

26 September 2024·2927 words·14 mins· loading · loading

AI Generated Machine Learning Deep Learning 🏢 Peking University

Canonicalization simplifies invariant and equivariant learning by connecting frames to canonical forms, leading to novel, superior frame designs for eigenvector symmetries.

A Boosting-Type Convergence Result for AdaBoost.MH with Factorized Multi-Class Classifiers

26 September 2024·358 words·2 mins· loading · loading

AI Generated AI Theory Optimization 🏢 Wuhan University

Solved a long-standing open problem: Factorized ADABOOST.MH now has a proven convergence rate!

A Best-of-both-worlds Algorithm for Bandits with Delayed Feedback with Robustness to Excessive Delays

26 September 2024·484 words·3 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 Churney ApS

New best-of-both-worlds bandit algorithm tolerates arbitrary excessive delays, overcoming limitations of prior work that required prior knowledge of maximal delay and suffered linear regret dependence…

A Bayesian Approach to Data Point Selection

26 September 2024·3079 words·15 mins· loading · loading

AI Generated Machine Learning Deep Learning 🏢 Microsoft Research

BADS: a novel Bayesian approach to data point selection efficiently optimizes deep learning models by jointly inferring instance weights and model parameters using stochastic gradient Langevin dynamic…

A Bayesian Approach for Personalized Federated Learning in Heterogeneous Settings

26 September 2024·1730 words·9 mins· loading · loading

Machine Learning Federated Learning 🏢 University of Texas at Austin

FedBNN: a novel Bayesian framework for personalized federated learning, achieves superior performance in heterogeneous settings while ensuring strict privacy via differential privacy.

4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models

26 September 2024·1721 words·9 mins· loading · loading

Computer Vision Video Understanding 🏢 Snap Inc.

4Real: Photorealistic 4D scene generation from text prompts using video diffusion models, exceeding object-centric approaches for higher realism and efficiency.

4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities

26 September 2024·5846 words·28 mins· loading · loading

AI Generated Multimodal Learning Vision-Language Models 🏢 Swiss Federal Institute of Technology Lausanne (EPFL)

4M-21 achieves any-to-any predictions across 21 diverse vision modalities using a single model, exceeding prior state-of-the-art performance.

4Diffusion: Multi-view Video Diffusion Model for 4D Generation

26 September 2024·2302 words·11 mins· loading · loading

Computer Vision 3D Vision 🏢 Beihang University

4Diffusion generates high-quality, temporally consistent 4D content from monocular videos using a unified multi-view diffusion model and novel loss functions.

4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization

26 September 2024·1909 words·9 mins· loading · loading

Computer Vision 3D Vision 🏢 Seoul National University

Uncertainty-aware 4D Gaussian Splatting enhances dynamic scene reconstruction from monocular videos by selectively applying regularization to uncertain regions, improving both novel view synthesis and…

4-bit Shampoo for Memory-Efficient Network Training

26 September 2024·3782 words·18 mins· loading · loading

AI Generated Machine Learning Deep Learning 🏢 Beijing Normal University

4-bit Shampoo achieves comparable performance to its 32-bit counterpart while drastically reducing memory usage, enabling efficient training of significantly larger neural networks.

3DET-Mamba: Causal Sequence Modelling for End-to-End 3D Object Detection

26 September 2024·1690 words·8 mins· loading · loading

Computer Vision 3D Vision 🏢 Fudan University

3DET-Mamba: A novel end-to-end 3D object detector leveraging the Mamba state space model for efficient and accurate object detection in complex indoor scenes, outperforming previous 3DETR models.

3D Structure Prediction of Atomic Systems with Flow-based Direct Preference Optimization

26 September 2024·2483 words·12 mins· loading · loading

AI Generated AI Applications Healthcare 🏢 Tsinghua University

FlowDPO: Revolutionizing 3D structure prediction with flexible probability paths & Direct Preference Optimization for enhanced accuracy and reduced hallucinations.

3D Gaussian Rendering Can Be Sparser: Efficient Rendering via Learned Fragment Pruning

26 September 2024·1720 words·9 mins· loading · loading

Computer Vision 3D Vision 🏢 Georgia Institute of Technology

Learned fragment pruning accelerates 3D Gaussian splatting rendering by selectively removing fragments, achieving up to 1.71x speedup on edge GPUs and 0.16 PSNR improvement.

3D Focusing-and-Matching Network for Multi-Instance Point Cloud Registration

26 September 2024·1762 words·9 mins· loading · loading

Computer Vision 3D Vision 🏢 Northwestern Polytechnical University

3DFMNet: A novel two-stage network for multi-instance point cloud registration, achieving state-of-the-art accuracy by focusing on object centers first and then performing pairwise registration.