Posters

A Label is Worth A Thousand Images in Dataset Distillation

26 September 2024·2824 words·14 mins

Computer Vision Image Classification 🏢 Harvard University

Soft labels, not sophisticated data synthesis, are the key to successful dataset distillation, significantly improving data-efficient learning and challenging existing methods.

A Kernel Perspective on Distillation-based Collaborative Learning

26 September 2024·2168 words·11 mins

Machine Learning Federated Learning 🏢 Korea Advanced Institute of Science and Technology

This paper introduces DCL-KR and DCL-NN, novel distillation-based collaborative learning algorithms achieving nearly minimax optimal convergence rates in heterogeneous environments without direct data…

A Huber Loss Minimization Approach to Mean Estimation under User-level Differential Privacy

26 September 2024·334 words·2 mins

AI Generated AI Theory Privacy 🏢 Zhejiang Lab

Huber loss minimization ensures accurate and robust mean estimation under user-level differential privacy, especially for imbalanced datasets and heavy-tailed distributions.

A hierarchical decomposition for explaining ML performance discrepancies

26 September 2024·1953 words·10 mins

AI Applications Healthcare 🏢 UC San Francisco

New nonparametric framework explains ML performance gaps across domains by hierarchically decomposing discrepancies due to covariate and conditional outcome shifts, offering detailed variable-level at…

A Gradient Accumulation Method for Dense Retriever under Memory Constraint

26 September 2024·1813 words·9 mins

Natural Language Processing Question Answering 🏢 Seoul National University

CONTACCUM: Stable, efficient memory reduction for dense retrievers using dual memory banks, surpassing high-resource baselines.

A Globally Optimal Portfolio for m-Sparse Sharpe Ratio Maximization

26 September 2024·1692 words·8 mins

AI Applications Finance 🏢 Department of Mathematics

This paper introduces mSSRM-PGA, achieving globally optimal m-sparse Sharpe ratios, addressing the nonconvexity issue in portfolio optimization through a novel proximal gradient algorithm.

A Global Depth-Range-Free Multi-View Stereo Transformer Network with Pose Embedding

26 September 2024·1812 words·9 mins

Computer Vision 3D Vision 🏢 Zhejiang University

Depth-range-free MVS network using pose embedding achieves robust and accurate 3D reconstruction.

A Generative Model of Symmetry Transformations

26 September 2024·3610 words·17 mins

Machine Learning Generative Learning 🏢 University of Cambridge

Generative model learns data symmetries for improved efficiency and higher test log-likelihoods.

A General Protocol to Probe Large Vision Models for 3D Physical Understanding

26 September 2024·4012 words·19 mins

AI Generated Computer Vision 3D Vision 🏢 University of Oxford

Researchers developed a lightweight protocol to probe large vision models’ 3D physical understanding by training classifiers on model features for various scene properties (geometry, material, lightin…

A Functional Extension of Semi-Structured Networks

26 September 2024·2279 words·11 mins

Machine Learning Deep Learning 🏢 Munich Center for Machine Learning (MCML)

This paper introduces semi-structured functional networks (SSFNNs), a novel approach that combines interpretable functional regression models with deep neural networks, achieving both high accuracy an…

A Full-duplex Speech Dialogue Scheme Based On Large Language Model

26 September 2024·2100 words·10 mins

Natural Language Processing Dialogue Systems 🏢 MThreads AI

This paper introduces a novel full-duplex speech dialogue system based on LLMs, achieving significantly reduced response latency and higher interruption precision compared to half-duplex systems.

A Framework for Bilevel Optimization on Riemannian Manifolds

26 September 2024·1520 words·8 mins

Machine Learning Meta Learning 🏢 RIKEN AIP

This paper introduces a novel framework for bilevel optimization on Riemannian manifolds, providing efficient hypergradient estimation strategies and convergence analysis, with successful applications…

A Foundation Model for Zero-shot Logical Query Reasoning

26 September 2024·2687 words·13 mins

Machine Learning Deep Learning 🏢 Intel AI Lab

ULTRAQUERY: a groundbreaking foundation model for zero-shot logical query reasoning on any knowledge graph, surpassing existing methods’ limitations.

A Flexible, Equivariant Framework for Subgraph GNNs via Graph Products and Graph Coarsening

26 September 2024·3886 words·19 mins

AI Generated Machine Learning Deep Learning 🏢 Technion - Israel Institute of Technology

Flexible Subgraph GNNs, achieving scalability via graph products and coarsening, consistently outperform baselines and adapt to varying subgraph numbers.

A Fast Convoluted Story: Scaling Probabilistic Inference for Integer Arithmetics

26 September 2024·2476 words·12 mins

AI Generated AI Theory Optimization 🏢 KU Leuven

PLIAₜ uses tensor operations and the FFT to scale probabilistic inference over integer arithmetic, achieving orders-of-magnitude speedups in inference and learning times.

A distributional simplicity bias in the learning dynamics of transformers

26 September 2024·2474 words·12 mins

AI Generated Natural Language Processing Large Language Models 🏢 International School for Advanced Studies

Transformers learn increasingly complex language patterns sequentially, starting with simpler interactions before mastering higher-order ones.

A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health

26 September 2024·2747 words·13 mins

AI Generated AI Applications Healthcare 🏢 MIT

LLMs dynamically adjust restless multi-armed bandit (RMAB) resource allocation policies in public health via human-language commands.

A Critical Evaluation of AI Feedback for Aligning Large Language Models

26 September 2024·2724 words·13 mins

AI Generated Natural Language Processing Large Language Models 🏢 Stanford University

Contrary to popular belief, simple supervised fine-tuning with strong language models outperforms complex reinforcement learning in aligning large language models, significantly improving efficiency.

A Consistency-Aware Spot-Guided Transformer for Versatile and Hierarchical Point Cloud Registration

26 September 2024·2500 words·12 mins

Computer Vision 3D Vision 🏢 Zhejiang University

CAST: a novel consistency-aware spot-guided Transformer achieves state-of-the-art accuracy and efficiency in point cloud registration.

A Concept-Based Explainability Framework for Large Multimodal Models

26 September 2024·7122 words·34 mins

AI Generated Multimodal Learning Vision-Language Models 🏢 Sorbonne Université

CoX-LMM unveils a novel concept-based explainability framework for large multimodal models, extracting semantically grounded multimodal concepts to enhance interpretability.