Posters

2024

A Label is Worth A Thousand Images in Dataset Distillation
·2824 words·14 mins
Computer Vision Image Classification 🏒 Harvard University
Soft labels, not sophisticated data synthesis, are the key to successful dataset distillation, significantly improving data-efficient learning and challenging existing methods.
A Kernel Perspective on Distillation-based Collaborative Learning
·2168 words·11 mins
Machine Learning Federated Learning 🏒 Korea Advanced Institute of Science and Technology
This paper introduces DCL-KR and DCL-NN, novel distillation-based collaborative learning algorithms achieving nearly minimax optimal convergence rates in heterogeneous environments without direct data…
A Huber Loss Minimization Approach to Mean Estimation under User-level Differential Privacy
·334 words·2 mins
AI Generated AI Theory Privacy 🏒 Zhejiang Lab
Huber loss minimization ensures accurate and robust mean estimation under user-level differential privacy, especially for imbalanced datasets and heavy-tailed distributions.
A hierarchical decomposition for explaining ML performance discrepancies
·1953 words·10 mins
AI Applications Healthcare 🏒 UC San Francisco
New nonparametric framework explains ML performance gaps across domains by hierarchically decomposing discrepancies due to covariate and conditional outcome shifts, offering detailed variable-level at…
A Gradient Accumulation Method for Dense Retriever under Memory Constraint
·1813 words·9 mins
Natural Language Processing Question Answering 🏒 Seoul National University
CONTACCUM: Stable, efficient memory reduction for dense retrievers using dual memory banks, surpassing high-resource baselines.
A Globally Optimal Portfolio for m-Sparse Sharpe Ratio Maximization
·1692 words·8 mins
AI Applications Finance 🏒 Department of Mathematics
This paper introduces mSSRM-PGA, achieving globally optimal m-sparse Sharpe ratios, addressing the nonconvexity issue in portfolio optimization through a novel proximal gradient algorithm.
A Global Depth-Range-Free Multi-View Stereo Transformer Network with Pose Embedding
·1812 words·9 mins
Computer Vision 3D Vision 🏒 Zhejiang University
Depth-range-free MVS network using pose embedding achieves robust and accurate 3D reconstruction.
A Generative Model of Symmetry Transformations
·3610 words·17 mins
Machine Learning Generative Learning 🏒 University of Cambridge
Generative model learns data symmetries for improved efficiency and higher test log-likelihoods.
A General Protocol to Probe Large Vision Models for 3D Physical Understanding
·4012 words·19 mins
AI Generated Computer Vision 3D Vision 🏒 University of Oxford
Researchers developed a lightweight protocol to probe large vision models’ 3D physical understanding by training classifiers on model features for various scene properties (geometry, material, lightin…
A Functional Extension of Semi-Structured Networks
·2279 words·11 mins
Machine Learning Deep Learning 🏒 Munich Center for Machine Learning (MCML)
This paper introduces semi-structured functional networks (SSFNNs), a novel approach that combines interpretable functional regression models with deep neural networks, achieving both high accuracy an…
A Full-duplex Speech Dialogue Scheme Based On Large Language Model
·2100 words·10 mins
Natural Language Processing Dialogue Systems 🏒 MThreads AI
This paper introduces a novel full-duplex speech dialogue system based on LLMs, achieving significantly reduced response latency and higher interruption precision compared to half-duplex systems.
A Framework for Bilevel Optimization on Riemannian Manifolds
·1520 words·8 mins
Machine Learning Meta Learning 🏒 RIKEN AIP
This paper introduces a novel framework for bilevel optimization on Riemannian manifolds, providing efficient hypergradient estimation strategies and convergence analysis, with successful applications…
A Foundation Model for Zero-shot Logical Query Reasoning
·2687 words·13 mins
Machine Learning Deep Learning 🏒 Intel AI Lab
ULTRAQUERY: a foundation model for zero-shot logical query reasoning on any knowledge graph, overcoming the limitations of existing methods.
A Flexible, Equivariant Framework for Subgraph GNNs via Graph Products and Graph Coarsening
·3886 words·19 mins
AI Generated Machine Learning Deep Learning 🏒 Technion - Israel Institute of Technology
Flexible Subgraph GNNs, achieving scalability via graph products and coarsening, consistently outperform baselines and adapt to varying subgraph numbers.
A Fast Convoluted Story: Scaling Probabilistic Inference for Integer Arithmetics
·2476 words·12 mins
AI Generated AI Theory Optimization 🏒 KU Leuven
PLIA₁ uses tensor operations and the FFT to scale probabilistic inference over integer arithmetic, achieving orders-of-magnitude speedups in inference and learning times.
A distributional simplicity bias in the learning dynamics of transformers
·2474 words·12 mins
AI Generated Natural Language Processing Large Language Models 🏒 International School for Advanced Studies
Transformers learn increasingly complex language patterns sequentially, starting with simpler interactions before mastering higher-order ones.
A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health
·2747 words·13 mins
AI Generated AI Applications Healthcare 🏒 MIT
LLMs dynamically adjust restless multi-armed bandit (RMAB) resource allocation policies in public health via human-language commands.
A Critical Evaluation of AI Feedback for Aligning Large Language Models
·2724 words·13 mins
AI Generated Natural Language Processing Large Language Models 🏒 Stanford University
Contrary to popular belief, simple supervised fine-tuning with strong language models outperforms complex reinforcement learning in aligning large language models, significantly improving efficiency.
A Consistency-Aware Spot-Guided Transformer for Versatile and Hierarchical Point Cloud Registration
·2500 words·12 mins
Computer Vision 3D Vision 🏒 Zhejiang University
CAST: a novel consistency-aware spot-guided Transformer achieves state-of-the-art accuracy and efficiency in point cloud registration.
A Concept-Based Explainability Framework for Large Multimodal Models
·7122 words·34 mins
AI Generated Multimodal Learning Vision-Language Models 🏒 Sorbonne Université
CoX-LMM unveils a novel concept-based explainability framework for large multimodal models, extracting semantically grounded multimodal concepts to enhance interpretability.