Posters
2024
A Label is Worth A Thousand Images in Dataset Distillation
·2824 words·14 mins·
Computer Vision
Image Classification
Harvard University
Soft labels, not sophisticated data synthesis, are the key to successful dataset distillation, significantly improving data-efficient learning and challenging existing methods.
A Kernel Perspective on Distillation-based Collaborative Learning
·2168 words·11 mins·
Machine Learning
Federated Learning
Korea Advanced Institute of Science and Technology
This paper introduces DCL-KR and DCL-NN, novel distillation-based collaborative learning algorithms achieving nearly minimax optimal convergence rates in heterogeneous environments without direct data…
A Huber Loss Minimization Approach to Mean Estimation under User-level Differential Privacy
·334 words·2 mins·
AI Generated
AI Theory
Privacy
Zhejiang Lab
Huber loss minimization ensures accurate and robust mean estimation under user-level differential privacy, especially for imbalanced datasets and heavy-tailed distributions.
A hierarchical decomposition for explaining ML performance discrepancies
·1953 words·10 mins·
AI Applications
Healthcare
UC San Francisco
New nonparametric framework explains ML performance gaps across domains by hierarchically decomposing discrepancies due to covariate and conditional outcome shifts, offering detailed variable-level at…
A Gradient Accumulation Method for Dense Retriever under Memory Constraint
·1813 words·9 mins·
Natural Language Processing
Question Answering
Seoul National University
CONTACCUM: Stable, efficient memory reduction for dense retrievers using dual memory banks, surpassing high-resource baselines.
A Globally Optimal Portfolio for m-Sparse Sharpe Ratio Maximization
·1692 words·8 mins·
AI Applications
Finance
Department of Mathematics
This paper introduces mSSRM-PGA, achieving globally optimal m-sparse Sharpe ratios, addressing the nonconvexity issue in portfolio optimization through a novel proximal gradient algorithm.
A Global Depth-Range-Free Multi-View Stereo Transformer Network with Pose Embedding
·1812 words·9 mins·
Computer Vision
3D Vision
Zhejiang University
Depth-range-free MVS network using pose embedding achieves robust and accurate 3D reconstruction.
A Generative Model of Symmetry Transformations
·3610 words·17 mins·
Machine Learning
Generative Learning
University of Cambridge
Generative model learns data symmetries for improved efficiency and higher test log-likelihoods.
A General Protocol to Probe Large Vision Models for 3D Physical Understanding
·4012 words·19 mins·
AI Generated
Computer Vision
3D Vision
University of Oxford
Researchers developed a lightweight protocol to probe large vision models’ 3D physical understanding by training classifiers on model features for various scene properties (geometry, material, lightin…
A Functional Extension of Semi-Structured Networks
·2279 words·11 mins·
Machine Learning
Deep Learning
Munich Center for Machine Learning (MCML)
This paper introduces semi-structured functional networks (SSFNNs), a novel approach that combines interpretable functional regression models with deep neural networks, achieving both high accuracy an…
A Full-duplex Speech Dialogue Scheme Based On Large Language Model
·2100 words·10 mins·
Natural Language Processing
Dialogue Systems
MThreads AI
This paper introduces a novel full-duplex speech dialogue system based on LLMs, achieving significantly reduced response latency and higher interruption precision compared to half-duplex systems.
A Framework for Bilevel Optimization on Riemannian Manifolds
·1520 words·8 mins·
Machine Learning
Meta Learning
RIKEN AIP
This paper introduces a novel framework for bilevel optimization on Riemannian manifolds, providing efficient hypergradient estimation strategies and convergence analysis, with successful applications…
A Foundation Model for Zero-shot Logical Query Reasoning
·2687 words·13 mins·
Machine Learning
Deep Learning
Intel AI Lab
ULTRAQUERY: a foundation model for zero-shot logical query reasoning on any knowledge graph, addressing limitations of existing methods.
A Flexible, Equivariant Framework for Subgraph GNNs via Graph Products and Graph Coarsening
·3886 words·19 mins·
AI Generated
Machine Learning
Deep Learning
Technion - Israel Institute of Technology
Flexible Subgraph GNNs, achieving scalability via graph products and coarsening, consistently outperform baselines and adapt to varying subgraph numbers.
A Fast Convoluted Story: Scaling Probabilistic Inference for Integer Arithmetics
·2476 words·12 mins·
AI Generated
AI Theory
Optimization
KU Leuven
PLIAₜ uses tensor operations and the FFT to scale probabilistic inference over integer arithmetic, achieving orders-of-magnitude speedups in inference and learning times.
A distributional simplicity bias in the learning dynamics of transformers
·2474 words·12 mins·
AI Generated
Natural Language Processing
Large Language Models
International School for Advanced Studies
Transformers learn increasingly complex language patterns sequentially, starting with simpler interactions before mastering higher-order ones.
A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health
·2747 words·13 mins·
AI Generated
AI Applications
Healthcare
MIT
LLMs dynamically adjust restless multi-armed bandit (RMAB) resource allocation policies in public health via human-language commands.
A Critical Evaluation of AI Feedback for Aligning Large Language Models
·2724 words·13 mins·
AI Generated
Natural Language Processing
Large Language Models
Stanford University
Contrary to popular belief, simple supervised fine-tuning with strong language models outperforms complex reinforcement learning in aligning large language models, significantly improving efficiency.
A Consistency-Aware Spot-Guided Transformer for Versatile and Hierarchical Point Cloud Registration
·2500 words·12 mins·
Computer Vision
3D Vision
Zhejiang University
CAST: a novel consistency-aware spot-guided Transformer achieves state-of-the-art accuracy and efficiency in point cloud registration.
A Concept-Based Explainability Framework for Large Multimodal Models
·7122 words·34 mins·
AI Generated
Multimodal Learning
Vision-Language Models
Sorbonne Université
CoX-LMM unveils a novel concept-based explainability framework for large multimodal models, extracting semantically grounded multimodal concepts to enhance interpretability.