Posters

Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection

26 September 2024·2705 words·13 mins· loading · loading

Computer Vision 3D Vision 🏢 Hong Kong University of Science and Technology

Object-centric occupancy completion boosts 3D object detection accuracy by using temporal information from long sequences to precisely reconstruct object shapes, particularly for incomplete or distant…

Towards Exact Gradient-based Training on Analog In-memory Computing

26 September 2024·1654 words·8 mins· loading · loading

Machine Learning Deep Learning 🏢 Rensselaer Polytechnic Institute

Analog in-memory computing (AIMC) training suffers from asymptotic errors due to asymmetric updates. This paper rigorously proves this limitation, proposes a novel discrete-time model to characterize …

Towards Estimating Bounds on the Effect of Policies under Unobserved Confounding

26 September 2024·1610 words·8 mins· loading · loading

AI Theory Causality 🏢 Google DeepMind

This paper presents a novel framework for estimating bounds on policy effects under unobserved confounding, offering tighter bounds and robust estimators for higher-dimensional data.

Towards Efficient and Optimal Covariance-Adaptive Algorithms for Combinatorial Semi-Bandits

26 September 2024·1492 words·8 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 Univ. Grenoble Alpes, Inria, CNRS, Grenoble INP, LJK

Novel covariance-adaptive algorithms achieve optimal gap-free regret bounds for combinatorial semi-bandits, improving efficiency with sampling-based approaches.

Towards Effective Planning Strategies for Dynamic Opinion Networks

26 September 2024·4874 words·23 mins· loading · loading

AI Applications Healthcare 🏢 University of South Carolina

This study introduces novel, scalable AI-based planning strategies for controlling misinformation spread in dynamic opinion networks, significantly improving infection rate control.

Towards Editing Time Series

26 September 2024·4219 words·20 mins· loading · loading

AI Generated AI Applications Smart Cities 🏢 Microsoft Research

TEdit: a novel diffusion model edits existing time series to meet specified attribute targets, preserving other properties, solving limitations of prior synthesis methods.

Towards Dynamic Message Passing on Graphs

26 September 2024·2834 words·14 mins· loading · loading

Machine Learning Deep Learning 🏢 Institute of Computing Technology, CAS

N2: A novel dynamic message-passing GNN tackles message-passing bottlenecks and high computational costs by introducing learnable pseudo-nodes and dynamic pathways in a common state space, achieving s…

Towards Diverse Device Heterogeneous Federated Learning via Task Arithmetic Knowledge Integration

26 September 2024·2932 words·14 mins· loading · loading

Machine Learning Federated Learning 🏢 UC San Diego

TAKFL, a novel federated learning framework, tackles device heterogeneity by independently distilling knowledge from diverse devices and integrating it adaptively, achieving state-of-the-art performan…

Towards Combating Frequency Simplicity-biased Learning for Domain Generalization

26 September 2024·2276 words·11 mins· loading · loading

Computer Vision Domain Generalization 🏢 Shenzhen University

This paper introduces novel data augmentation modules that dynamically adjust the frequency characteristics of datasets, preventing neural networks from over-relying on simple frequency-based shortcut…

Towards Calibrated Robust Fine-Tuning of Vision-Language Models

26 September 2024·3938 words·19 mins· loading · loading

AI Generated Multimodal Learning Vision-Language Models 🏢 University of Wisconsin-Madison

Calibrated robust fine-tuning boosts vision-language model accuracy and confidence in out-of-distribution scenarios by using a constrained multimodal contrastive loss and self-distillation.

Towards Accurate and Fair Cognitive Diagnosis via Monotonic Data Augmentation

26 September 2024·2089 words·10 mins· loading · loading

AI Applications Education 🏢 University of Science and Technology of China

CMCD framework tackles data sparsity in cognitive diagnosis by using monotonic data augmentation to improve accuracy and fairness of diagnostic results.

Towards a theory of how the structure of language is acquired by deep neural networks

26 September 2024·3238 words·16 mins· loading · loading

AI Generated Natural Language Processing Large Language Models 🏢 École Polytechnique Fédérale De Lausanne

Deep learning models learn language structure through next-token prediction, but the data requirements remain unclear. This paper reveals that the effective context window, determining learning capaci…

Towards a Theoretical Understanding of the 'Reversal Curse' via Training Dynamics

26 September 2024·2572 words·13 mins· loading · loading

Natural Language Processing Large Language Models 🏢 UC Berkeley

LLMs struggle with simple logical reasoning due to the ‘reversal curse.’ This paper reveals that weight asymmetry during training is the culprit, offering a new theoretical perspective and potential s…

Towards a Scalable Reference-Free Evaluation of Generative Models

26 September 2024·3926 words·19 mins· loading · loading

AI Generated Machine Learning Deep Learning 🏢 Chinese University of Hong Kong

FKEA: a novel, scalable method for reference-free evaluation of generative models’ diversity using random Fourier features, overcoming computational limitations of existing entropy-based scores.

Towards a 'Universal Translator' for Neural Dynamics at Single-Cell, Single-Spike Resolution

26 September 2024·2778 words·14 mins· loading · loading

Machine Learning Self-Supervised Learning 🏢 Columbia University

A new self-supervised learning approach, Multi-task Masking (MtM), significantly improves the prediction accuracy of neural population activity by capturing neural dynamics at multiple spatial scales,…

Toward Semantic Gaze Target Detection

26 September 2024·2529 words·12 mins· loading · loading

Multimodal Learning Vision-Language Models 🏢 Idiap Research Institute

Researchers developed a novel architecture for semantic gaze target detection, achieving state-of-the-art results by simultaneously predicting gaze target localization and semantic label, surpassing e…

Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

26 September 2024·2046 words·10 mins· loading · loading

Natural Language Processing Large Language Models 🏢 Tencent AI Lab

ALPHALLM boosts LLM performance in complex reasoning tasks by using imagination, search, and criticism to create a self-improving loop, eliminating the need for extra training data.

Toward Robust Incomplete Multimodal Sentiment Analysis via Hierarchical Representation Learning

26 September 2024·1671 words·8 mins· loading · loading

Multimodal Learning Sentiment Analysis 🏢 Peking University

Hierarchical Representation Learning Framework (HRLF) significantly improves Multimodal Sentiment Analysis (MSA) accuracy by effectively addressing incomplete data through fine-grained representation …

Toward Real Ultra Image Segmentation: Leveraging Surrounding Context to Cultivate General Segmentation Model

26 September 2024·2381 words·12 mins· loading · loading

Computer Vision Image Segmentation 🏢 Wuhan University

SGNet cultivates general segmentation models for ultra images by integrating surrounding context, achieving significant performance improvements across various datasets.

Toward Global Convergence of Gradient EM for Over-Paramterized Gaussian Mixture Models

26 September 2024·345 words·2 mins· loading · loading

Machine Learning Optimization 🏢 University of Washington

Gradient EM for over-parameterized Gaussian Mixture Models globally converges with a sublinear rate, solving a longstanding open problem in machine learning.