Posters
2024
Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection
·2705 words·13 mins·
Computer Vision
3D Vision
🏢 Hong Kong University of Science and Technology
Object-centric occupancy completion boosts 3D object detection accuracy by using temporal information from long sequences to precisely reconstruct object shapes, particularly for incomplete or distant…
Towards Exact Gradient-based Training on Analog In-memory Computing
·1654 words·8 mins·
Machine Learning
Deep Learning
🏢 Rensselaer Polytechnic Institute
Analog in-memory computing (AIMC) training suffers from asymptotic errors due to asymmetric updates. This paper rigorously proves this limitation, proposes a novel discrete-time model to characterize …
Towards Estimating Bounds on the Effect of Policies under Unobserved Confounding
·1610 words·8 mins·
AI Theory
Causality
🏢 Google DeepMind
This paper presents a novel framework for estimating bounds on policy effects under unobserved confounding, offering tighter bounds and robust estimators for higher-dimensional data.
Towards Efficient and Optimal Covariance-Adaptive Algorithms for Combinatorial Semi-Bandits
·1492 words·8 mins·
Machine Learning
Reinforcement Learning
🏢 Univ. Grenoble Alpes, Inria, CNRS, Grenoble INP, LJK
Novel covariance-adaptive algorithms achieve optimal gap-free regret bounds for combinatorial semi-bandits, improving efficiency with sampling-based approaches.
Towards Effective Planning Strategies for Dynamic Opinion Networks
·4874 words·23 mins·
AI Applications
Healthcare
🏢 University of South Carolina
This study introduces novel, scalable AI-based planning strategies for controlling misinformation spread in dynamic opinion networks, significantly improving infection rate control.
Towards Editing Time Series
·4219 words·20 mins·
AI Generated
AI Applications
Smart Cities
🏢 Microsoft Research
TEdit: a novel diffusion model edits existing time series to meet specified attribute targets, preserving other properties, solving limitations of prior synthesis methods.
Towards Dynamic Message Passing on Graphs
·2834 words·14 mins·
Machine Learning
Deep Learning
🏢 Institute of Computing Technology, CAS
N2: A novel dynamic message-passing GNN tackles message-passing bottlenecks and high computational costs by introducing learnable pseudo-nodes and dynamic pathways in a common state space, achieving s…
Towards Diverse Device Heterogeneous Federated Learning via Task Arithmetic Knowledge Integration
·2932 words·14 mins·
Machine Learning
Federated Learning
🏢 UC San Diego
TAKFL, a novel federated learning framework, tackles device heterogeneity by independently distilling knowledge from diverse devices and integrating it adaptively, achieving state-of-the-art performan…
Towards Combating Frequency Simplicity-biased Learning for Domain Generalization
·2276 words·11 mins·
Computer Vision
Domain Generalization
🏢 Shenzhen University
This paper introduces novel data augmentation modules that dynamically adjust the frequency characteristics of datasets, preventing neural networks from over-relying on simple frequency-based shortcut…
Towards Calibrated Robust Fine-Tuning of Vision-Language Models
·3938 words·19 mins·
AI Generated
Multimodal Learning
Vision-Language Models
🏢 University of Wisconsin-Madison
Calibrated robust fine-tuning boosts vision-language model accuracy and confidence in out-of-distribution scenarios by using a constrained multimodal contrastive loss and self-distillation.
Towards Accurate and Fair Cognitive Diagnosis via Monotonic Data Augmentation
·2089 words·10 mins·
AI Applications
Education
🏢 University of Science and Technology of China
CMCD framework tackles data sparsity in cognitive diagnosis by using monotonic data augmentation to improve accuracy and fairness of diagnostic results.
Towards a theory of how the structure of language is acquired by deep neural networks
·3238 words·16 mins·
AI Generated
Natural Language Processing
Large Language Models
🏢 École Polytechnique Fédérale de Lausanne
Deep learning models learn language structure through next-token prediction, but the data requirements remain unclear. This paper reveals that the effective context window, determining learning capaci…
Towards a Theoretical Understanding of the 'Reversal Curse' via Training Dynamics
·2572 words·13 mins·
Natural Language Processing
Large Language Models
🏢 UC Berkeley
LLMs struggle with simple logical reasoning due to the ‘reversal curse.’ This paper reveals that weight asymmetry during training is the culprit, offering a new theoretical perspective and potential s…
Towards a Scalable Reference-Free Evaluation of Generative Models
·3926 words·19 mins·
AI Generated
Machine Learning
Deep Learning
🏢 Chinese University of Hong Kong
FKEA: a novel, scalable method for reference-free evaluation of generative models’ diversity using random Fourier features, overcoming computational limitations of existing entropy-based scores.
Towards a 'Universal Translator' for Neural Dynamics at Single-Cell, Single-Spike Resolution
·2778 words·14 mins·
Machine Learning
Self-Supervised Learning
🏢 Columbia University
A new self-supervised learning approach, Multi-task Masking (MtM), significantly improves the prediction accuracy of neural population activity by capturing neural dynamics at multiple spatial scales,…
Toward Semantic Gaze Target Detection
·2529 words·12 mins·
Multimodal Learning
Vision-Language Models
🏢 Idiap Research Institute
Researchers developed a novel architecture for semantic gaze target detection, achieving state-of-the-art results by simultaneously predicting gaze target localization and semantic label, surpassing e…
Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing
·2046 words·10 mins·
Natural Language Processing
Large Language Models
🏢 Tencent AI Lab
ALPHALLM boosts LLM performance in complex reasoning tasks by using imagination, search, and criticism to create a self-improving loop, eliminating the need for extra training data.
Toward Robust Incomplete Multimodal Sentiment Analysis via Hierarchical Representation Learning
·1671 words·8 mins·
Multimodal Learning
Sentiment Analysis
🏢 Peking University
Hierarchical Representation Learning Framework (HRLF) significantly improves Multimodal Sentiment Analysis (MSA) accuracy by effectively addressing incomplete data through fine-grained representation …
Toward Real Ultra Image Segmentation: Leveraging Surrounding Context to Cultivate General Segmentation Model
·2381 words·12 mins·
Computer Vision
Image Segmentation
🏢 Wuhan University
SGNet cultivates general segmentation models for ultra images by integrating surrounding context, achieving significant performance improvements across various datasets.
Toward Global Convergence of Gradient EM for Over-Parameterized Gaussian Mixture Models
·345 words·2 mins·
Machine Learning
Optimization
🏢 University of Washington
Gradient EM for over-parameterized Gaussian Mixture Models globally converges with a sublinear rate, solving a longstanding open problem in machine learning.