Posters

ActFusion: a Unified Diffusion Model for Action Segmentation and Anticipation

26 September 2024·3953 words·19 mins· loading · loading

AI Generated Computer Vision Action Recognition 🏢 Pohang University of Science and Technology

ActFusion: a unified diffusion model achieving state-of-the-art performance in both action segmentation and anticipation by jointly learning visible and invisible parts of video sequences.

ActAnywhere: Subject-Aware Video Background Generation

26 September 2024·1990 words·10 mins· loading · loading

Computer Vision Video Understanding 🏢 Stanford University

ActAnywhere, a novel video diffusion model, seamlessly integrates foreground subjects into new backgrounds by generating realistic video backgrounds tailored to subject motion, significantly reducing …

Achieving Tractable Minimax Optimal Regret in Average Reward MDPs

26 September 2024·1775 words·9 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 Univ. Grenoble Alpes

First tractable algorithm achieves minimax optimal regret in average-reward MDPs, solving a major computational challenge in reinforcement learning.

Achieving Near-Optimal Convergence for Distributed Minimax Optimization with Adaptive Stepsizes

26 September 2024·2347 words·12 mins· loading · loading

AI Generated Machine Learning Federated Learning 🏢 ETH Zurich

D-AdaST: A novel distributed adaptive minimax optimization method achieves near-optimal convergence by tracking stepsizes, solving the inconsistency problem hindering existing adaptive methods.

Achieving Linear Convergence with Parameter-Free Algorithms in Decentralized Optimization

26 September 2024·1448 words·7 mins· loading · loading

AI Generated Machine Learning Optimization 🏢 Innopolis University

A novel parameter-free decentralized optimization algorithm achieves linear convergence for strongly convex, smooth objectives, eliminating the need for hyperparameter tuning and improving scalability…

Achieving Domain-Independent Certified Robustness via Knowledge Continuity

26 September 2024·2020 words·10 mins· loading · loading

AI Theory Robustness 🏢 Carnegie Mellon University

Certifying neural network robustness across diverse domains, this paper introduces knowledge continuity—a novel framework ensuring model stability independent of input type, norms, and distribution.

Achieving Constant Regret in Linear Markov Decision Processes

26 September 2024·1852 words·9 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 MIT

Cert-LSVI-UCB achieves constant regret in RL with linear function approximation, even under model misspecification, using a novel certified estimator.

Achieving $ ilde{O}(1/psilon)$ Sample Complexity for Constrained Markov Decision Process

26 September 2024·390 words·2 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 Hong Kong University of Science and Technology

Constrained Markov Decision Processes (CMDPs) get an improved sample complexity bound of Õ(1/ε) via a new algorithm, surpassing the existing O(1/ε²) bound.

Achievable Fairness on Your Data With Utility Guarantees

26 September 2024·6805 words·32 mins· loading · loading

AI Generated AI Theory Fairness 🏢 ByteDance Research

This paper introduces a computationally efficient method to approximate the optimal accuracy-fairness trade-off curve for various datasets, providing rigorous statistical guarantees and quantifying un…

Achievable distributional robustness when the robust risk is only partially identified

26 September 2024·1876 words·9 mins· loading · loading

AI Generated AI Theory Robustness 🏢 ETH Zurich

This paper introduces a novel framework for evaluating the robustness of machine learning models when the true data distribution is only partially known. It defines a new risk measure (‘identifiable r…

ACFun: Abstract-Concrete Fusion Facial Stylization

26 September 2024·2192 words·11 mins· loading · loading

Computer Vision Image Generation 🏢 Xidian University

ACFun: A novel facial stylization method fusing abstract & concrete features for high-quality, artistically pleasing results from only one style & one face image.

Accurate and Steady Inertial Pose Estimation through Sequence Structure Learning and Modulation

26 September 2024·2334 words·11 mins· loading · loading

AI Applications Robotics 🏢 School of Computer Science & Informatics, Cardiff University

Researchers enhanced transformer models for inertial pose estimation by introducing a Sequence Structure Module, leveraging inherent fixed-length sequence structures for improved accuracy and steadine…

Accuracy is Not All You Need

26 September 2024·5583 words·27 mins· loading · loading

Natural Language Processing Large Language Models 🏢 Microsoft Research

LLM compression accuracy hides crucial behavioral changes; use % flips and KL-divergence for better evaluation.

Acceleration Exists! Optimization Problems When Oracle Can Only Compare Objective Function Values

26 September 2024·1792 words·9 mins· loading · loading

AI Theory Optimization 🏢 Moscow Institute of Physics and Technology

Accelerated gradient-free optimization is achieved using only function value comparisons, significantly improving black-box optimization.

Accelerating Transformers with Spectrum-Preserving Token Merging

26 September 2024·3201 words·16 mins· loading · loading

Multimodal Learning Vision-Language Models 🏢 UC San Diego

PITOME: a novel token merging method accelerates Transformers by 40-60% while preserving accuracy, prioritizing informative tokens via an energy score.

Accelerating Relative Entropy Coding with Space Partitioning

26 September 2024·1881 words·9 mins· loading · loading

Machine Learning Deep Learning 🏢 University of Cambridge

Space partitioning dramatically speeds up relative entropy coding (REC) for neural compression, achieving 5-15% better bitrates than previous methods.

Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight

26 September 2024·2216 words·11 mins· loading · loading

Multimodal Learning Vision-Language Models 🏢 Ant Group

Chain-of-Sight accelerates multimodal LLM pre-training by ~73% using a multi-scale visual resampling technique and a novel post-pretrain token scaling strategy, achieving comparable or superior perfor…

Accelerating Non-Maximum Suppression: A Graph Theory Perspective

26 September 2024·3325 words·16 mins· loading · loading

AI Generated Computer Vision Object Detection 🏢 School of Computer Science and Technology, MOEKLINNS Lab, Xi'an Jiaotong University

This paper presents QSI-NMS and BOE-NMS, novel graph theory-based algorithms that significantly speed up non-maximum suppression (NMS) in object detection without significant accuracy loss, and introd…

Accelerating Nash Equilibrium Convergence in Monte Carlo Settings Through Counterfactual Value Based Fictitious Play

26 September 2024·1841 words·9 mins· loading · loading

AI Theory Optimization 🏢 Huazhong University of Science and Technology

MCCFVFP, a novel Monte Carlo-based algorithm, accelerates Nash equilibrium convergence in large-scale games by combining CFR’s counterfactual value calculations with fictitious play’s best response st…

Accelerating Matroid Optimization through Fast Imprecise Oracles

26 September 2024·485 words·3 mins· loading · loading

AI Theory Optimization 🏢 Technical University of Berlin

Fast imprecise oracles drastically reduce query times in matroid optimization, achieving near-optimal performance with few accurate queries.