Posters
2024
ActFusion: a Unified Diffusion Model for Action Segmentation and Anticipation
·3953 words·19 mins·
loading
·
loading
AI Generated
Computer Vision
Action Recognition
π’ Pohang University of Science and Technology
ActFusion: a unified diffusion model achieving state-of-the-art performance in both action segmentation and anticipation by jointly learning visible and invisible parts of video sequences.
ActAnywhere: Subject-Aware Video Background Generation
·1990 words·10 mins·
loading
·
loading
Computer Vision
Video Understanding
π’ Stanford University
ActAnywhere, a novel video diffusion model, seamlessly integrates foreground subjects into new backgrounds by generating realistic video backgrounds tailored to subject motion, significantly reducing …
Achieving Tractable Minimax Optimal Regret in Average Reward MDPs
·1775 words·9 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
π’ Univ. Grenoble Alpes
First tractable algorithm achieves minimax optimal regret in average-reward MDPs, solving a major computational challenge in reinforcement learning.
Achieving Near-Optimal Convergence for Distributed Minimax Optimization with Adaptive Stepsizes
·2347 words·12 mins·
loading
·
loading
AI Generated
Machine Learning
Federated Learning
π’ ETH Zurich
D-AdaST: A novel distributed adaptive minimax optimization method achieves near-optimal convergence by tracking stepsizes, solving the inconsistency problem hindering existing adaptive methods.
Achieving Linear Convergence with Parameter-Free Algorithms in Decentralized Optimization
·1448 words·7 mins·
loading
·
loading
AI Generated
Machine Learning
Optimization
π’ Innopolis University
A novel parameter-free decentralized optimization algorithm achieves linear convergence for strongly convex, smooth objectives, eliminating the need for hyperparameter tuning and improving scalability…
Achieving Domain-Independent Certified Robustness via Knowledge Continuity
·2020 words·10 mins·
loading
·
loading
AI Theory
Robustness
π’ Carnegie Mellon University
Certifying neural network robustness across diverse domains, this paper introduces knowledge continuityβa novel framework ensuring model stability independent of input type, norms, and distribution.
Achieving Constant Regret in Linear Markov Decision Processes
·1852 words·9 mins·
loading
·
loading
AI Generated
Machine Learning
Reinforcement Learning
π’ MIT
Cert-LSVI-UCB achieves constant regret in RL with linear function approximation, even under model misspecification, using a novel certified estimator.
Achieving $ ilde{O}(1/psilon)$ Sample Complexity for Constrained Markov Decision Process
·390 words·2 mins·
loading
·
loading
AI Generated
Machine Learning
Reinforcement Learning
π’ Hong Kong University of Science and Technology
Constrained Markov Decision Processes (CMDPs) get an improved sample complexity bound of Γ(1/Ξ΅) via a new algorithm, surpassing the existing O(1/Ρ²) bound.
Achievable Fairness on Your Data With Utility Guarantees
·6805 words·32 mins·
loading
·
loading
AI Generated
AI Theory
Fairness
π’ ByteDance Research
This paper introduces a computationally efficient method to approximate the optimal accuracy-fairness trade-off curve for various datasets, providing rigorous statistical guarantees and quantifying un…
Achievable distributional robustness when the robust risk is only partially identified
·1876 words·9 mins·
loading
·
loading
AI Generated
AI Theory
Robustness
π’ ETH Zurich
This paper introduces a novel framework for evaluating the robustness of machine learning models when the true data distribution is only partially known. It defines a new risk measure (‘identifiable r…
ACFun: Abstract-Concrete Fusion Facial Stylization
·2192 words·11 mins·
loading
·
loading
Computer Vision
Image Generation
π’ Xidian University
ACFun: A novel facial stylization method fusing abstract & concrete features for high-quality, artistically pleasing results from only one style & one face image.
Accurate and Steady Inertial Pose Estimation through Sequence Structure Learning and Modulation
·2334 words·11 mins·
loading
·
loading
AI Applications
Robotics
π’ School of Computer Science & Informatics, Cardiff University
Researchers enhanced transformer models for inertial pose estimation by introducing a Sequence Structure Module, leveraging inherent fixed-length sequence structures for improved accuracy and steadine…
Accuracy is Not All You Need
·5583 words·27 mins·
loading
·
loading
Natural Language Processing
Large Language Models
π’ Microsoft Research
LLM compression accuracy hides crucial behavioral changes; use % flips and KL-divergence for better evaluation.
Acceleration Exists! Optimization Problems When Oracle Can Only Compare Objective Function Values
·1792 words·9 mins·
loading
·
loading
AI Theory
Optimization
π’ Moscow Institute of Physics and Technology
Accelerated gradient-free optimization is achieved using only function value comparisons, significantly improving black-box optimization.
Accelerating Transformers with Spectrum-Preserving Token Merging
·3201 words·16 mins·
loading
·
loading
Multimodal Learning
Vision-Language Models
π’ UC San Diego
PITOME: a novel token merging method accelerates Transformers by 40-60% while preserving accuracy, prioritizing informative tokens via an energy score.
Accelerating Relative Entropy Coding with Space Partitioning
·1881 words·9 mins·
loading
·
loading
Machine Learning
Deep Learning
π’ University of Cambridge
Space partitioning dramatically speeds up relative entropy coding (REC) for neural compression, achieving 5-15% better bitrates than previous methods.
Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight
·2216 words·11 mins·
loading
·
loading
Multimodal Learning
Vision-Language Models
π’ Ant Group
Chain-of-Sight accelerates multimodal LLM pre-training by ~73% using a multi-scale visual resampling technique and a novel post-pretrain token scaling strategy, achieving comparable or superior perfor…
Accelerating Non-Maximum Suppression: A Graph Theory Perspective
·3325 words·16 mins·
loading
·
loading
AI Generated
Computer Vision
Object Detection
π’ School of Computer Science and Technology, MOEKLINNS Lab, Xi'an Jiaotong University
This paper presents QSI-NMS and BOE-NMS, novel graph theory-based algorithms that significantly speed up non-maximum suppression (NMS) in object detection without significant accuracy loss, and introd…
Accelerating Nash Equilibrium Convergence in Monte Carlo Settings Through Counterfactual Value Based Fictitious Play
·1841 words·9 mins·
loading
·
loading
AI Theory
Optimization
π’ Huazhong University of Science and Technology
MCCFVFP, a novel Monte Carlo-based algorithm, accelerates Nash equilibrium convergence in large-scale games by combining CFR’s counterfactual value calculations with fictitious play’s best response st…
Accelerating Matroid Optimization through Fast Imprecise Oracles
·485 words·3 mins·
loading
·
loading
AI Theory
Optimization
π’ Technical University of Berlin
Fast imprecise oracles drastically reduce query times in matroid optimization, achieving near-optimal performance with few accurate queries.