Posters

Multi-Agent Imitation Learning: Value is Easy, Regret is Hard

26 September 2024·1706 words·9 mins· loading · loading

AI Theory Reinforcement Learning 🏢 Carnegie Mellon University

In multi-agent imitation learning, achieving regret equivalence is harder than value equivalence; this paper introduces novel algorithms that efficiently minimize the regret gap under various assumpti…

Multi-Agent Domain Calibration with a Handful of Offline Data

26 September 2024·2343 words·11 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 National Key Laboratory of Novel Software Technology

Madoc: A novel multi-agent framework calibrates RL policies for new environments using limited offline data, achieving superior performance in various locomotion tasks.

Multi-Agent Coordination via Multi-Level Communication

26 September 2024·1851 words·9 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 Peking University

SeqComm, a novel multi-level communication scheme, tackles multi-agent coordination by leveraging asynchronous decision-making and a two-phase communication process for improved efficiency and theoret…

MTGS: A Novel Framework for Multi-Person Temporal Gaze Following and Social Gaze Prediction

26 September 2024·3398 words·16 mins· loading · loading

AI Generated Computer Vision Video Understanding 🏢 Idiap Research Institute

MTGS: a unified framework jointly predicts gaze and social gaze (shared attention, mutual gaze) for multiple people in videos, achieving state-of-the-art results using a temporal transformer model and…

MSPE: Multi-Scale Patch Embedding Prompts Vision Transformers to Any Resolution

26 September 2024·2539 words·12 mins· loading · loading

AI Generated Computer Vision Image Classification 🏢 Tsinghua University

MSPE empowers Vision Transformers to handle any image resolution by cleverly optimizing patch embedding, achieving superior performance on low-resolution images and comparable results on high-resoluti…

MSAGPT: Neural Prompting Protein Structure Prediction via MSA Generative Pre-Training

26 September 2024·3685 words·18 mins· loading · loading

AI Generated Machine Learning Deep Learning 🏢 Tsinghua University

MSAGPT: Revolutionizing protein structure prediction by generating accurate virtual MSAs from limited data, boosting prediction accuracy by up to +8.5% TM-Score!

MSA Generation with Seqs2Seqs Pretraining: Advancing Protein Structure Predictions

26 September 2024·2100 words·10 mins· loading · loading

Machine Learning Self-Supervised Learning 🏢 Fudan University

Self-supervised generative model MSA-Generator boosts protein structure prediction accuracy by producing high-quality MSAs, especially for challenging sequences lacking homologs.

MR-Ben: A Meta-Reasoning Benchmark for Evaluating System-2 Thinking in LLMs

26 September 2024·4032 words·19 mins· loading · loading

AI Generated Natural Language Processing Large Language Models 🏢 Chinese University of Hong Kong

MR-Ben: A new benchmark reveals LLMs’ meta-reasoning flaws, pushing the boundaries of AI evaluation beyond simple accuracy.

MoVA: Adapting Mixture of Vision Experts to Multimodal Context

26 September 2024·2418 words·12 mins· loading · loading

Multimodal Learning Vision-Language Models 🏢 CUHK MMLab

MoVA, a novel MLLM, enhances multimodal understanding by adaptively routing and fusing task-specific vision experts for improved generalization across diverse image content.

MotionTTT: 2D Test-Time-Training Motion Estimation for 3D Motion Corrected MRI

26 September 2024·2648 words·13 mins· loading · loading

AI Applications Healthcare 🏢 Technical University of Munich

MotionTTT: Deep learning enables accurate 3D motion-corrected MRI by cleverly estimating motion parameters during test-time training, significantly improving image reconstruction.

MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian Splatting

26 September 2024·3095 words·15 mins· loading · loading

AI Generated Computer Vision 3D Vision 🏢 University of Science and Technology of China

MotionGS enhances deformable 3D Gaussian splatting for dynamic scenes by using motion flow to guide deformation, significantly improving reconstruction accuracy and outperforming state-of-the-art meth…

MotionCraft: Physics-Based Zero-Shot Video Generation

26 September 2024·2646 words·13 mins· loading · loading

Computer Vision Video Understanding 🏢 Politecnico Di Torino

MotionCraft: Physics-based zero-shot video generation creates realistic videos with complex motion dynamics by cleverly warping the noise latent space of an image diffusion model using optical flow fr…

Motion Graph Unleashed: A Novel Approach to Video Prediction

26 September 2024·2948 words·14 mins· loading · loading

Computer Vision Video Understanding 🏢 Microsoft

Motion Graph unleashes efficient and accurate video prediction by transforming video frames into interconnected graph nodes, capturing complex motion patterns with minimal computational cost.

Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation

26 September 2024·2797 words·14 mins· loading · loading

AI Generated Computer Vision Video Understanding 🏢 Microsoft

Boosting video diffusion: Motion Consistency Model (MCM) disentangles motion and appearance learning for high-fidelity, fast video generation using few sampling steps.

Motif-oriented influence maximization for viral marketing in large-scale social networks

26 September 2024·1750 words·9 mins· loading · loading

AI Theory Optimization 🏢 Shenzhen University

Motif-oriented influence maximization tackles viral marketing’s challenge of reaching groups by proving a greedy algorithm with guaranteed approximation ratio and near-linear time complexity.

MoTE: Reconciling Generalization with Specialization for Visual-Language to Video Knowledge Transfer

26 September 2024·2123 words·10 mins· loading · loading

Computer Vision Video Understanding 🏢 Tongji University

MoTE: A novel framework harmonizes generalization and specialization for visual-language video knowledge transfer, achieving state-of-the-art results.

MOTE-NAS: Multi-Objective Training-based Estimate for Efficient Neural Architecture Search

26 September 2024·2314 words·11 mins· loading · loading

Machine Learning Deep Learning 🏢 National Central University

MOTE-NAS: A new multi-objective training-based estimate drastically improves neural architecture search efficiency, achieving state-of-the-art accuracy with significantly reduced costs.

Most Influential Subset Selection: Challenges, Promises, and Beyond

26 September 2024·1721 words·9 mins· loading · loading

AI Theory Interpretability 🏢 University of Illinois Urbana-Champaign

Adaptive greedy algorithms significantly improve the accuracy of identifying the most influential subset of training data, overcoming limitations of existing methods that fail to capture complex inter…

Monomial Matrix Group Equivariant Neural Functional Networks

26 September 2024·2706 words·13 mins· loading · loading

Machine Learning Deep Learning 🏢 National University of Singapore

Monomial-NFNs boost neural network efficiency by leveraging scaling/sign-flipping symmetries, resulting in fewer trainable parameters and competitive performance.

MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders

26 September 2024·1750 words·9 mins· loading · loading

Computer Vision Object Detection 🏢 UCAS-Terminus AI Lab, University of Chinese Academy of Sciences, China

MonoMAE enhances monocular 3D object detection by using depth-aware masked autoencoders to effectively handle object occlusions, achieving superior performance on both occluded and non-occluded object…