Skip to main content

Posters

2024

Multi-Agent Imitation Learning: Value is Easy, Regret is Hard
·1706 words·9 mins· loading · loading
AI Theory Reinforcement Learning 🏢 Carnegie Mellon University
In multi-agent imitation learning, achieving regret equivalence is harder than value equivalence; this paper introduces novel algorithms that efficiently minimize the regret gap under various assumpti…
Multi-Agent Domain Calibration with a Handful of Offline Data
·2343 words·11 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 National Key Laboratory of Novel Software Technology
Madoc: A novel multi-agent framework calibrates RL policies for new environments using limited offline data, achieving superior performance in various locomotion tasks.
Multi-Agent Coordination via Multi-Level Communication
·1851 words·9 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 Peking University
SeqComm, a novel multi-level communication scheme, tackles multi-agent coordination by leveraging asynchronous decision-making and a two-phase communication process for improved efficiency and theoret…
MTGS: A Novel Framework for Multi-Person Temporal Gaze Following and Social Gaze Prediction
·3398 words·16 mins· loading · loading
AI Generated Computer Vision Video Understanding 🏢 Idiap Research Institute
MTGS: a unified framework jointly predicts gaze and social gaze (shared attention, mutual gaze) for multiple people in videos, achieving state-of-the-art results using a temporal transformer model and…
MSPE: Multi-Scale Patch Embedding Prompts Vision Transformers to Any Resolution
·2539 words·12 mins· loading · loading
AI Generated Computer Vision Image Classification 🏢 Tsinghua University
MSPE empowers Vision Transformers to handle any image resolution by cleverly optimizing patch embedding, achieving superior performance on low-resolution images and comparable results on high-resoluti…
MSAGPT: Neural Prompting Protein Structure Prediction via MSA Generative Pre-Training
·3685 words·18 mins· loading · loading
AI Generated Machine Learning Deep Learning 🏢 Tsinghua University
MSAGPT: Revolutionizing protein structure prediction by generating accurate virtual MSAs from limited data, boosting prediction accuracy by up to +8.5% TM-Score!
MSA Generation with Seqs2Seqs Pretraining: Advancing Protein Structure Predictions
·2100 words·10 mins· loading · loading
Machine Learning Self-Supervised Learning 🏢 Fudan University
Self-supervised generative model MSA-Generator boosts protein structure prediction accuracy by producing high-quality MSAs, especially for challenging sequences lacking homologs.
MR-Ben: A Meta-Reasoning Benchmark for Evaluating System-2 Thinking in LLMs
·4032 words·19 mins· loading · loading
AI Generated Natural Language Processing Large Language Models 🏢 Chinese University of Hong Kong
MR-Ben: A new benchmark reveals LLMs’ meta-reasoning flaws, pushing the boundaries of AI evaluation beyond simple accuracy.
MoVA: Adapting Mixture of Vision Experts to Multimodal Context
·2418 words·12 mins· loading · loading
Multimodal Learning Vision-Language Models 🏢 CUHK MMLab
MoVA, a novel MLLM, enhances multimodal understanding by adaptively routing and fusing task-specific vision experts for improved generalization across diverse image content.
MotionTTT: 2D Test-Time-Training Motion Estimation for 3D Motion Corrected MRI
·2648 words·13 mins· loading · loading
AI Applications Healthcare 🏢 Technical University of Munich
MotionTTT: Deep learning enables accurate 3D motion-corrected MRI by cleverly estimating motion parameters during test-time training, significantly improving image reconstruction.
MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian Splatting
·3095 words·15 mins· loading · loading
AI Generated Computer Vision 3D Vision 🏢 University of Science and Technology of China
MotionGS enhances deformable 3D Gaussian splatting for dynamic scenes by using motion flow to guide deformation, significantly improving reconstruction accuracy and outperforming state-of-the-art meth…
MotionCraft: Physics-Based Zero-Shot Video Generation
·2646 words·13 mins· loading · loading
Computer Vision Video Understanding 🏢 Politecnico Di Torino
MotionCraft: Physics-based zero-shot video generation creates realistic videos with complex motion dynamics by cleverly warping the noise latent space of an image diffusion model using optical flow fr…
Motion Graph Unleashed: A Novel Approach to Video Prediction
·2948 words·14 mins· loading · loading
Computer Vision Video Understanding 🏢 Microsoft
Motion Graph unleashes efficient and accurate video prediction by transforming video frames into interconnected graph nodes, capturing complex motion patterns with minimal computational cost.
Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation
·2797 words·14 mins· loading · loading
AI Generated Computer Vision Video Understanding 🏢 Microsoft
Boosting video diffusion: Motion Consistency Model (MCM) disentangles motion and appearance learning for high-fidelity, fast video generation using few sampling steps.
Motif-oriented influence maximization for viral marketing in large-scale social networks
·1750 words·9 mins· loading · loading
AI Theory Optimization 🏢 Shenzhen University
Motif-oriented influence maximization tackles viral marketing’s challenge of reaching groups by proving a greedy algorithm with guaranteed approximation ratio and near-linear time complexity.
MoTE: Reconciling Generalization with Specialization for Visual-Language to Video Knowledge Transfer
·2123 words·10 mins· loading · loading
Computer Vision Video Understanding 🏢 Tongji University
MoTE: A novel framework harmonizes generalization and specialization for visual-language video knowledge transfer, achieving state-of-the-art results.
MOTE-NAS: Multi-Objective Training-based Estimate for Efficient Neural Architecture Search
·2314 words·11 mins· loading · loading
Machine Learning Deep Learning 🏢 National Central University
MOTE-NAS: A new multi-objective training-based estimate drastically improves neural architecture search efficiency, achieving state-of-the-art accuracy with significantly reduced costs.
Most Influential Subset Selection: Challenges, Promises, and Beyond
·1721 words·9 mins· loading · loading
AI Theory Interpretability 🏢 University of Illinois Urbana-Champaign
Adaptive greedy algorithms significantly improve the accuracy of identifying the most influential subset of training data, overcoming limitations of existing methods that fail to capture complex inter…
Monomial Matrix Group Equivariant Neural Functional Networks
·2706 words·13 mins· loading · loading
Machine Learning Deep Learning 🏢 National University of Singapore
Monomial-NFNs boost neural network efficiency by leveraging scaling/sign-flipping symmetries, resulting in fewer trainable parameters and competitive performance.
MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders
·1750 words·9 mins· loading · loading
Computer Vision Object Detection 🏢 UCAS-Terminus AI Lab, University of Chinese Academy of Sciences, China
MonoMAE enhances monocular 3D object detection by using depth-aware masked autoencoders to effectively handle object occlusions, achieving superior performance on both occluded and non-occluded object…