Posters
2024
Multi-view Masked Contrastive Representation Learning for Endoscopic Video Analysis
·2187 words·11 mins·
loading
·
loading
Computer Vision
Video Understanding
🏢 Xiangtan University
Multi-view Masked Contrastive Representation Learning (M²CRL) significantly boosts endoscopic video analysis by using a novel multi-view masking strategy and contrastive learning, achieving state-of-t…
Multi-turn Reinforcement Learning with Preference Human Feedback
·1515 words·8 mins·
loading
·
loading
Natural Language Processing
Dialogue Systems
🏢 Google Research
Multi-turn RLHF surpasses single-turn methods by aligning LLMs with human preferences across entire conversations, not just individual turns. A novel mirror-descent algorithm, MTPO, is introduced, pr…
Multi-times Monte Carlo Rendering for Inter-reflection Reconstruction
·1845 words·9 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 Shanghai Jiao Tong University
Ref-MC2 reconstructs high-fidelity 3D objects with inter-reflections by using a novel multi-times Monte Carlo sampling strategy, achieving superior performance in accuracy and efficiency.
Multi-Stage Predict+Optimize for (Mixed Integer) Linear Programs
·2926 words·14 mins·
loading
·
loading
AI Generated
Machine Learning
Optimization
🏢 Chinese University of Hong Kong
Multi-Stage Predict+Optimize tackles optimization problems where parameters are revealed sequentially, improving predictions and decisions through stage-wise updates.
Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model
·2122 words·10 mins·
loading
·
loading
Computer Vision
Image Classification
🏢 City University of Hong Kong
MSVMamba: A novel multi-scale vision model leveraging state-space models, achieves high accuracy in image classification and object detection while maintaining linear complexity, solving the long-rang…
Multi-Scale Representation Learning for Protein Fitness Prediction
·1447 words·7 mins·
loading
·
loading
Machine Learning
Representation Learning
🏢 Mila - Québec AI Institute
S3F: a novel multi-scale model achieves state-of-the-art protein fitness prediction by integrating protein sequence, structure, and surface features.
Multi-scale Consistency for Robust 3D Registration via Hierarchical Sinkhorn Tree
·2306 words·11 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 Tsinghua University
Hierarchical Sinkhorn Tree (HST) robustly retrieves accurate 3D point cloud correspondences using multi-scale consistency, outperforming state-of-the-art methods.
Multi-Reward Best Policy Identification
·4494 words·22 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
🏢 Ericsson AB
This paper introduces efficient algorithms, MR-NaS and DBMR-BPI, for identifying optimal policies across multiple reward functions in reinforcement learning, achieving competitive performance with the…
Multi-Object Hallucination in Vision Language Models
·2226 words·11 mins·
loading
·
loading
Multimodal Learning
Vision-Language Models
🏢 University of Michigan
LVLMs often hallucinate objects, a problem worsened when multiple objects are present. This paper introduces ROPE, a novel automated evaluation protocol that reveals how object class distribution and…
Multi-Object 3D Grounding with Dynamic Modules and Language-Informed Spatial Attention
·3055 words·15 mins·
loading
·
loading
AI Generated
Natural Language Processing
Vision-Language Models
🏢 Department of Computer Science, Purdue University
D-LISA: Dynamic modules & language-informed spatial attention revolutionizes multi-object 3D grounding, surpassing state-of-the-art accuracy by 12.8%.
Multi-model Ensemble Conformal Prediction in Dynamic Environments
·1865 words·9 mins·
loading
·
loading
Machine Learning
Deep Learning
🏢 UC Irvine
Adaptive multi-model ensemble conformal prediction achieves strongly adaptive regret, yielding more efficient prediction sets in dynamic environments.
Multi-modal Transfer Learning between Biological Foundation Models
·2170 words·11 mins·
loading
·
loading
Multimodal Learning
Vision-Language Models
🏢 InstaDeep
IsoFormer, a novel multi-modal model, accurately predicts RNA transcript isoform expression by integrating DNA, RNA, and protein sequence information, achieving state-of-the-art results.
Multi-LLM Debate: Framework, Principals, and Interventions
·1604 words·8 mins·
loading
·
loading
Natural Language Processing
Large Language Models
🏢 ByteDance Research
Boosting LLM collaboration, this research introduces a novel theoretical framework for multi-LLM debate, revealing key principles like the effect of similar models and interventions to enhance accurac…
Multi-language Diversity Benefits Autoformalization
·1698 words·8 mins·
loading
·
loading
Natural Language Processing
Large Language Models
🏢 University of Cambridge
Researchers created MMA, a large multilingual dataset of informal-formal mathematical pairs, leveraging a language model for reverse translation. Fine-tuned models achieved significantly improved aut…
Multi-Label Open Set Recognition
·1580 words·8 mins·
loading
·
loading
Machine Learning
Deep Learning
🏢 School of Computer Science and Engineering, Southeast University
SLAN: A novel approach for multi-label open-set recognition, enriching sub-labeling info using structural data to identify unknown labels.
Multi-Label Learning with Stronger Consistency Guarantees
·239 words·2 mins·
loading
·
loading
Machine Learning
Optimization
🏢 Courant Institute
Novel surrogate losses with label-independent H-consistency bounds enable stronger guarantees for multi-label learning.
Multi-Instance Partial-Label Learning with Margin Adjustment
·3339 words·16 mins·
loading
·
loading
AI Generated
Machine Learning
Semi-Supervised Learning
🏢 School of Computer Science and Engineering, Southeast University
MIPLMA, a novel algorithm, enhances multi-instance partial-label learning by dynamically adjusting margins for attention scores and predicted probabilities, leading to superior performance.
Multi-hypotheses Conditioned Point Cloud Diffusion for 3D Human Reconstruction from Occluded Images
·2520 words·12 mins·
loading
·
loading
AI Generated
Computer Vision
3D Vision
🏢 KAIST
MHCDIFF: a novel pipeline using multi-hypotheses conditioned point cloud diffusion for accurate 3D human reconstruction from occluded images, outperforming state-of-the-art methods.
Multi-Head Mixture-of-Experts
·2844 words·14 mins·
loading
·
loading
Natural Language Processing
Large Language Models
🏢 Microsoft Research
Multi-Head Mixture-of-Experts (MH-MoE) drastically boosts large language model efficiency by activating almost all expert networks, achieving superior performance compared to existing Sparse Mixture-o…
Multi-Group Proportional Representation in Retrieval
·4416 words·21 mins·
loading
·
loading
AI Theory
Fairness
🏢 Harvard University
Multi-group Proportional Representation (MPR) tackles skewed search results by measuring representation across intersectional groups, improving fairness in image retrieval.