Posters

Multi-view Masked Contrastive Representation Learning for Endoscopic Video Analysis

26 September 2024·2187 words·11 mins

Computer Vision Video Understanding 🏢 Xiangtan University

Multi-view Masked Contrastive Representation Learning (M²CRL) significantly boosts endoscopic video analysis by using a novel multi-view masking strategy and contrastive learning, achieving state-of-t…

Multi-turn Reinforcement Learning with Preference Human Feedback

26 September 2024·1515 words·8 mins

Natural Language Processing Dialogue Systems 🏢 Google Research

Multi-turn RLHF surpasses single-turn methods by aligning LLMs with human preferences across entire conversations, not just individual turns. A novel mirror-descent algorithm, MTPO, is introduced, pr…

Multi-times Monte Carlo Rendering for Inter-reflection Reconstruction

26 September 2024·1845 words·9 mins

Computer Vision 3D Vision 🏢 Shanghai Jiao Tong University

Ref-MC2 reconstructs high-fidelity 3D objects with inter-reflections by using a novel multi-times Monte Carlo sampling strategy, achieving superior performance in accuracy and efficiency.

Multi-Stage Predict+Optimize for (Mixed Integer) Linear Programs

26 September 2024·2926 words·14 mins

AI Generated Machine Learning Optimization 🏢 Chinese University of Hong Kong

Multi-Stage Predict+Optimize tackles optimization problems where parameters are revealed sequentially, improving predictions and decisions through stage-wise updates.

Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model

26 September 2024·2122 words·10 mins

Computer Vision Image Classification 🏢 City University of Hong Kong

MSVMamba, a novel multi-scale vision model built on state-space models, achieves high accuracy in image classification and object detection while maintaining linear complexity, solving the long-rang…

Multi-Scale Representation Learning for Protein Fitness Prediction

26 September 2024·1447 words·7 mins

Machine Learning Representation Learning 🏢 Mila - Québec AI Institute

S3F, a novel multi-scale model, achieves state-of-the-art protein fitness prediction by integrating protein sequence, structure, and surface features.

Multi-scale Consistency for Robust 3D Registration via Hierarchical Sinkhorn Tree

26 September 2024·2306 words·11 mins

Computer Vision 3D Vision 🏢 Tsinghua University

Hierarchical Sinkhorn Tree (HST) robustly retrieves accurate 3D point cloud correspondences using multi-scale consistency, outperforming state-of-the-art methods.

Multi-Reward Best Policy Identification

26 September 2024·4494 words·22 mins

Machine Learning Reinforcement Learning 🏢 Ericsson AB

This paper introduces efficient algorithms, MR-NaS and DBMR-BPI, for identifying optimal policies across multiple reward functions in reinforcement learning, achieving competitive performance with the…

Multi-Object Hallucination in Vision Language Models

26 September 2024·2226 words·11 mins

Multimodal Learning Vision-Language Models 🏢 University of Michigan

LVLMs often hallucinate objects, a problem worsened when multiple objects are present. This paper introduces ROPE, a novel automated evaluation protocol that reveals how object class distribution and…

Multi-Object 3D Grounding with Dynamic Modules and Language-Informed Spatial Attention

26 September 2024·3055 words·15 mins

AI Generated Natural Language Processing Vision-Language Models 🏢 Department of Computer Science, Purdue University

D-LISA combines dynamic modules with language-informed spatial attention for multi-object 3D grounding, surpassing state-of-the-art accuracy by 12.8%.

Multi-model Ensemble Conformal Prediction in Dynamic Environments

26 September 2024·1865 words·9 mins

Machine Learning Deep Learning 🏢 UC Irvine

Adaptive multi-model ensemble conformal prediction achieves strongly adaptive regret, yielding more efficient prediction sets in dynamic environments.

Multi-modal Transfer Learning between Biological Foundation Models

26 September 2024·2170 words·11 mins

Multimodal Learning Vision-Language Models 🏢 InstaDeep

IsoFormer, a novel multi-modal model, accurately predicts RNA transcript isoform expression by integrating DNA, RNA, and protein sequence information, achieving state-of-the-art results.

Multi-LLM Debate: Framework, Principals, and Interventions

26 September 2024·1604 words·8 mins

Natural Language Processing Large Language Models 🏢 ByteDance Research

Boosting LLM collaboration, this research introduces a novel theoretical framework for multi-LLM debate, revealing key principles like the effect of similar models and interventions to enhance accurac…

Multi-language Diversity Benefits Autoformalization

26 September 2024·1698 words·8 mins

Natural Language Processing Large Language Models 🏢 University of Cambridge

Researchers created MMA, a large multilingual dataset of informal-formal mathematical pairs, leveraging a language model for reverse translation. Fine-tuned models achieved significantly improved aut…

Multi-Label Open Set Recognition

26 September 2024·1580 words·8 mins

Machine Learning Deep Learning 🏢 School of Computer Science and Engineering, Southeast University

SLAN, a novel approach to multi-label open-set recognition, enriches sub-labeling information using structural data to identify unknown labels.

Multi-Label Learning with Stronger Consistency Guarantees

26 September 2024·239 words·2 mins

Machine Learning Optimization 🏢 Courant Institute

Novel surrogate losses with label-independent H-consistency bounds enable stronger guarantees for multi-label learning.

Multi-Instance Partial-Label Learning with Margin Adjustment

26 September 2024·3339 words·16 mins

AI Generated Machine Learning Semi-Supervised Learning 🏢 School of Computer Science and Engineering, Southeast University

MIPLMA, a novel algorithm, enhances multi-instance partial-label learning by dynamically adjusting margins for attention scores and predicted probabilities, leading to superior performance.

Multi-hypotheses Conditioned Point Cloud Diffusion for 3D Human Reconstruction from Occluded Images

26 September 2024·2520 words·12 mins

AI Generated Computer Vision 3D Vision 🏢 KAIST

MHCDIFF, a novel pipeline, uses multi-hypotheses conditioned point cloud diffusion for accurate 3D human reconstruction from occluded images, outperforming state-of-the-art methods.

Multi-Head Mixture-of-Experts

26 September 2024·2844 words·14 mins

Natural Language Processing Large Language Models 🏢 Microsoft Research

Multi-Head Mixture-of-Experts (MH-MoE) drastically boosts large language model efficiency by activating almost all expert networks, achieving superior performance compared to existing Sparse Mixture-o…

Multi-Group Proportional Representation in Retrieval

26 September 2024·4416 words·21 mins

AI Theory Fairness 🏢 Harvard University

Multi-group Proportional Representation (MPR) tackles skewed search results by measuring representation across intersectional groups, improving fairness in image retrieval.