🏢 Southeast University

What Makes Partial-Label Learning Algorithms Effective?

26 September 2024·1666 words·8 mins· loading · loading

Machine Learning Semi-Supervised Learning 🏢 Southeast University

Unlocking Partial-Label Learning: A new study reveals surprisingly simple design principles for highly accurate algorithms, dramatically simplifying future research and boosting performance.

Unveiling LoRA Intrinsic Ranks via Salience Analysis

26 September 2024·1998 words·10 mins· loading · loading

Natural Language Processing Large Language Models 🏢 Southeast University

SalientLoRA unveils optimal LoRA ranks by analyzing rank salience via time-series analysis, improving fine-tuning efficiency and performance significantly.

SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion

26 September 2024·2883 words·14 mins· loading · loading

Multimodal Learning Vision-Language Models 🏢 Southeast University

SimVG: A simpler, faster visual grounding framework with decoupled multi-modal fusion, achieving state-of-the-art performance.

Prune and Repaint: Content-Aware Image Retargeting for any Ratio

26 September 2024·2137 words·11 mins· loading · loading

Computer Vision Image Generation 🏢 Southeast University

Prune and Repaint: A new content-aware method for superior image retargeting across any aspect ratio, preserving key features and avoiding artifacts.

LIVE: Learnable In-Context Vector for Visual Question Answering

26 September 2024·3429 words·17 mins· loading · loading

Natural Language Processing Question Answering 🏢 Southeast University

LIVE, a novel learnable in-context vector, significantly improves visual question answering by reducing computational costs and enhancing accuracy compared to traditional ICL methods.

Lever LM: Configuring In-Context Sequence to Lever Large Vision Language Models

26 September 2024·2923 words·14 mins· loading · loading

Multimodal Learning Vision-Language Models 🏢 Southeast University

Lever-LM configures effective in-context demonstrations for large vision-language models using a small language model, significantly improving their performance on visual question answering and image …

Generalization Analysis for Label-Specific Representation Learning

26 September 2024·269 words·2 mins· loading · loading

AI Theory Representation Learning 🏢 Southeast University

Researchers derived tighter generalization bounds for label-specific representation learning (LSRL) methods, improving understanding of LSRL’s success and offering guidance for future algorithm develo…

ControlSynth Neural ODEs: Modeling Dynamical Systems with Guaranteed Convergence

26 September 2024·2928 words·14 mins· loading · loading

Machine Learning Deep Learning 🏢 Southeast University

ControlSynth Neural ODEs (CSODEs) guarantee convergence in complex dynamical systems via tractable linear inequalities, improving neural ODE modeling.

Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms

26 September 2024·4509 words·22 mins· loading · loading

Multimodal Learning Vision-Language Models 🏢 Southeast University

This paper presents a novel method to align vision models with human aesthetics in image retrieval, using large language models (LLMs) for query rephrasing and preference-based reinforcement learning …