Skip to main content

🏢 Southeast University

What Makes Partial-Label Learning Algorithms Effective?
·1666 words·8 mins· loading · loading
Machine Learning Semi-Supervised Learning 🏢 Southeast University
Unlocking Partial-Label Learning: A new study reveals surprisingly simple design principles for highly accurate algorithms, dramatically simplifying future research and boosting performance.
Unveiling LoRA Intrinsic Ranks via Salience Analysis
·1998 words·10 mins· loading · loading
Natural Language Processing Large Language Models 🏢 Southeast University
SalientLoRA unveils optimal LoRA ranks by analyzing rank salience via time-series analysis, improving fine-tuning efficiency and performance significantly.
SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion
·2883 words·14 mins· loading · loading
Multimodal Learning Vision-Language Models 🏢 Southeast University
SimVG: A simpler, faster visual grounding framework with decoupled multi-modal fusion, achieving state-of-the-art performance.
Prune and Repaint: Content-Aware Image Retargeting for any Ratio
·2137 words·11 mins· loading · loading
Computer Vision Image Generation 🏢 Southeast University
Prune and Repaint: A new content-aware method for superior image retargeting across any aspect ratio, preserving key features and avoiding artifacts.
LIVE: Learnable In-Context Vector for Visual Question Answering
·3429 words·17 mins· loading · loading
Natural Language Processing Question Answering 🏢 Southeast University
LIVE, a novel learnable in-context vector, significantly improves visual question answering by reducing computational costs and enhancing accuracy compared to traditional ICL methods.
Lever LM: Configuring In-Context Sequence to Lever Large Vision Language Models
·2923 words·14 mins· loading · loading
Multimodal Learning Vision-Language Models 🏢 Southeast University
Lever-LM configures effective in-context demonstrations for large vision-language models using a small language model, significantly improving their performance on visual question answering and image …
Generalization Analysis for Label-Specific Representation Learning
·269 words·2 mins· loading · loading
AI Theory Representation Learning 🏢 Southeast University
Researchers derived tighter generalization bounds for label-specific representation learning (LSRL) methods, improving understanding of LSRL’s success and offering guidance for future algorithm develo…
ControlSynth Neural ODEs: Modeling Dynamical Systems with Guaranteed Convergence
·2928 words·14 mins· loading · loading
Machine Learning Deep Learning 🏢 Southeast University
ControlSynth Neural ODEs (CSODEs) guarantee convergence in complex dynamical systems via tractable linear inequalities, improving neural ODE modeling.
Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms
·4509 words·22 mins· loading · loading
Multimodal Learning Vision-Language Models 🏢 Southeast University
This paper presents a novel method to align vision models with human aesthetics in image retrieval, using large language models (LLMs) for query rephrasing and preference-based reinforcement learning …