Skip to main content

Posters

2024

FineCLIP: Self-distilled Region-based CLIP for Better Fine-grained Understanding
·2233 words·11 mins· loading · loading
Multimodal Learning Vision-Language Models 🏢 Gaoling School of Artificial Intelligence, Renmin University of China
FineCLIP boosts fine-grained image understanding by combining real-time self-distillation with semantically rich regional contrastive learning, significantly outperforming existing methods.
Fine-Tuning Personalization in Federated Learning to Mitigate Adversarial Clients
·1780 words·9 mins· loading · loading
Machine Learning Federated Learning 🏢 EPFL
Fine-tune personalization in federated learning to beat adversarial clients; collaboration level depends on data heterogeneity and adversary fraction.
Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
·3674 words·18 mins· loading · loading
Multimodal Learning Vision-Language Models 🏢 UC Berkeley
This paper presents a novel RL framework that fine-tunes large vision-language models (VLMs) to become effective decision-making agents. By incorporating chain-of-thought reasoning, the framework enab…
Fine-Tuning is Fine, if Calibrated
·4429 words·21 mins· loading · loading
Machine Learning Transfer Learning 🏢 Ohio State University
Fine-tuning pre-trained models often degrades performance on unseen classes. This work reveals that the problem stems from logit scale discrepancies, not feature loss, and shows that post-processing c…
Fine-grained Image-to-LiDAR Contrastive Distillation with Visual Foundation Models
·4174 words·20 mins· loading · loading
AI Generated Computer Vision 3D Vision 🏢 City University of Hong Kong
OLIVINE uses visual foundation models for fine-grained image-to-LiDAR contrastive distillation, mitigating self-conflict issues and improving 3D representation learning.
Fine-Grained Dynamic Framework for Bias-Variance Joint Optimization on Data Missing Not at Random
·1408 words·7 mins· loading · loading
Machine Learning Deep Learning 🏢 MYbank, Ant Group
A new fine-grained dynamic framework jointly optimizes bias and variance for accurate predictions from missing-not-at-random data, surpassing existing methods.
Fine-grained Control of Generative Data Augmentation in IoT Sensing
·2239 words·11 mins· loading · loading
AI Applications Healthcare 🏢 University of Illinois Urbana-Champaign
Fine-grained control is added to generative models for IoT sensing data augmentation, tailoring synthetic data to specific application needs by leveraging domain expertise and statistical metrics of s…
Fine-grained Analysis of In-context Linear Estimation: Data, Architecture, and Beyond
·1351 words·7 mins· loading · loading
Natural Language Processing Large Language Models 🏢 University of Michigan
Researchers crack the code of in-context learning in Transformers, revealing how architecture, low-rank parameters, and data correlations influence model optimization and generalization.
Finding NeMo: Localizing Neurons Responsible For Memorization in Diffusion Models
·3454 words·17 mins· loading · loading
Computer Vision Image Generation 🏢 German Research Center for Artificial Intelligence
NEMO pinpoints & deactivates neurons memorizing training data in diffusion models, boosting privacy & image diversity.
Finding good policies in average-reward Markov Decision Processes without prior knowledge
·1896 words·9 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 Inria
First near-optimal reinforcement learning algorithm achieving best policy identification in average-reward MDPs without prior knowledge of complexity.
FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making
·3232 words·16 mins· loading · loading
AI Generated AI Applications Finance 🏢 Harvard University
FINCON: an LLM-based multi-agent system uses conceptual verbal reinforcement for superior financial decision-making, generalizing well across various tasks.
FINALLY: fast and universal speech enhancement with studio-like quality
·2546 words·12 mins· loading · loading
Speech and Audio Audio Enhancement 🏢 Samsung Research
FINALLY achieves studio-like speech enhancement speed and quality using a novel GAN-based approach with WavLM-integrated perceptual loss, outperforming existing diffusion models.
FilterNet: Harnessing Frequency Filters for Time Series Forecasting
·2439 words·12 mins· loading · loading
Machine Learning Deep Learning 🏢 University of Oxford
FilterNet: A novel deep learning architecture using learnable frequency filters for superior time series forecasting accuracy and efficiency.
Fight Back Against Jailbreaking via Prompt Adversarial Tuning
·2100 words·10 mins· loading · loading
Natural Language Processing Large Language Models 🏢 Peking University
Prompt Adversarial Tuning (PAT) defends against LLM jailbreaking by training a protective prompt prefix. PAT uses adversarial and benign prompts to optimize this prefix, significantly reducing succes…
FIFO-Diffusion: Generating Infinite Videos from Text without Training
·3112 words·15 mins· loading · loading
Computer Vision Video Understanding 🏢 Seoul National University
FIFO-Diffusion generates infinitely long, high-quality videos from text prompts using a pretrained model, solving the challenge of long video generation without retraining.
FIDE: Frequency-Inflated Conditional Diffusion Model for Extreme-Aware Time Series Generation
·2091 words·10 mins· loading · loading
Machine Learning Generative Learning 🏢 University of Michigan
FIDE, a novel conditional diffusion model, accurately generates time series by inflating high-frequency components, preserving extreme value distributions.
FIARSE: Model-Heterogeneous Federated Learning via Importance-Aware Submodel Extraction
·2079 words·10 mins· loading · loading
Machine Learning Federated Learning 🏢 Purdue University
FIARSE dynamically optimizes submodels in federated learning based on parameter importance, improving efficiency and global model accuracy.
FFAM: Feature Factorization Activation Map for Explanation of 3D Detectors
·2339 words·11 mins· loading · loading
Computer Vision 3D Vision 🏢 School of Computer Science and Engineering, Sun Yat-Sen University
FFAM uses feature factorization and gradient weighting to produce high-quality visual explanations for 3D object detectors, improving model interpretability and trust.
FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training
·2255 words·11 mins· loading · loading
Computer Vision 3D Vision 🏢 University of Amsterdam
FewViewGS: A novel method for high-quality novel view synthesis from sparse images using a multi-stage training scheme and a new locality-preserving regularization for 3D Gaussians.
Few-Shot Task Learning through Inverse Generative Modeling
·2587 words·13 mins· loading · loading
Machine Learning Few-Shot Learning 🏢 MIT
Few-shot task learning through inverse generative modeling (FTL-IGM) enables AI agents to quickly master new tasks from minimal data by leveraging invertible generative models.