Posters
2024
FineCLIP: Self-distilled Region-based CLIP for Better Fine-grained Understanding
·2233 words·11 mins·
loading
·
loading
Multimodal Learning
Vision-Language Models
🏢 Gaoling School of Artificial Intelligence, Renmin University of China
FineCLIP boosts fine-grained image understanding by combining real-time self-distillation with semantically rich regional contrastive learning, significantly outperforming existing methods.
Fine-Tuning Personalization in Federated Learning to Mitigate Adversarial Clients
·1780 words·9 mins·
loading
·
loading
Machine Learning
Federated Learning
🏢 EPFL
Fine-tune personalization in federated learning to beat adversarial clients; collaboration level depends on data heterogeneity and adversary fraction.
Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
·3674 words·18 mins·
loading
·
loading
Multimodal Learning
Vision-Language Models
🏢 UC Berkeley
This paper presents a novel RL framework that fine-tunes large vision-language models (VLMs) to become effective decision-making agents. By incorporating chain-of-thought reasoning, the framework enab…
Fine-Tuning is Fine, if Calibrated
·4429 words·21 mins·
loading
·
loading
Machine Learning
Transfer Learning
🏢 Ohio State University
Fine-tuning pre-trained models often degrades performance on unseen classes. This work reveals that the problem stems from logit scale discrepancies, not feature loss, and shows that post-processing c…
Fine-grained Image-to-LiDAR Contrastive Distillation with Visual Foundation Models
·4174 words·20 mins·
loading
·
loading
AI Generated
Computer Vision
3D Vision
🏢 City University of Hong Kong
OLIVINE uses visual foundation models for fine-grained image-to-LiDAR contrastive distillation, mitigating self-conflict issues and improving 3D representation learning.
Fine-Grained Dynamic Framework for Bias-Variance Joint Optimization on Data Missing Not at Random
·1408 words·7 mins·
loading
·
loading
Machine Learning
Deep Learning
🏢 MYbank, Ant Group
A new fine-grained dynamic framework jointly optimizes bias and variance for accurate predictions from missing-not-at-random data, surpassing existing methods.
Fine-grained Control of Generative Data Augmentation in IoT Sensing
·2239 words·11 mins·
loading
·
loading
AI Applications
Healthcare
🏢 University of Illinois Urbana-Champaign
Fine-grained control is added to generative models for IoT sensing data augmentation, tailoring synthetic data to specific application needs by leveraging domain expertise and statistical metrics of s…
Fine-grained Analysis of In-context Linear Estimation: Data, Architecture, and Beyond
·1351 words·7 mins·
loading
·
loading
Natural Language Processing
Large Language Models
🏢 University of Michigan
Researchers crack the code of in-context learning in Transformers, revealing how architecture, low-rank parameters, and data correlations influence model optimization and generalization.
Finding NeMo: Localizing Neurons Responsible For Memorization in Diffusion Models
·3454 words·17 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 German Research Center for Artificial Intelligence
NEMO pinpoints & deactivates neurons memorizing training data in diffusion models, boosting privacy & image diversity.
Finding good policies in average-reward Markov Decision Processes without prior knowledge
·1896 words·9 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
🏢 Inria
First near-optimal reinforcement learning algorithm achieving best policy identification in average-reward MDPs without prior knowledge of complexity.
FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making
·3232 words·16 mins·
loading
·
loading
AI Generated
AI Applications
Finance
🏢 Harvard University
FINCON: an LLM-based multi-agent system uses conceptual verbal reinforcement for superior financial decision-making, generalizing well across various tasks.
FINALLY: fast and universal speech enhancement with studio-like quality
·2546 words·12 mins·
loading
·
loading
Speech and Audio
Audio Enhancement
🏢 Samsung Research
FINALLY achieves studio-like speech enhancement speed and quality using a novel GAN-based approach with WavLM-integrated perceptual loss, outperforming existing diffusion models.
FilterNet: Harnessing Frequency Filters for Time Series Forecasting
·2439 words·12 mins·
loading
·
loading
Machine Learning
Deep Learning
🏢 University of Oxford
FilterNet: A novel deep learning architecture using learnable frequency filters for superior time series forecasting accuracy and efficiency.
Fight Back Against Jailbreaking via Prompt Adversarial Tuning
·2100 words·10 mins·
loading
·
loading
Natural Language Processing
Large Language Models
🏢 Peking University
Prompt Adversarial Tuning (PAT) defends against LLM jailbreaking by training a protective prompt prefix. PAT uses adversarial and benign prompts to optimize this prefix, significantly reducing succes…
FIFO-Diffusion: Generating Infinite Videos from Text without Training
·3112 words·15 mins·
loading
·
loading
Computer Vision
Video Understanding
🏢 Seoul National University
FIFO-Diffusion generates infinitely long, high-quality videos from text prompts using a pretrained model, solving the challenge of long video generation without retraining.
FIDE: Frequency-Inflated Conditional Diffusion Model for Extreme-Aware Time Series Generation
·2091 words·10 mins·
loading
·
loading
Machine Learning
Generative Learning
🏢 University of Michigan
FIDE, a novel conditional diffusion model, accurately generates time series by inflating high-frequency components, preserving extreme value distributions.
FIARSE: Model-Heterogeneous Federated Learning via Importance-Aware Submodel Extraction
·2079 words·10 mins·
loading
·
loading
Machine Learning
Federated Learning
🏢 Purdue University
FIARSE dynamically optimizes submodels in federated learning based on parameter importance, improving efficiency and global model accuracy.
FFAM: Feature Factorization Activation Map for Explanation of 3D Detectors
·2339 words·11 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 School of Computer Science and Engineering, Sun Yat-Sen University
FFAM uses feature factorization and gradient weighting to produce high-quality visual explanations for 3D object detectors, improving model interpretability and trust.
FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training
·2255 words·11 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 University of Amsterdam
FewViewGS: A novel method for high-quality novel view synthesis from sparse images using a multi-stage training scheme and a new locality-preserving regularization for 3D Gaussians.
Few-Shot Task Learning through Inverse Generative Modeling
·2587 words·13 mins·
loading
·
loading
Machine Learning
Few-Shot Learning
🏢 MIT
Few-shot task learning through inverse generative modeling (FTL-IGM) enables AI agents to quickly master new tasks from minimal data by leveraging invertible generative models.