🏢 Rochester Institute of Technology
What Variables Affect Out-of-Distribution Generalization in Pretrained Models?
·4187 words·20 mins·
loading
·
loading
Computer Vision
Representation Learning
🏢 Rochester Institute of Technology
High-resolution datasets with diverse classes significantly improve the transferability of pretrained DNNs by reducing representation compression and mitigating the ’tunnel effect.'
Visual Fourier Prompt Tuning
·4269 words·21 mins·
loading
·
loading
Computer Vision
Image Classification
🏢 Rochester Institute of Technology
Visual Fourier Prompt Tuning (VFPT) leverages the Fast Fourier Transform to seamlessly integrate spatial and frequency information for superior parameter-efficient vision model fine-tuning, even with …
On the Identifiability of Hybrid Deep Generative Models: Meta-Learning as a Solution
·1881 words·9 mins·
loading
·
loading
Machine Learning
Meta Learning
🏢 Rochester Institute of Technology
Meta-learning solves hybrid deep generative model unidentifiability!
Evidential Stochastic Differential Equations for Time-Aware Sequential Recommendation
·2228 words·11 mins·
loading
·
loading
AI Applications
Recommendation Systems
🏢 Rochester Institute of Technology
E-NSDE, a novel time-aware sequential recommendation model, integrates neural stochastic differential equations and evidential learning to improve recommendation accuracy by effectively handling varia…
Evidential Mixture Machines: Deciphering Multi-Label Correlations for Active Learning Sensitivity
·2512 words·12 mins·
loading
·
loading
Machine Learning
Active Learning
🏢 Rochester Institute of Technology
Evidential Mixture Machines (EMM) enhances multi-label active learning by deciphering label correlations for improved accuracy and uncertainty quantification in large, sparse label spaces.
Diffusion-Inspired Truncated Sampler for Text-Video Retrieval
·2366 words·12 mins·
loading
·
loading
Multimodal Learning
Cross-Modal Retrieval
🏢 Rochester Institute of Technology
Diffusion-Inspired Truncated Sampler (DITS) revolutionizes text-video retrieval by progressively aligning embeddings and enhancing CLIP embedding space structure, achieving state-of-the-art results.
Cooperative Hardware-Prompt Learning for Snapshot Compressive Imaging
·1775 words·9 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 Rochester Institute of Technology
Federated Hardware-Prompt Learning (FedHP) enables robust cross-hardware SCI training by aligning inconsistent data distributions using a hardware-conditioned prompter, outperforming existing FL metho…
Be Confident in What You Know: Bayesian Parameter Efficient Fine-Tuning of Vision Foundation Models
·3138 words·15 mins·
loading
·
loading
Computer Vision
Few-Shot Learning
🏢 Rochester Institute of Technology
Bayesian-PEFT boosts vision model accuracy and confidence in few-shot learning by integrating Bayesian components into PEFT, solving the underconfidence problem.
Adaptive Important Region Selection with Reinforced Hierarchical Search for Dense Object Detection
·2760 words·13 mins·
loading
·
loading
Computer Vision
Object Detection
🏢 Rochester Institute of Technology
AIRS framework, guided by Evidential Q-learning, dynamically balances exploration and exploitation to achieve superior dense object detection accuracy by adaptively selecting important regions.