Skip to main content

🏢 Rochester Institute of Technology

What Variables Affect Out-of-Distribution Generalization in Pretrained Models?
·4187 words·20 mins· loading · loading
Computer Vision Representation Learning 🏢 Rochester Institute of Technology
High-resolution datasets with diverse classes significantly improve the transferability of pretrained DNNs by reducing representation compression and mitigating the ’tunnel effect.'
Visual Fourier Prompt Tuning
·4269 words·21 mins· loading · loading
Computer Vision Image Classification 🏢 Rochester Institute of Technology
Visual Fourier Prompt Tuning (VFPT) leverages the Fast Fourier Transform to seamlessly integrate spatial and frequency information for superior parameter-efficient vision model fine-tuning, even with …
On the Identifiability of Hybrid Deep Generative Models: Meta-Learning as a Solution
·1881 words·9 mins· loading · loading
Machine Learning Meta Learning 🏢 Rochester Institute of Technology
Meta-learning solves hybrid deep generative model unidentifiability!
Evidential Stochastic Differential Equations for Time-Aware Sequential Recommendation
·2228 words·11 mins· loading · loading
AI Applications Recommendation Systems 🏢 Rochester Institute of Technology
E-NSDE, a novel time-aware sequential recommendation model, integrates neural stochastic differential equations and evidential learning to improve recommendation accuracy by effectively handling varia…
Evidential Mixture Machines: Deciphering Multi-Label Correlations for Active Learning Sensitivity
·2512 words·12 mins· loading · loading
Machine Learning Active Learning 🏢 Rochester Institute of Technology
Evidential Mixture Machines (EMM) enhances multi-label active learning by deciphering label correlations for improved accuracy and uncertainty quantification in large, sparse label spaces.
Diffusion-Inspired Truncated Sampler for Text-Video Retrieval
·2366 words·12 mins· loading · loading
Multimodal Learning Cross-Modal Retrieval 🏢 Rochester Institute of Technology
Diffusion-Inspired Truncated Sampler (DITS) revolutionizes text-video retrieval by progressively aligning embeddings and enhancing CLIP embedding space structure, achieving state-of-the-art results.
Cooperative Hardware-Prompt Learning for Snapshot Compressive Imaging
·1775 words·9 mins· loading · loading
Computer Vision Image Generation 🏢 Rochester Institute of Technology
Federated Hardware-Prompt Learning (FedHP) enables robust cross-hardware SCI training by aligning inconsistent data distributions using a hardware-conditioned prompter, outperforming existing FL metho…
Be Confident in What You Know: Bayesian Parameter Efficient Fine-Tuning of Vision Foundation Models
·3138 words·15 mins· loading · loading
Computer Vision Few-Shot Learning 🏢 Rochester Institute of Technology
Bayesian-PEFT boosts vision model accuracy and confidence in few-shot learning by integrating Bayesian components into PEFT, solving the underconfidence problem.
Adaptive Important Region Selection with Reinforced Hierarchical Search for Dense Object Detection
·2760 words·13 mins· loading · loading
Computer Vision Object Detection 🏢 Rochester Institute of Technology
AIRS framework, guided by Evidential Q-learning, dynamically balances exploration and exploitation to achieve superior dense object detection accuracy by adaptively selecting important regions.