🏢 Rochester Institute of Technology

What Variables Affect Out-of-Distribution Generalization in Pretrained Models?

26 September 2024·4187 words·20 mins

Computer Vision Representation Learning 🏢 Rochester Institute of Technology

High-resolution datasets with diverse classes significantly improve the transferability of pretrained DNNs by reducing representation compression and mitigating the 'tunnel effect.'

Visual Fourier Prompt Tuning

26 September 2024·4269 words·21 mins

Computer Vision Image Classification 🏢 Rochester Institute of Technology

Visual Fourier Prompt Tuning (VFPT) leverages the Fast Fourier Transform to seamlessly integrate spatial and frequency information for superior parameter-efficient vision model fine-tuning, even with …

On the Identifiability of Hybrid Deep Generative Models: Meta-Learning as a Solution

26 September 2024·1881 words·9 mins

Machine Learning Meta Learning 🏢 Rochester Institute of Technology

Meta-learning is proposed as a solution to the unidentifiability of hybrid deep generative models.

Evidential Stochastic Differential Equations for Time-Aware Sequential Recommendation

26 September 2024·2228 words·11 mins

AI Applications Recommendation Systems 🏢 Rochester Institute of Technology

E-NSDE, a novel time-aware sequential recommendation model, integrates neural stochastic differential equations and evidential learning to improve recommendation accuracy by effectively handling varia…

Evidential Mixture Machines: Deciphering Multi-Label Correlations for Active Learning Sensitivity

26 September 2024·2512 words·12 mins

Machine Learning Active Learning 🏢 Rochester Institute of Technology

Evidential Mixture Machines (EMM) enhances multi-label active learning by deciphering label correlations for improved accuracy and uncertainty quantification in large, sparse label spaces.

Diffusion-Inspired Truncated Sampler for Text-Video Retrieval

26 September 2024·2366 words·12 mins

Multimodal Learning Cross-Modal Retrieval 🏢 Rochester Institute of Technology

Diffusion-Inspired Truncated Sampler (DITS) improves text-video retrieval by progressively aligning embeddings and enhancing the structure of the CLIP embedding space, achieving state-of-the-art results.

Cooperative Hardware-Prompt Learning for Snapshot Compressive Imaging

26 September 2024·1775 words·9 mins

Computer Vision Image Generation 🏢 Rochester Institute of Technology

Federated Hardware-Prompt Learning (FedHP) enables robust cross-hardware SCI training by aligning inconsistent data distributions using a hardware-conditioned prompter, outperforming existing FL metho…

Be Confident in What You Know: Bayesian Parameter Efficient Fine-Tuning of Vision Foundation Models

26 September 2024·3138 words·15 mins

Computer Vision Few-Shot Learning 🏢 Rochester Institute of Technology

Bayesian-PEFT boosts vision model accuracy and confidence in few-shot learning by integrating Bayesian components into PEFT, solving the underconfidence problem.

Adaptive Important Region Selection with Reinforced Hierarchical Search for Dense Object Detection

26 September 2024·2760 words·13 mins

Computer Vision Object Detection 🏢 Rochester Institute of Technology

The AIRS framework, guided by Evidential Q-learning, dynamically balances exploration and exploitation to achieve superior dense object detection accuracy by adaptively selecting important regions.