Skip to main content

🏢 Georgia Tech

Robust Reinforcement Learning from Corrupted Human Feedback
·2358 words·12 mins· loading · loading
AI Generated Machine Learning Reinforcement Learning 🏢 Georgia Tech
R³M enhances reinforcement learning from human feedback by robustly handling corrupted preference labels, consistently learning the underlying reward and identifying outliers with minimal computationa…
RefDrop: Controllable Consistency in Image or Video Generation via Reference Feature Guidance
·4522 words·22 mins· loading · loading
AI Generated Computer Vision Image Generation 🏢 Georgia Tech
RefDrop: A training-free method enhances image and video generation consistency by directly controlling the influence of reference features on the diffusion process, enabling precise manipulation of c…
RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs
·2153 words·11 mins· loading · loading
Natural Language Processing Question Answering 🏢 Georgia Tech
RankRAG: One LLM, dual-purpose instruction-tuning for superior RAG!
Navigating the Safety Landscape: Measuring Risks in Finetuning Large Language Models
·3033 words·15 mins· loading · loading
AI Generated Natural Language Processing Large Language Models 🏢 Georgia Tech
Researchers discover ‘safety basins’ in LLMs, proposing a new metric (VISAGE) to quantify finetuning risks and visualize how these basins protect against safety compromise during model training.
Diffusion Spectral Representation for Reinforcement Learning
·1737 words·9 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 Georgia Tech
Diffusion Spectral Representation (Diff-SR) enables efficient reinforcement learning by extracting sufficient value function representations from diffusion models, bypassing slow sampling and facilita…