🏢 Georgia Tech

Robust Reinforcement Learning from Corrupted Human Feedback

26 September 2024·2358 words·12 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 Georgia Tech

R³M enhances reinforcement learning from human feedback by robustly handling corrupted preference labels, consistently learning the underlying reward and identifying outliers with minimal computationa…

RefDrop: Controllable Consistency in Image or Video Generation via Reference Feature Guidance

26 September 2024·4522 words·22 mins· loading · loading

AI Generated Computer Vision Image Generation 🏢 Georgia Tech

RefDrop: A training-free method enhances image and video generation consistency by directly controlling the influence of reference features on the diffusion process, enabling precise manipulation of c…

RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs

26 September 2024·2153 words·11 mins· loading · loading

Natural Language Processing Question Answering 🏢 Georgia Tech

RankRAG: One LLM, dual-purpose instruction-tuning for superior RAG!

Navigating the Safety Landscape: Measuring Risks in Finetuning Large Language Models

26 September 2024·3033 words·15 mins· loading · loading

AI Generated Natural Language Processing Large Language Models 🏢 Georgia Tech

Researchers discover ‘safety basins’ in LLMs, proposing a new metric (VISAGE) to quantify finetuning risks and visualize how these basins protect against safety compromise during model training.

Diffusion Spectral Representation for Reinforcement Learning

26 September 2024·1737 words·9 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 Georgia Tech

Diffusion Spectral Representation (Diff-SR) enables efficient reinforcement learning by extracting sufficient value function representations from diffusion models, bypassing slow sampling and facilita…