↓Skip to main content

🏢 Georgia Institute of Technology

Language Models can Self-Improve at State-Value Estimation for Better Search

4 March 2025·2765 words·13 mins· loading · loading

AI Generated 🤗 Daily Papers Machine Learning Reinforcement Learning 🏢 Georgia Institute of Technology

Self-Taught Lookahead improves LLM search via self-supervision, matching costly methods at a fraction of the compute!

Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation

29 January 2025·3468 words·17 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Georgia Institute of Technology

Virus: A new attack method easily bypasses LLM guardrails, highlighting the inadequacy of current safety measures and urging for more robust solutions.

Large Language Models Think Too Fast To Explore Effectively

29 January 2025·3497 words·17 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Georgia Institute of Technology

Large language models underperform humans in open-ended exploration due to prioritizing immediate choices over long-term strategic thinking, but innovative models show promise.

Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders

12 December 2024·6779 words·32 mins· loading · loading

AI Generated 🤗 Daily Papers Computer Vision Video Understanding 🏢 Georgia Institute of Technology

Gaze-LLE achieves state-of-the-art gaze estimation by using a frozen DINOv2 encoder and a lightweight decoder, simplifying architecture and improving efficiency.