↓Skip to main content

🏢 Princeton University

Effectively Controlling Reasoning Models through Thinking Intervention

31 March 2025·3981 words·19 mins· loading · loading

AI Generated 🤗 Daily Papers AI Theory Safety 🏢 Princeton University

Thinking Intervention offers a novel paradigm for controlling reasoning in LLMs, enabling fine-grained guidance and improvements in instruction-following and safety.

Attention IoU: Examining Biases in CelebA using Attention Maps

25 March 2025·3919 words·19 mins· loading · loading

AI Generated 🤗 Daily Papers Computer Vision Image Classification 🏢 Princeton University

Attention-IoU reveals model biases by analyzing attention maps, offering insights beyond dataset labels and improving debiasing techniques.

Temporal Consistency for LLM Reasoning Process Error Identification

18 March 2025·3234 words·16 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Princeton University

A new test-time method, Temporal Consistency, is introduced to improve LLM reasoning by leveraging iterative self-reflection.

ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates

10 February 2025·2360 words·12 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Princeton University

ReasonFlux boosts LLM mathematical reasoning by using hierarchical thought templates, outperforming top LLMs like OpenAI’s 01-preview and DeepSeek V3.

RLHS: Mitigating Misalignment in RLHF with Hindsight Simulation

15 January 2025·5724 words·27 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Princeton University

RLHS, a novel alignment algorithm, leverages simulated hindsight feedback to mitigate misalignment in RLHF, significantly improving AI’s alignment with human values and goals.

TidyBot++: An Open-Source Holonomic Mobile Manipulator for Robot Learning

11 December 2024·1675 words·8 mins· loading · loading

AI Generated 🤗 Daily Papers AI Applications Robotics 🏢 Princeton University

TidyBot++: Low-cost, open-source holonomic mobile base makes robot learning easier.