🏢 University of Oxford

Classical Planning with LLM-Generated Heuristics: Challenging the State of the Art with Python Code

24 March 2025·1748 words·9 mins· loading · loading

AI Generated 🤗 Daily Papers AI Applications Robotics 🏢 University of Oxford

LLMs generate Python heuristics for classical planning, outperforming traditional methods and challenging the state-of-the-art planning techniques.

Image as an IMU: Estimating Camera Motion from a Single Motion-Blurred Image

21 March 2025·2762 words·13 mins· loading · loading

AI Generated 🤗 Daily Papers Computer Vision 3D Vision 🏢 University of Oxford

Motion blur, usually a problem, is now a solution! This paper estimates camera motion from motion-blurred images, acting like an IMU.

VGGT: Visual Geometry Grounded Transformer

14 March 2025·3346 words·16 mins· loading · loading

AI Generated 🤗 Daily Papers Computer Vision 3D Vision 🏢 University of Oxford

VGGT: a fast, end-to-end transformer that infers complete 3D scene attributes from multiple views, outperforming optimization-based methods.

Mixture of Experts Made Intrinsically Interpretable

5 March 2025·3052 words·15 mins· loading · loading

AI Generated 🤗 Daily Papers AI Theory Interpretability 🏢 University of Oxford

MoE-X: An intrinsically interpretable Mixture-of-Experts language model that uses sparse, wide networks to enhance transparency.

LINGOLY-TOO: Disentangling Memorisation from Reasoning with Linguistic Templatisation and Orthographic Obfuscation

4 March 2025·4618 words·22 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 University of Oxford

LINGOLY-TOO: A new benchmark to disentangle memorization from reasoning in LLMs using linguistic templatization and orthographic obfuscation.

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

27 December 2024·3641 words·18 mins· loading · loading

AI Generated 🤗 Daily Papers Multimodal Learning Vision-Language Models 🏢 University of Oxford

OS-Genesis: Reverse task synthesis revolutionizes GUI agent training by generating high-quality trajectory data without human supervision, drastically boosting performance on challenging benchmarks.

Video Motion Transfer with Diffusion Transformers

10 December 2024·3141 words·15 mins· loading · loading

AI Generated 🤗 Daily Papers Computer Vision Video Understanding 🏢 University of Oxford

DiTFlow: training-free video motion transfer using Diffusion Transformers, enabling realistic motion control in synthesized videos via Attention Motion Flow.

Can sparse autoencoders be used to decompose and interpret steering vectors?

13 November 2024·2017 words·10 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 University of Oxford

Sparse autoencoders fail to accurately decompose and interpret steering vectors due to distribution mismatch and the inability to handle negative feature projections; this paper identifies these issue…

Ablation is Not Enough to Emulate DPO: How Neuron Dynamics Drive Toxicity Reduction

10 November 2024·2573 words·13 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 University of Oxford

Contrary to common belief, toxicity reduction in language models isn’t simply achieved by dampening toxic neurons; it’s a complex balancing act across multiple neuron groups.