🏢 University of Oxford

No Regrets: Investigating and Improving Regret Approximations for Curriculum Discovery

26 September 2024·4811 words·23 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 University of Oxford

AI agents learn better with well-designed training environments. This paper reveals flaws in current environment-selection methods and introduces Sampling for Learnability (SFL), a new approach that …

No 'Zero-Shot' Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance

26 September 2024·6344 words·30 mins· loading · loading

AI Generated Multimodal Learning Vision-Language Models 🏢 University of Oxford

Multimodal models’ impressive ‘zero-shot’ performance hinges on the frequency of concepts in their training data, not inherent generalization ability; exponentially more data is needed for linear impr…

Metric Flow Matching for Smooth Interpolations on the Data Manifold

26 September 2024·2425 words·12 mins· loading · loading

AI Generated Machine Learning Deep Learning 🏢 University of Oxford

METRIC FLOW MATCHING (MFM) generates smooth interpolations on data manifolds by minimizing kinetic energy, outperforming Euclidean methods and achieving state-of-the-art results in single-cell traject…

Marginal Causal Flows for Validation and Inference

26 September 2024·1827 words·9 mins· loading · loading

AI Theory Causality 🏢 University of Oxford

Frugal Flows: Generate realistic causal benchmarks with exact marginal causal effects, enabling robust causal method validation.

LoCo: Learning 3D Location-Consistent Image Features with a Memory-Efficient Ranking Loss

26 September 2024·1960 words·10 mins· loading · loading

Computer Vision 3D Vision 🏢 University of Oxford

LoCo: Memory-efficient location-consistent image features learned via a novel ranking loss, enabling three orders of magnitude memory improvement and outperforming state-of-the-art.

Learning Segmentation from Point Trajectories

26 September 2024·2329 words·11 mins· loading · loading

Image Segmentation 🏢 University of Oxford

This paper introduces a novel unsupervised video object segmentation method using long-term point trajectories and optical flow, outperforming prior art by effectively combining sparse, long-term moti…

Learning on Large Graphs using Intersecting Communities

26 September 2024·2286 words·11 mins· loading · loading

Machine Learning Semi-Supervised Learning 🏢 University of Oxford

Learn on massive graphs efficiently using Intersecting Community Graphs (ICGs)! This method approximates large graphs with ICGs, enabling linear time/memory complexity for node classification.

Language Models as Hierarchy Encoders

26 September 2024·2232 words·11 mins· loading · loading

AI Generated Natural Language Processing Large Language Models 🏢 University of Oxford

Language models struggle with hierarchical information. This work introduces Hierarchy Transformer Encoders (HITs), a novel method to retrain transformer encoders using hyperbolic geometry and special…

Label Delay in Online Continual Learning

26 September 2024·4705 words·23 mins· loading · loading

AI Generated Machine Learning Continual Learning 🏢 University of Oxford

Bridging the accuracy gap in online continual learning caused by label delays, a new framework with Importance Weighted Memory Sampling prioritizes relevant memory samples, significantly outperforming…

Interventionally Consistent Surrogates for Complex Simulation Models

26 September 2024·1862 words·9 mins· loading · loading

AI Generated AI Theory Causality 🏢 University of Oxford

This paper introduces a novel framework for creating interventionally consistent surrogate models for complex simulations, addressing computational limitations and ensuring accurate policy evaluation.

Interpreting Learned Feedback Patterns in Large Language Models

26 September 2024·2900 words·14 mins· loading · loading

Natural Language Processing Large Language Models 🏢 University of Oxford

Researchers developed methods to measure and interpret the divergence between learned feedback patterns (LFPs) in LLMs and human preferences, helping minimize discrepancies between LLM behavior and tr…

Improved learning rates in multi-unit uniform price auctions

26 September 2024·442 words·3 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 University of Oxford

New modeling of bid space in multi-unit uniform price auctions achieves regret of Õ(K4/3T2/3) under bandit feedback, improving over prior work and closing the gap with discriminatory pricing.

G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-training

26 September 2024·2099 words·10 mins· loading · loading

Multimodal Learning Vision-Language Models 🏢 University of Oxford

G2D: a novel medical VLP framework achieves superior performance in medical image analysis by simultaneously learning global and dense visual features using image-text pairs without extra annotations.

FilterNet: Harnessing Frequency Filters for Time Series Forecasting

26 September 2024·2439 words·12 mins· loading · loading

Machine Learning Deep Learning 🏢 University of Oxford

FilterNet: A novel deep learning architecture using learnable frequency filters for superior time series forecasting accuracy and efficiency.

Ensemble sampling for linear bandits: small ensembles suffice

26 September 2024·310 words·2 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 University of Oxford

Small ensembles in stochastic linear bandits achieve near-optimal regret; a rigorous analysis shows that ensemble size need only scale logarithmically with horizon.

Direct3D: Scalable Image-to-3D Generation via 3D Latent Diffusion Transformer

26 September 2024·2139 words·11 mins· loading · loading

Computer Vision 3D Vision 🏢 University of Oxford

Direct3D: Revolutionizing image-to-3D generation with a scalable, native 3D diffusion model achieving state-of-the-art quality.

Deep Bayesian Active Learning for Preference Modeling in Large Language Models

26 September 2024·2339 words·11 mins· loading · loading

Natural Language Processing Large Language Models 🏢 University of Oxford

BAL-PM, a novel active learning approach, drastically reduces human feedback in LLM preference modeling by leveraging both model uncertainty and prompt distribution diversity, achieving 33%-68% fewer …

CountGD: Multi-Modal Open-World Counting

26 September 2024·2520 words·12 mins· loading · loading

Computer Vision Object Detection 🏢 University of Oxford

COUNTGD: A new multi-modal model counts objects in images using text or visual examples, significantly improving open-world counting accuracy.

Can Learned Optimization Make Reinforcement Learning Less Difficult?

26 September 2024·3614 words·17 mins· loading · loading

Reinforcement Learning 🏢 University of Oxford

Learned optimizer OPEN tackles RL’s non-stationarity, plasticity loss, and exploration using meta-learning, significantly outperforming traditional and other learned optimizers.

Can Large Language Model Agents Simulate Human Trust Behavior?

26 September 2024·3567 words·17 mins· loading · loading

Natural Language Processing Large Language Models 🏢 University of Oxford

LLM agents surprisingly exhibit human-like trust behavior, especially GPT-4, paving the way for simulating complex human interactions in various applications.